cuDLA input tensor and output tensor data layout, quantization, and dequantization issues #192

Open
jacker-whh opened this issue Mar 17, 2023 · 0 comments

Comments

@jacker-whh

I ran the following command:
./cuDLAStandaloneMode ../model_test/joint_model_cygnus_simple_batch1_512960_230220_dla_opt_sub_1.cudla 2

The program printed the following tensor descriptors:
Printing input tensor descriptor
TENSOR NAME : input'
size: 1966080
dims: [1, 4, 512, 960]
data fmt: 2
data type: 4
data category: 0
pixel fmt: 12
pixel mapping: 0
stride[0]: 1
stride[1]: 3840
stride[2]: 0
stride[3]: 0
-----------------------------------------tensor input:0
Printing output tensor descriptor
TENSOR NAME : M0_519'
size: 131072
dims: [1, 64, 32, 60]
data fmt: 1
data type: 4
data category: 3
pixel fmt: 0
pixel mapping: 0
stride[0]: 1
stride[1]: 64
stride[2]: 2048
stride[3]: 2048
-----------------------------------------tensor output:0

The input tensor's format is CUDLA_DATA_FORMAT_NHWC and its type is int8.
My input is an NCHW float tensor, so I need to get the quantization parameters, quantize the data, and write it into the input tensor in CUDLA_DATA_FORMAT_NHWC format. I don't quite understand what the stride values mean.
Where can I get the quantization parameters of the tensor?
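
For reference, here is a minimal sketch of the conversion I think is needed, written in plain Python so the indexing is easy to follow. It rests on several assumptions that the descriptor does not confirm: quantization is symmetric with a zero point of 0, there is a single per-tensor `scale` (a placeholder value, not something read from the loadable), and stride[1] is the row pitch in bytes (3840 = 960 × 4, which matches densely packed 4-channel int8 pixels). The function name and parameters are hypothetical.

```python
def quantize_nchw_to_nhwc_int8(src, c, h, w, scale, row_pitch):
    """Quantize one NCHW float image (batch of 1) into an NHWC int8 buffer.

    src       -- flat sequence of floats in C, H, W order
    scale     -- hypothetical per-tensor scale (not taken from the loadable)
    row_pitch -- bytes per image row in the destination, at least w * c
    """
    if row_pitch < w * c:
        raise ValueError("row_pitch is smaller than one packed row")
    dst = bytearray(h * row_pitch)  # any row padding stays zero
    for ch in range(c):
        for y in range(h):
            for x in range(w):
                value = src[(ch * h + y) * w + x]
                # Symmetric quantization: zero point assumed to be 0.
                q = max(-128, min(127, round(value / scale)))
                # Store the signed value as its two's-complement byte,
                # with channels interleaved per pixel (NHWC).
                dst[y * row_pitch + x * c + ch] = q & 0xFF
    return dst
```

For the input descriptor above the call would be `quantize_nchw_to_nhwc_int8(data, 4, 512, 960, scale, 3840)`, which yields 512 × 3840 = 1966080 bytes, the same as the reported size. The scale itself is still the open question.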

The output tensor is int8 in CUDLA_DATA_FORMAT_NCHW format. How should I dequantize it to float32 and convert it to a plain NCHW layout?
I couldn't find the scale or zero point for either tensor in the model's tensor parameters.
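
For the output side, this is a minimal sketch of the dequantization I have in mind, again in plain Python. The assumptions are mine and not confirmed by the descriptor: stride[1] = 64 is the row pitch in bytes (rows of 60 values padded to 64), stride[2] = 2048 is the pitch between channel planes (64 × 32), the zero point is 0, and `scale` is a placeholder per-tensor value. The function name and parameters are hypothetical.

```python
def dequantize_strided_nchw_int8(buf, c, h, w, scale, row_pitch, plane_pitch):
    """Dequantize a padded NCHW int8 buffer into a dense NCHW float list.

    buf         -- bytes-like object holding the raw output tensor
    scale       -- hypothetical per-tensor scale (not taken from the loadable)
    row_pitch   -- bytes between consecutive rows (assumed stride[1])
    plane_pitch -- bytes between consecutive channels (assumed stride[2])
    """
    if len(buf) < c * plane_pitch:
        raise ValueError("buffer is smaller than c * plane_pitch")
    out = [0.0] * (c * h * w)
    for ch in range(c):
        for y in range(h):
            base = ch * plane_pitch + y * row_pitch
            for x in range(w):
                byte = buf[base + x]
                # Reinterpret the unsigned byte as a signed int8.
                q = byte - 256 if byte > 127 else byte
                # Symmetric dequantization: zero point assumed to be 0.
                out[(ch * h + y) * w + x] = q * scale
    return out
```

With the output descriptor above the call would be `dequantize_strided_nchw_int8(raw, 64, 32, 60, scale, 64, 2048)`. Under this reading the buffer spans 64 × 2048 = 131072 bytes, which equals the reported size, so the padding interpretation is at least arithmetically consistent.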
