Add a feature of UniformQDQ - support CV/NLP model's OPs, includingConv, DepthwiseConv2D, MatMul, etc. Additional op support be added upon request. #2155

qgao007 · 2025-03-27T22:47:00Z

Type of Change

A feature of UniformQDQ

Description

A feature of UniformQDQ

support CV/NLP model's OPs, includingConv, DepthwiseConv2D, MatMul, etc. Additional op support be added upon request.

Expected Behavior & Potential Risk

Generate a UniformQDQ int8 model. No foreseeable risk.

How has this PR been tested?

This PR is tested on selected models with representative ops from image-classfication, object detection, and nlp.

Dependency Change?

Need Tensorflow with qunit8-support.
https://github.com/intel-innersource/frameworks.ai.tensorflow.private-tensorflow/tree/mabuzain/add-quint8-support

Signed-off-by: zehao-intel <[email protected]>

Signed-off-by: Qun Gao <[email protected]>

Signed-off-by: Gao, Qun <[email protected]>

for more information, see https://pre-commit.ci

thuang6 · 2025-03-31T13:54:18Z

one general question, I saw the changes in both 2.x and 3.x folders, so this PR enables UniformQDQ supported in both 2.x and 3.x API?

qgao007 · 2025-04-01T17:28:17Z

one general question, I saw the changes in both 2.x and 3.x folders, so this PR enables UniformQDQ supported in both 2.x and 3.x API?

Thanks @thuang6 for the question! We ran it in new pythonic 3.x API, assuming compatibility with 2.x. We will confirm this shortly with @nhatleSummer22

qgao007 · 2025-04-01T21:54:19Z

loop in @ashahba and @mahmoud-abuzaina also.

thuang6 · 2025-04-03T02:55:01Z

one general question, I saw the changes in both 2.x and 3.x folders, so this PR enables UniformQDQ supported in both 2.x and 3.x API?

Thanks @thuang6 for the question! We ran it in new pythonic 3.x API, assuming compatibility with 2.x. We will confirm this shortly with @nhatleSummer22

new feature is only expected to 3.x API.

Please note, 3.x has separated packaging for each framework, TF 3.x packing is listed below, which only contains common & tensorflow folder under neural_compressor. If the function relies on code in other 2.x folders, it does not work.

"neural_compressor_tf": {
"project_name": "neural_compressor_tf",
"include_packages": find_packages(
include=[
"neural_compressor.common",
"neural_compressor.common.",
"neural_compressor.tensorflow",
"neural_compressor.tensorflow.",
],
),
"package_data": {"": ["*.yaml"]},
"install_requires": fetch_requirements("requirements_tf.txt"),
},

mahmoud-abuzaina · 2025-04-03T15:47:33Z

Supporting only 3.x API is fine, we don't really need to depend on 2.x API.

neural_compressor/data/datasets/dataset.py

neural_compressor/strategy/strategy.py

neural_compressor/tensorflow/utils/utility.py

Signed-off-by: Gao, Qun <[email protected]>

zehao-intel and others added 25 commits March 20, 2024 10:59

Enable Uniform QDQ for Keras Models

4c9ff61

Signed-off-by: zehao-intel <[email protected]>

fix bugs

bfd0524

Signed-off-by: zehao-intel <[email protected]>

fix import

f5e6726

Signed-off-by: zehao-intel <[email protected]>

support saved_model out

70cb8d3

Signed-off-by: zehao-intel <[email protected]>

fix import

5c03b79

Signed-off-by: zehao-intel <[email protected]>

Merge branch 'master' of https://github.com/intel/neural-compressor

178449d

Merge branch 'master' into zehao/uniform_qdq

ed31916

fix issues

b1ca538

Signed-off-by: zehao-intel <[email protected]>

add hf resnet50 example

9dd4beb

Signed-off-by: zehao-intel <[email protected]>

fix uint8 max

2a6d162

Signed-off-by: zehao-intel <[email protected]>

fix quint range for sequential or functional keras model

48013dc

Signed-off-by: zehao-intel <[email protected]>

modify bert example

1959471

Signed-off-by: zehao-intel <[email protected]>

refine zp calculation for uint8

d5223eb

Signed-off-by: zehao-intel <[email protected]>

fix resnet50

1db86f1

Signed-off-by: zehao-intel <[email protected]>

fix getting value dict for weight min max

2427209

Signed-off-by: zehao-intel <[email protected]>

fix zp and scale factor

15d5c28

Signed-off-by: zehao-intel <[email protected]>

Update for generating QdQ

506dfa5

Signed-off-by: Qun Gao <[email protected]>

Merge remote-tracking branch 'origin/master' into qg/refactor

58e7afa

Signed-off-by: Qun Gao <[email protected]>

update name changes

d564b77

Signed-off-by: Qun Gao <[email protected]>

Add raise error for Unsupported op type for per-channel quantization

ff5cb06

Signed-off-by: Qun Gao <[email protected]>

add ssd_mobilenet

7f47d66

Signed-off-by: Qun Gao <[email protected]>

remove debugging print

6564fa1

Signed-off-by: Qun Gao <[email protected]>

add support for ssd_mobile

2660fa3

Signed-off-by: Qun Gao <[email protected]>

clean debug

3bd7ba9

Signed-off-by: Gao, Qun <[email protected]>

Merge remote-tracking branch 'origin/master' into qg/refactor before PR

250a03c

qgao007 requested review from chensuyue and lvliang-intel March 27, 2025 22:47

[pre-commit.ci] auto fixes from pre-commit.com hooks

17b9e16

for more information, see https://pre-commit.ci

lvliang-intel requested review from thuang6 and yiliu30 March 30, 2025 13:40

lvliang-intel reviewed Apr 7, 2025

View reviewed changes

neural_compressor/data/datasets/dataset.py Outdated Show resolved Hide resolved

neural_compressor/strategy/strategy.py Outdated Show resolved Hide resolved

neural_compressor/tensorflow/utils/utility.py Outdated Show resolved Hide resolved

qgao007 added 2 commits April 7, 2025 11:19

Update to address Leon's feedback

af3910a

Signed-off-by: Gao, Qun <[email protected]>

Merge remote-tracking branch 'origin/master' into qg/refactor

e25cf25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add a feature of UniformQDQ - support CV/NLP model's OPs, includingConv, DepthwiseConv2D, MatMul, etc. Additional op support be added upon request. #2155

Add a feature of UniformQDQ - support CV/NLP model's OPs, includingConv, DepthwiseConv2D, MatMul, etc. Additional op support be added upon request. #2155

qgao007 commented Mar 27, 2025 •

edited

Loading

thuang6 commented Mar 31, 2025

qgao007 commented Apr 1, 2025

qgao007 commented Apr 1, 2025

thuang6 commented Apr 3, 2025

mahmoud-abuzaina commented Apr 3, 2025

Add a feature of UniformQDQ - support CV/NLP model's OPs, includingConv, DepthwiseConv2D, MatMul, etc. Additional op support be added upon request. #2155

Are you sure you want to change the base?

Add a feature of UniformQDQ - support CV/NLP model's OPs, includingConv, DepthwiseConv2D, MatMul, etc. Additional op support be added upon request. #2155

Conversation

qgao007 commented Mar 27, 2025 • edited Loading

Type of Change

Description

Expected Behavior & Potential Risk

How has this PR been tested?

Dependency Change?

thuang6 commented Mar 31, 2025

qgao007 commented Apr 1, 2025

qgao007 commented Apr 1, 2025

thuang6 commented Apr 3, 2025

mahmoud-abuzaina commented Apr 3, 2025

qgao007 commented Mar 27, 2025 •

edited

Loading