-
train base model with imagenet100 dataset
-
Sparsify and export sparsity onnx
python onnx_export_sparsity.py- fine tuning
- sparsity_mode: "sparsegpt" or "sparse_magnitude"
-
generate tensorrt model
python onnx2trt.py
- fp16 sparse_magnitude
Gpu Mem: 138M
[TRT_E] Test Top-1 Accuracy: 83.28%
[TRT_E] Test Top-5 Accuracy: 96.72%
[TRT_E] 10000 iterations time: 6.7392 [sec]
[TRT_E] Average FPS: 1483.85 [fps]
[TRT_E] Average inference time: 0.67 [msec]