-
Notifications
You must be signed in to change notification settings - Fork 0
Everything Quantization
Botchkarev edited this page Oct 17, 2023
·
6 revisions
FINN-hls4ml Guidelines for Quantization-Aware Training
Medium Blog about GPT-2 Optimization
[Repo transfomers-silicon-research, with lots of links]https://github.com/alimpk/transfomers-silicon-research
[PyTorch Quantization Guide] (https://pytorch.org/docs/stable/quantization.html)
[PyTorch Quantization API Reference] (https://pytorch.org/docs/stable/quantization-support.html#quantization-api-reference)
[Deploying Int8 QAT Models on GPU with TensorRT] (https://pytorch.org/TensorRT/_notebooks/vgg-qat.html)
quantization-and-training-of-neural-networks
fasttextzip-compressing-text-classification