Mixed precision simple demo
docker build . -t mixed-precision
docker run -it --gpus=all --rm --ipc=host mixed-precision bash
- Without mixed precision:
python main.py --batch-size 256 - With mixed precision:
python main.py --batch-size 256 --use-mixed-precision O1 - With resnet50, no mixed precision and huge batch we get and
CUDA out of memory message:python main.py --arch resnet50 --batch-size 128 - With resnet50 and mixed preicsion, same huge batch training works:
python main.py --arch resnet50 --batch-size 128 --use-mixed-precision O1