Skip to content

Merge branch 'main' into add/quant_fp16_fp8_0.11.0

9064fb1
Select commit
Loading
Failed to load commit list.
Sign in for the full log view
Closed

feat: implement online bf16-to-fp8 conversion and inference in TurboMind #4237

Merge branch 'main' into add/quant_fp16_fp8_0.11.0
9064fb1
Select commit
Loading
Failed to load commit list.