-
Notifications
You must be signed in to change notification settings - Fork 549
Pull requests: pytorch/FBGEMM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Backout Unifying TBE API using List (Frontend)
cla signed
fb-exported
#3803
opened Mar 11, 2025 by
spcyppt
Loading…
Migrate TBE benchmark utilities over to TBE, pt 5
cla signed
fb-exported
#3801
opened Mar 11, 2025 by
q10
Loading…
Add Preshuffled FP8 x INT4 Grouped Gemm Kernel
cla signed
fb-exported
#3800
opened Mar 11, 2025 by
jwfromm
Loading…
[fbgemm_gpu] Limit the number of ROCm hardware targets
ciflow/rocm
cla signed
module: rocm
#3797
opened Mar 11, 2025 by
q10
Loading…
Fast BF16 Reduction for AllReduce
cla signed
fb-exported
#3793
opened Mar 10, 2025 by
zjing14
Loading…
Log feature gate statuses in TBE init
cla signed
fb-exported
#3792
opened Mar 10, 2025 by
q10
Loading…
Performance Optimization: Improved TileShape Configuration for Large Llama Shapes
cla signed
#3790
opened Mar 10, 2025 by
MatrixAssembler
Loading…
Enable FP8 Triton dequantized block-wise kernel
cla signed
fb-exported
#3788
opened Mar 10, 2025 by
jiawenliu64
Loading…
Fix TBE inference per-sample weight
cla signed
fb-exported
#3787
opened Mar 10, 2025 by
sryap
Loading…
Handle zero inputs in F8I4 GEMM
cla signed
fb-exported
#3777
opened Mar 6, 2025 by
jiawenliu64
Loading…
Provide helper functions for int4 quantization
cla signed
fb-exported
#3775
opened Mar 6, 2025 by
jwfromm
Loading…
FBGEMM Add Columnwise Weight Scaling to F8I4 GEMM
cla signed
fb-exported
#3766
opened Mar 5, 2025 by
jwfromm
Loading…
fix: topology_utils.h compilation issue
cla signed
#3761
opened Mar 4, 2025 by
DevashishLal-CB
Loading…
Move float conversion functions from Types.h into new FloatConversion.h
cla signed
fb-exported
#3760
opened Mar 4, 2025 by
MatzeB
Loading…
Invoke CPU schema op for unweighted version TBE on MTIA
cla signed
fb-exported
#3757
opened Mar 3, 2025 by
egienvalue
Loading…
Enable multi-processing in CPU TBE micro-benchmarks
cla signed
fb-exported
#3753
opened Feb 28, 2025 by
excelle08
Loading…
Tuned fp8/bf16 GroupGEMM for MOE_17B
cla signed
fb-exported
#3752
opened Feb 28, 2025 by
zjing14
Loading…
Use GEMM kernel for KleidiAI to accelerate FP32Benchmark
cla signed
#3751
opened Feb 28, 2025 by
milpuz01
Loading…
Back out "Move
execute_backward_adagrad
into a class"
cla signed
fb-exported
#3749
opened Feb 28, 2025 by
q10
Loading…
Previous Next
ProTip!
Adding no:label will show everything without a label.