-
-
Notifications
You must be signed in to change notification settings - Fork 796
Cpu fused kernel #1804
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
matthewdouglas
merged 83 commits into
bitsandbytes-foundation:main
from
jiqing-feng:cpu_fused_kernel
Nov 26, 2025
Merged
Cpu fused kernel #1804
Changes from all commits
Commits
Show all changes
83 commits
Select commit
Hold shift + click to select a range
6be1412
add template to support more dtypes
jiqing-feng 252ac0f
update cmake list
jiqing-feng f98c9e5
fix typo
jiqing-feng 902bf35
fix compile cpu
jiqing-feng fef8459
make different dtype works
jiqing-feng 55cbaa0
use bf16 on CPU
jiqing-feng bbef95b
fix state2 dtype
jiqing-feng e842513
remove torch
jiqing-feng d4473fa
rm torch
jiqing-feng dea8dd6
enable float to bf16
jiqing-feng e9bb4fe
rm dequantizeBlockwise4bitCpu
jiqing-feng cdc8d5e
fix check
jiqing-feng baacfac
enable dequant 4bit kernel
jiqing-feng eec3521
fix typo
jiqing-feng d7cc1c5
fix typo
jiqing-feng 124b754
fix dequantize
jiqing-feng 0f918c7
fix
jiqing-feng e1a8b20
fix
jiqing-feng eab45c8
test
jiqing-feng d9f5dd8
fix
jiqing-feng 070f8a0
fix
jiqing-feng a84addf
fix
jiqing-feng c4bb660
fix
jiqing-feng 4ba13fd
fix
jiqing-feng c0d05ec
change input param
jiqing-feng 62a16a6
fix typo
jiqing-feng d9ad828
fix input param
jiqing-feng 09ed6cb
spliut 8bit and 4bit
jiqing-feng a3f7b61
fix typo
jiqing-feng 4708470
fix typo
jiqing-feng 1dfe9f7
fix input params
jiqing-feng 00289c4
fix input params
jiqing-feng a2578ba
fix
jiqing-feng 72033dc
fix typo
jiqing-feng 1c20ae8
enable dequant4bit
jiqing-feng 7552fe2
fix
jiqing-feng 8b32a39
fix
jiqing-feng 8f1cc36
fix reverse
jiqing-feng 49d242a
fix dequant 4bit fallback path
jiqing-feng 4a9a6dc
fix fp4 dequant
jiqing-feng 6bcd19e
Merge branch 'main' into cpu_kernel
jiqing-feng d7e981d
rm _Float16
jiqing-feng 48739b0
tmp codes
jiqing-feng f784be8
enable gemv
jiqing-feng 92192c9
change to 4bit dequant
jiqing-feng bd02e71
fix def
jiqing-feng 8520069
fix type
jiqing-feng e921cbb
fix absmax dtype
jiqing-feng 9b5d97a
fix type
jiqing-feng fd6cff1
fix compile and type
jiqing-feng 46d6e47
enable gemv
jiqing-feng 3271c30
fix shape
jiqing-feng 176a2b6
fix lib name
jiqing-feng 196984a
debug
jiqing-feng 7652115
update
jiqing-feng ea0e649
enable gemv 4bit bf16
jiqing-feng 9277d24
enable avx512 check
jiqing-feng 4fb315b
fix check
jiqing-feng 81f1984
fix endif
jiqing-feng 0f78bad
fix format
jiqing-feng fcb8456
fix format
jiqing-feng c5e1894
fix def
jiqing-feng f2029c6
rebase
jiqing-feng df1d669
fix position
jiqing-feng bb3ac8d
fix format
jiqing-feng 26b5685
rm duplicated func
jiqing-feng 445725b
Merge branch 'main' into cpu_fused_kernel
jiqing-feng 580010c
rm useless code comments
jiqing-feng 57b89bf
fix out shape
jiqing-feng 302a5fe
Merge branch 'main' into cpu_fused_kernel
jiqing-feng de5fb9c
fix comments
jiqing-feng 6858a90
add reverse format
jiqing-feng 3b3d609
check avx512bf15
jiqing-feng fbb911b
fix has_avx512bf16
jiqing-feng 3179b42
fix tests
jiqing-feng 0c88d43
fix absmax shhape
jiqing-feng feb8ad2
fix compile
jiqing-feng c6b714d
fix tests
jiqing-feng 5497111
fix test_gemv
jiqing-feng 0045c4b
Merge branch 'main' into cpu_fused_kernel
jiqing-feng bdb25c0
disable binsearch
jiqing-feng 6cec12d
fix lint
jiqing-feng 692a8e1
fix save
jiqing-feng File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Some comments aren't visible on the classic Files Changed page.
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.