Enable ipex and other optimizations #1628


Open · wants to merge 25 commits into base: main

Conversation

@jiqing-feng (Contributor) commented May 8, 2025

This PR enables IPEX and other optimizations, including:

  1. IPEX fused ops
  2. FP4 support on CPU
  3. `has_rem` handling in 4-bit quantize/dequantize
  4. a simple 8-bit matmul path to make fine-tuning faster on CPU

It also fixes the parameter patch for CPU.

It passes all transformers tests.

After this PR is merged, I will update the installation guide.
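To illustrate the idea behind item 4, here is a minimal pure-Python sketch of a "simple 8-bit matmul": quantize each row of A to int8 with a per-row absmax scale, multiply-accumulate in integers, then rescale. The names and the quantization scheme are illustrative assumptions, not bitsandbytes' actual kernel.

```python
# Hypothetical sketch of a simple 8-bit matmul (illustrative only; not
# bitsandbytes' real implementation).

def quantize_rows(a):
    """Per-row absmax int8 quantization: returns (int8 rows, per-row scales)."""
    q, scales = [], []
    for row in a:
        # Scale so the largest magnitude maps to 127; guard against all-zero rows.
        s = max(abs(x) for x in row) / 127 or 1.0
        q.append([round(x / s) for x in row])
        scales.append(s)
    return q, scales

def matmul_8bit(a, b):
    """C = A @ B with A quantized to int8 and rescaled afterwards."""
    qa, scales = quantize_rows(a)
    n = len(b[0])
    out = []
    for row, s in zip(qa, scales):
        out.append([s * sum(qi * b[k][j] for k, qi in enumerate(row))
                    for j in range(n)])
    return out
```

Because the accumulation happens on small integers, a real backend can hand this inner loop to a fused int8 kernel; the rescale step restores the floating-point result up to quantization error.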

@matthewdouglas @Titus-von-Koeller

Signed-off-by: jiqing-feng <[email protected]>
@jiqing-feng marked this pull request as ready for review May 8, 2025 07:25

github-actions bot commented May 8, 2025

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@jiqing-feng (Contributor, Author)

I am cleaning up the CPU and XPU tests; progress is at 50%.

```
    quant_state.blocksize,
    quant_state.shape,
    quant_state.dtype,
)
```
Is there a reason why this change can't be in bitsandbytes/backends/cpu/ops.py?
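For context, the fields quoted above (blocksize, shape, dtype) are exactly what a blockwise 4-bit dequantize needs: each block of `blocksize` values shares one absmax scale, and each 4-bit code indexes a small codebook. A pure-Python sketch, with a toy codebook and hypothetical names rather than bitsandbytes' real kernel:

```python
# Hypothetical sketch of blockwise 4-bit dequantization (illustrative only).
# A real FP4/NF4 codebook has 16 entries; this toy version stores sign
# separately and uses 8 magnitudes.

FP4_CODE = [0.0, 0.0625, 0.125, 0.25, 0.5, 0.75, 1.0, 1.5]  # toy codebook

def dequantize_blockwise(codes, absmax, blocksize):
    """codes: list of (sign, index) pairs; absmax: one scale per block."""
    out = []
    for i, (sign, idx) in enumerate(codes):
        scale = absmax[i // blocksize]   # each block shares one absmax scale
        out.append(sign * FP4_CODE[idx] * scale)
    return out
```

A backend-specific version of this loop (fused, vectorized) is the kind of op that could live in bitsandbytes/backends/cpu/ops.py.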


Signed-off-by: jiqing-feng <[email protected]>
@matthewdouglas added this to the v0.47.0 milestone May 9, 2025
@jiqing-feng (Contributor, Author)

```
pytest --ignore test_optim.py --ignore test_triton.py --ignore test_cuda_setup_evaluator.py
```

CPU previous: 378 passed, 1537 failed, 1638 skipped, 197 xfailed, 153 warnings in 613.27s
CPU current: 2079 passed, 1498 skipped, 153 deselected, 9 xfailed, 59 warnings in 1192.94s

XPU previous: not enabled
XPU current: 2093 passed, 1493 skipped, 153 deselected, 63 warnings in 562.25s

It also passes all transformers tests.

I also updated the installation guide.

Hi @matthewdouglas. Please take the next round of review.

Signed-off-by: jiqing-feng <[email protected]>
@@ -316,15 +316,29 @@ pip install -e . # `-e` for "editable" install, when developing BNB (otherwise
> [!TIP]
> Intel CPU/XPU backend only supports building from source; for now, please follow the instructions below.
- It does not need to compile C++ code; all required ops are in [intel_extension_for_pytorch](https://pytorch-extension.intel.com/); please follow the instructions to install IPEX.
+ It requires [intel_extension_for_pytorch](https://pytorch-extension.intel.com/); please follow the instructions to install IPEX.
@matthewdouglas (Member) commented May 12, 2025

I would expect IPEX to be optional. Especially so for CPU on Windows or for Linux/macOS on aarch64.
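One common way to keep such a dependency optional is a guarded import with a generic fallback, so the package still imports on platforms where IPEX doesn't exist (e.g. Windows CPU, aarch64 Linux/macOS). A sketch of the pattern, not the PR's actual code:

```python
# Hypothetical optional-dependency guard (illustrative; names are not
# bitsandbytes' real API). Importing fails cleanly where IPEX is unavailable.
try:
    import intel_extension_for_pytorch  # noqa: F401
    HAS_IPEX = True
except ImportError:
    HAS_IPEX = False

def use_fused_path() -> bool:
    """Select fused IPEX kernels only when the extension is importable;
    callers fall back to the generic implementation otherwise."""
    return HAS_IPEX
```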

Signed-off-by: jiqing-feng <[email protected]>

3 participants