[Ascend][Doc] updated installation guide for Ascend NPU #13585
base: main
| @@ -0,0 +1,69 @@ | |||
| ## Examples | |||
I think it would be better to change the file name to deepseek-family-examples.md
| #### vLLM | ||
|
|
||
| vLLM is still a major prerequisite on Ascend NPU. Because of the `torch==2.6.0` limitation, only vLLM v0.8.5 is supported. | ||
| vLLM is still a major prerequisite on Ascend NPU. |
I think vLLM is a major prerequisite for SGLang, not for Ascend NPU itself.
| #### Triton on Ascend | ||
| ```shell | ||
| BISHENG_NAME="Ascend-BiSheng-toolkit_aarch64.run" | ||
| BISHENG_URL="https://sglang-ascend.obs.cn-east-3.myhuaweicloud.com/sglang/${BISHENG_NAME}" | ||
| wget -O "${BISHENG_NAME}" "${BISHENG_URL}" && chmod a+x "${BISHENG_NAME}" && "./${BISHENG_NAME}" --install && rm "${BISHENG_NAME}" | ||
| ``` | ||
| ```shell | ||
| pip install triton-ascend==3.2.0rc4 | ||
| ``` | ||
| To install a nightly build of Triton on Ascend or to build it from source, follow the [installation guide](https://gitcode.com/Ascend/triton-ascend/blob/master/docs/sources/getting-started/installation.md). | ||
|
|
||
| _Notice:_ We recommend installing triton-ascend from source because it is developing rapidly and the version on PyPI cannot keep up for now. This will be resolved in Sep. 2025, after which `pip install` will be the only installation method. | ||
|
|
||
| Please follow Triton-on-Ascend's [installation guide from source](https://gitee.com/ascend/triton-ascend#2%E6%BA%90%E4%BB%A3%E7%A0%81%E5%AE%89%E8%A3%85-triton-ascend) to install the latest `triton-ascend` package. | ||
| #### SGLang Kernels NPU | ||
| To install SGLang Kernels NPU, see the [installation guide](https://github.com/sgl-project/sgl-kernel-npu/blob/main/python/sgl_kernel_npu/README.md). | ||
|
|
||
| #### DeepEP-compatible Library | ||
|
|
||
| We also provide a DeepEP-compatible library as a drop-in replacement for deepseek-ai's DeepEP library; see the [installation guide](https://github.com/sgl-project/sgl-kernel-npu/blob/main/python/deep_ep/README.md). | ||
|
|
||
| #### CustomOps | ||
| _TODO: to be removed once merged into sgl-kernel-npu._ | ||
| ```shell | ||
| wget https://sglang-ascend.obs.cn-east-3.myhuaweicloud.com/ops/CANN-custom_ops-8.2.0.0-$DEVICE_TYPE-linux.aarch64.run | ||
| chmod a+x ./CANN-custom_ops-8.2.0.0-$DEVICE_TYPE-linux.aarch64.run | ||
| ./CANN-custom_ops-8.2.0.0-$DEVICE_TYPE-linux.aarch64.run --quiet --install-path=/usr/local/Ascend/ascend-toolkit/latest/opp | ||
| wget https://sglang-ascend.obs.cn-east-3.myhuaweicloud.com/ops/custom_ops-1.0.$DEVICE_TYPE-cp311-cp311-linux_aarch64.whl | ||
| pip install ./custom_ops-1.0.$DEVICE_TYPE-cp311-cp311-linux_aarch64.whl | ||
| ``` |
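The CustomOps snippet above relies on `$DEVICE_TYPE`, which the diff never defines. A minimal sketch of how the two file names could be derived once that variable is set is shown here; the fallback value `a3` is only a placeholder assumption for illustration, and `RUN_FILE` and `WHEEL_FILE` are hypothetical helper names that do not appear in the PR.

```shell
# ASSUMPTION: "a3" is a placeholder default, not a value confirmed by the PR.
# Export DEVICE_TYPE yourself to match your hardware before running this.
DEVICE_TYPE="${DEVICE_TYPE:-a3}"

# Hypothetical helper variables: build each file name once so that the
# wget, chmod, and install steps cannot drift apart.
RUN_FILE="CANN-custom_ops-8.2.0.0-${DEVICE_TYPE}-linux.aarch64.run"
WHEEL_FILE="custom_ops-1.0.${DEVICE_TYPE}-cp311-cp311-linux_aarch64.whl"

# Print the resolved names for inspection before downloading anything.
echo "${RUN_FILE}"
echo "${WHEEL_FILE}"
```

Printing the resolved names first makes an unset or misspelled `DEVICE_TYPE` visible before any download is attempted.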
How about using a table to collect these prerequisites? Personally, I think it's a bit long to read
|
|
||
| ### Running DeepSeek-V3 | ||
|
|
||
| Running DeepSeek with PD disaggregation on 2 x Atlas 800I A3. |
Please also add a qwen-family-examples.md.
Motivation
Modifications
Accuracy Tests
Benchmarking and Profiling
Checklist