You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+2-4Lines changed: 2 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -5,9 +5,9 @@ Contribution guide refer to [Contribution Guide](docs/developer_guide/contributi
5
5
6
6
## Quick start
7
7
8
-
[DeepEP](https://github.com/deepseek-ai/DeepEP) is a communication library tailored for Mixture-of-Experts (MoE) and expert parallelism (EP).
8
+
DeepEP-Ascend: Ascend Implementation of DeepEP. [README](https://github.com/sgl-project/sgl-kernel-npu/blob/main/python/deep_ep/README.md)
9
9
10
-
[DeepEP-Ascend](https://github.com/sgl-project/sgl-kernel-npu/blob/main/python/deep_ep/README.md) is the adaptation implementation of DeepEP on Ascend.
10
+
SGL-Kernel-NPU: Other SGLang Kernels for Ascend NPU. [README](https://github.com/sgl-project/sgl-kernel-npu/blob/main/python/sgl_kernel_npu/README.md)
11
11
12
12
## DeepEP-Ascend Performance
13
13
@@ -32,5 +32,3 @@ We test normal kernels on A3 384 SuperPOD. And we follow a typical DeepSeek-V3/R
32
32
| 8 | 132 us | 58 GB/s (HCCS) | 8 | 126 us | 116 GB/s (HCCS) |
33
33
| 16 | 139 us | 55 GB/s (HCCS) | 16 | 135 us | 109 GB/s (HCCS) |
34
34
| 32 | 153 us | 49 GB/s (HCCS) | 32 | 151 us | 97 GB/s (HCCS) |
0 commit comments