
Commit dbb3cc2

oagniqgnat authored
Update README (#115)
Co-authored-by: oagniqgnat <[email protected]>
1 parent bf9af58 commit dbb3cc2

1 file changed: +2 -4 lines changed


README.md

Lines changed: 2 additions & 4 deletions
@@ -5,9 +5,9 @@ Contribution guide refer to [Contribution Guide](docs/developer_guide/contributi

 ## Quick start

-[DeepEP](https://github.com/deepseek-ai/DeepEP) is a communication library tailored for Mixture-of-Experts (MoE) and expert parallelism (EP).
+DeepEP-Ascend: Ascend Implementation of DeepEP. [README](https://github.com/sgl-project/sgl-kernel-npu/blob/main/python/deep_ep/README.md)

-[DeepEP-Ascend](https://github.com/sgl-project/sgl-kernel-npu/blob/main/python/deep_ep/README.md) is the adaptation implementation of DeepEP on Ascend.
+SGL-Kernel-NPU: Other SGLang Kernels for Ascend NPU. [README](https://github.com/sgl-project/sgl-kernel-npu/blob/main/python/sgl_kernel_npu/README.md)

 ## DeepEP-Ascend Performance

@@ -32,5 +32,3 @@ We test normal kernels on A3 384 SuperPOD. And we follow a typical DeepSeek-V3/R
 | 8 | 132 us | 58 GB/s (HCCS) | 8 | 126 us | 116 GB/s (HCCS) |
 | 16 | 139 us | 55 GB/s (HCCS) | 16 | 135 us | 109 GB/s (HCCS) |
 | 32 | 153 us | 49 GB/s (HCCS) | 32 | 151 us | 97 GB/s (HCCS) |
-| 64 | | | 64 | | |
-| 128 | | | 128 | | |
