Hi! Can I deploy DeepSeek V3 on 4 nodes with 14 H100s? The setup: My commands are:
Then I got an error:
Answered by
ispobock
Feb 12, 2025
The TP (tensor parallel) size must evenly divide certain dimensions of the model weights. It is recommended to set the TP size to a power of 2.
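To make the divisibility point concrete, here is a minimal sketch of the kind of check involved. The helper `indivisible_dims` and the dimension values in `example_dims` are assumptions for illustration only. They are not taken from the framework's code or from the model's actual config, so substitute the real values from the config file before relying on the result.

```python
def indivisible_dims(dims: dict[str, int], tp_size: int) -> dict[str, int]:
    """Return {name: remainder} for every dimension that tp_size does not divide evenly."""
    return {
        name: size % tp_size
        for name, size in dims.items()
        if size % tp_size != 0
    }


# Illustrative values only (assumed, not read from a real config file).
example_dims = {"num_attention_heads": 128, "intermediate_size": 18432}

for tp in (8, 14, 16):
    bad = indivisible_dims(example_dims, tp)
    status = "ok" if not bad else f"not divisible, remainders: {bad}"
    print(f"tp_size={tp}: {status}")
```

With these assumed dimensions, a TP size of 14 leaves a remainder on both entries, while 8 and 16 divide them cleanly, which is why a power of 2 is the safer choice.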
Answer selected by
HandH1998