You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I like the training of transformers+deepspeed very much. After reading the content of deepspeed MOE, I want to see if deepseek v3 can be supported through deepspeed. It seems that there is still a long way to run. Is there a plan?
The text was updated successfully, but these errors were encountered:
I like the training of transformers+deepspeed very much. After reading the content of deepspeed MOE, I want to see if deepseek v3 can be supported through deepspeed. It seems that there is still a long way to run. Is there a plan?
The text was updated successfully, but these errors were encountered: