-
Notifications
You must be signed in to change notification settings - Fork 3.6k
Open
Labels
featureIs an improvement or enhancementIs an improvement or enhancementneeds triageWaiting to be triaged by maintainersWaiting to be triaged by maintainers
Description
Description & Motivation
Hi, is there any tutorial on how to setup the expert parallel in lightning trainer? Maybe the shortest way is to use DeepSpeed MoE (https://www.deepspeed.ai/tutorials/mixture-of-experts/) since lightning support deep speed strategy. But it needs to set some optimizer parameter groups or something. Can anyone give me some hints to achieve this?
Pitch
No response
Alternatives
No response
Additional context
No response
Metadata
Metadata
Assignees
Labels
featureIs an improvement or enhancementIs an improvement or enhancementneeds triageWaiting to be triaged by maintainersWaiting to be triaged by maintainers