You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I find that , for apps.plm.train, PLM has been applied FSDP. It can not be fine-tuned with two 80G GPU mem (OOM error), setting tp_size=2 and batch_size=1 . the code can be trained with Deepspeed?