Replies: 1 comment
-
Hey, can you try enable cpu offload and reduce batch size? Theoretically, it should fit. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi team and @winglian ,
I get OOM error when pretraining Yi-34 with 8*A100 80GB, flash_attn, deepspeed zero3, sequence_len=2048. So what is minimum gpus for this?
Beta Was this translation helpful? Give feedback.
All reactions