Error while finetune with deepspeed zero3 for multi-node (multi-GPUS) #2314
-
What is the accelerate.yaml and the config.yaml configuration to finetune on multi-nodes and each node with multi gpus?
|
Beta Was this translation helpful? Give feedback.
Answered by
hahmad2008
Feb 11, 2025
Replies: 1 comment 25 replies
-
@winglian @NanoCode012 Could you please check this? |
Beta Was this translation helpful? Give feedback.
25 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
@NanoCode012 I tried to enable this
use_reentrant: true
which fix the error I got.