About training epochs and time for each stage #18

raymond-huang-sony · 2025-02-24T17:47:58Z

Thank you for the brilliant work!

I would like to run the VILA-U training myself to understand it better, but I could not find the detailed training setup in the paper. The default number of epochs in the pre-training script appears to be one. Could you confirm whether the models reported in the paper used the same setting?

Also, it would be great if you could share the approximate training time for each stage, i.e. training the vision tower, pre-training, and supervised fine-tuning. P.S. I found the overall training cost reported in #2.

Thank you.

raymond-huang-sony closed this as completed Feb 25, 2025

raymond-huang-sony changed the title ~~Download datasets for supervised fine-tuning~~ About training epochs and time for each stage Feb 25, 2025

raymond-huang-sony reopened this Feb 25, 2025
