Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Performance of num_head=1? #2

Open
FacePoluke opened this issue Jan 2, 2025 · 2 comments
Open

Performance of num_head=1? #2

FacePoluke opened this issue Jan 2, 2025 · 2 comments

Comments

@FacePoluke
Copy link

Thank you for your excellent work. I noticed that your codebook head number is 4, which means downstream generation tasks need to output 4 tokens at once, potentially making training more challenging. I would like to know how much the performance would differ if the head number is 1.
Looking forward to your response.

@zbr17
Copy link
Owner

zbr17 commented Jan 2, 2025

Hello!

Thank you for your interest in our OptVQ project. In the early stages of the project, I experimented with the setting num-head=1, and generally, we found that the performance was inferior compared to when num-head=4. However, due to limited laboratory resources, I am currently using an 8-GPU 4090 server, and in the latest version of the training code, I only have results for num-head=4 and num-head=8. Please bear with me for a moment, as I plan to include checkpoints and evaluation results for num-head=1 in the project in the future.

Best regards,
Borui Zhang

@FacePoluke
Copy link
Author

Hello!

Thank you for your interest in our OptVQ project. In the early stages of the project, I experimented with the setting num-head=1, and generally, we found that the performance was inferior compared to when num-head=4. However, due to limited laboratory resources, I am currently using an 8-GPU 4090 server, and in the latest version of the training code, I only have results for num-head=4 and num-head=8. Please bear with me for a moment, as I plan to include checkpoints and evaluation results for num-head=1 in the project in the future.

Best regards, Borui Zhang

Thanks for your reply. Looking forward to it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants