Request Pre-Training Dataset Ratio #190

zyx006 · 2025-04-10T09:21:29Z

I would like to know the ratio of programming language (PL) to natural language (NL) data in the pre-training datasets of the codet5-base and codet5p-220m-bimodal models. For example, is the ratio PL:NL = 2:1?

yuewang-cuhk · 2025-04-10T09:22:03Z

Hello, I have received your email and will look at it as soon as possible.
