Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

FT-Data Ranker_大语言模型微调数据赛, 是否可以分享该比赛的数据用于对Data-Juicer项目的使用。 #603

Open
user2311717757 opened this issue Mar 3, 2025 · 1 comment
Assignees
Labels
question Further information is requested

Comments

@user2311717757
Copy link

尊敬的Data-Juicer框架开发者,你们好。最近,我们有对大模型数据进行处理的需求。从论文“Data-Juicer: A One-Stop Data Processing System for Large Language Models”调研到Data-Juicer的开源大模型数据处理框架。我们想进一步使用和探索这个框架。正好,我们看到了你们在天池比赛中发布了“FT-Data Ranker_大语言模型微调数据赛(7B模型赛道)”比赛。但是比赛已经结束无法获取原始数据。是否可以提供原始数据以供我们探索和使用Data-Juicer框架。万分感谢🙏。

@HYLcool
Copy link
Collaborator

HYLcool commented Mar 4, 2025

@user2311717757 ,感谢你对 Data-Juicer 的关注与使用!

比赛结束后,我们为系列赛开放了日常学习赛,那里可以获取到数据等相关资料并继续提交结果参与打榜,其中7B赛道的比赛地址为:https://tianchi.aliyun.com/competition/entrance/532291?spm=a2c22.12281976.0.0.15a638969XbMsh

欢迎进行尝试~

@HYLcool HYLcool added the question Further information is requested label Mar 4, 2025
@HYLcool HYLcool self-assigned this Mar 4, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants