Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

搭建好环境后执行python tools/process_data.py --config configs/demo/process.yaml 命令报错 #580

Open
ctgushiwei opened this issue Feb 18, 2025 · 1 comment
Assignees
Labels
question Further information is requested

Comments

@ctgushiwei
Copy link

环境:
python3.10
ubuntu20.04
报错如下:

Image

@HYLcool
Copy link
Collaborator

HYLcool commented Feb 21, 2025

@ctgushiwei ,感谢你对 Data-Juicer 的关注与使用!

根据日志来看,应该是在下载语言分类器模型时因为网络等问题失败了,你可以根据红色的错误提示信息手动将模型下载到对应的目录

Downloading model [lid.176.bin] error. Please retry later or download it into /root/.cache/data _juicer/models manually from https://dail-wlcb.oss-cn-wulanchabu.aliyuncs.com/data_juicer/models/lid.176.bin or https://dl.fbaipublicfiles.com/fasttext/supervised-models/lid.176.bin

@HYLcool HYLcool self-assigned this Feb 21, 2025
@HYLcool HYLcool added the question Further information is requested label Feb 21, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants