Skip to content

embedding的推理问题 #3

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
dywlegend1002 opened this issue Apr 11, 2024 · 0 comments
Open

embedding的推理问题 #3

dywlegend1002 opened this issue Apr 11, 2024 · 0 comments

Comments

@dywlegend1002
Copy link

embedding的推理,根据日志好像是频繁加载再推理的,这个在embedding模型很大的时候,很浪费时间,所以希望优化一下,做到一次加载,多次推理。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant