Fix slow gguf tests #2844

For-rest2005 · 2025-03-26T09:42:24Z

I fixed the performance problem for gguf tests. By modifying the llama-cpp-python and building a new API model, we only need to use about 1/10 time compared to previous methods.
For details, please refer to abetlen/llama-cpp-python#1983 and https://github.com/For-rest2005/lm-evaluation-harness
The API model is not well written and require the modification in llama-cpp-python. If needed, I can make a pull request.
This problem is solved with the help of @PolluxyShi

baberabb · 2025-03-26T11:55:44Z

Hi! A PR will be appreciated!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix slow gguf tests #2844

Fix slow gguf tests #2844

For-rest2005 commented Mar 26, 2025

baberabb commented Mar 26, 2025

Uh oh!

Fix slow gguf tests #2844

Fix slow gguf tests #2844

Comments

For-rest2005 commented Mar 26, 2025

baberabb commented Mar 26, 2025

Uh oh!