Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GPU Layers for No Model mode. #1425

Open
razvanab opened this issue Mar 15, 2025 · 6 comments
Open

GPU Layers for No Model mode. #1425

razvanab opened this issue Mar 15, 2025 · 6 comments

Comments

@razvanab
Copy link

razvanab commented Mar 15, 2025

Describe the Issue
Gamma 2 with Ollama run slowly with the default GPU Layers in No Model mode.
Please add a setting to change the GPU Layers when running in No Model mode

Additional Information:

koboldcpp-1.86

Windows 11
CPU: Ryzen 2700x
RAM: 32GB
GPU: Nvidia RTX 3050
VRAM: 8GB

@LostRuins
Copy link
Owner

Does it not predict the layers on your device? What does the estimate show? Do you have a screenshot of your GUI launcher

@razvanab
Copy link
Author

razvanab commented Mar 15, 2025

I use it with the nomodel switch, which doesn't load the koboldcpp GUI.

koboldcpp_cu12.exe --nomodel

@razvanab
Copy link
Author

I found out that you can launch the GUI with --showgui even with --nomodel. I guess changing GPU layers on the fly is not yet supported.

@LostRuins
Copy link
Owner

LostRuins commented Mar 15, 2025

Oh well that makes a lot of sense. How can it estimate gpu layers without a model lol. It has no idea what you want to use, so how will it predict the layers to use.

@henk717
Copy link

henk717 commented Mar 15, 2025

Just don't use ollama, our own backend is much more suitable for our UI.
I made a script here that can liberate ollama models and provide you with the GGUF : https://gist.github.com/henk717/4ffd6b7486d2cb1e74fb8251ec46d56e

@razvanab
Copy link
Author

Oh, nice, thanks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants