-
Notifications
You must be signed in to change notification settings - Fork 433
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
GPU Layers for No Model mode. #1425
Comments
Does it not predict the layers on your device? What does the estimate show? Do you have a screenshot of your GUI launcher |
I use it with the nomodel switch, which doesn't load the koboldcpp GUI. koboldcpp_cu12.exe --nomodel |
I found out that you can launch the GUI with --showgui even with --nomodel. I guess changing GPU layers on the fly is not yet supported. |
Oh well that makes a lot of sense. How can it estimate gpu layers without a model lol. It has no idea what you want to use, so how will it predict the layers to use. |
Just don't use ollama, our own backend is much more suitable for our UI. |
Oh, nice, thanks. |
Describe the Issue
Gamma 2 with Ollama run slowly with the default GPU Layers in No Model mode.
Please add a setting to change the GPU Layers when running in No Model mode
Additional Information:
koboldcpp-1.86
Windows 11
CPU: Ryzen 2700x
RAM: 32GB
GPU: Nvidia RTX 3050
VRAM: 8GB
The text was updated successfully, but these errors were encountered: