koboldcpp-1.57 - CUDA 12.3 build

kalomaze released this 08 Feb 23:31
· 102 commits to concedo since this release

I have merged the (currently unmerged) llama.cpp PR that speeds up Mixtral prompt processing. Expect roughly a 1.25x prompt processing speedup for the layers that run on the CPU.
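
Since the ~1.25x figure applies only to CPU layers, the overall gain depends on how much of your prompt processing time is actually spent on the CPU. Below is a rough Amdahl-style estimate, not a benchmark. The function name `overall_speedup` and the `cpu_fraction` parameter are hypothetical, and the sketch assumes GPU-offloaded layers are unaffected by this change.

```python
def overall_speedup(cpu_fraction: float, cpu_layer_speedup: float = 1.25) -> float:
    """Estimate whole-prompt speedup when only the CPU layers get faster.

    cpu_fraction: ASSUMED share of prompt processing time spent in CPU
        layers before the change (0.0 = fully offloaded, 1.0 = CPU only).
    cpu_layer_speedup: the approximate 1.25x figure from this release note.
    """
    if not 0.0 <= cpu_fraction <= 1.0:
        raise ValueError("cpu_fraction must be between 0 and 1")
    if cpu_layer_speedup <= 0.0:
        raise ValueError("cpu_layer_speedup must be positive")
    # New total time relative to a baseline of 1.0: the non-CPU part is
    # unchanged, the CPU part shrinks by the speedup factor.
    new_time = (1.0 - cpu_fraction) + cpu_fraction / cpu_layer_speedup
    return 1.0 / new_time


if __name__ == "__main__":
    # Illustrative fractions only; measure your own setup for real numbers.
    for fraction in (0.0, 0.5, 1.0):
        print(f"{fraction:.0%} of time on CPU -> {overall_speedup(fraction):.3f}x")
```

With every layer on the CPU the estimate matches the full ~1.25x; the more layers you offload to the GPU, the smaller the overall improvement you should expect from this build.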