kobold.cpp-elephantastic_experimental_v1.43.b1216

Nexesenex released this 16 Sep 13:44

· 5556 commits to concedo since this release

1.43.b1216

2dc9668

Kobold CPP v1.43 with CUDA/CUBLAS MMQ fixed (buffers are allocated properly from the start), and unrestricted context.
CodeLlama2 c34b in Q4_K_S can run with 16384 context on a GTX 3090/4090 used as a second graphic card.

Assets 3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

kobold.cpp-elephantastic_experimental_v1.43.b1216