Make a sense of the new GGML Quantized Methods? #559
Unanswered
JacobGoldenArt
asked this question in
Q&A
Replies: 1 comment
-
I'm curious if/when we will get Metal support for these other types of GGML quants. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi, I've been using LLAMA-CPP-Python successfully on my M2 mac specifically. So far I've been only using Q4_0 versions of ggml models given the metal installation steps :
But as I see from The Blokes model cards, There's a bunch of new quantized methods. I've read the descriptions of each method but not being a ML dev, I'm not really sure of the benefits of each and also if these knew methods are compatible with llama-cpp-python (specifically for mac (metal). Any thoughts Appreciated.
Beta Was this translation helpful? Give feedback.
All reactions