What do the different quantize types actually do? #1102
I’m fairly new to AI, but I’ve been playing around with llama for a week or so. I noticed that the quantize code implements four different types, but the README only shows type 2 being used. I’m having trouble following what the four modes are for. What do the other three quantize types actually do to the model when run? They produce different outputs, but I don’t understand what the purpose of each one is. Thanks!
Answered by sw, Apr 22, 2023
Answer selected by halfburnttoast
See #1121