-
Notifications
You must be signed in to change notification settings - Fork 255
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
License
casper-hansen/AutoAWQ
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
About
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
License
Stars
Watchers
Forks
Packages 0
No packages published