-
Notifications
You must be signed in to change notification settings - Fork 254
Issues: casper-hansen/AutoAWQ
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
An error occurred when quantizing deepseek-r1-bf16 with longer data.
#729
opened Mar 19, 2025 by
taishan1994
Quantizing DeepSeek-R1-Distill-Qwen-7B produces garbage and repetitive tokens
#724
opened Mar 11, 2025 by
hav4ik
What is the best format of the calib_data for instract model?
#721
opened Mar 3, 2025 by
planetroger2020
Unpinning transformers or upgrading its latest supported version
#719
opened Feb 25, 2025 by
mirekphd
"Clarification on Multimodal Model Quantization and Default Calibration Dataset"
#714
opened Feb 17, 2025 by
donghong1
Same Memory (VRAM) with different batch_size, Prefill Length, Decode Length.
#691
opened Jan 15, 2025 by
rayzr0123
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.