Changes
Add ExLlamaV3 support (#6832) through a new ExLlamav3_HF loader that uses the same samplers as Transformers and ExLlamav2_HF. Wheels compiled with GitHub Actions are included for both Linux and Windows, so no manual installation steps are needed. Note: these wheels require a GPU with CUDA compute capability 8.0 or greater, at least for now.
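
To see whether a GPU meets the requirement in the note above, something like the following may help. It is a minimal sketch, not part of this project: the helper names `meets_requirement` and `detect_capability` are hypothetical, the threshold assumes "8 or greater" means CUDA compute capability 8.0+, and the detection step assumes PyTorch is installed with CUDA support.

```python
from typing import Optional, Tuple

# Assumption: "8 or greater" is read as CUDA compute capability 8.0 or higher.
MIN_CAPABILITY: Tuple[int, int] = (8, 0)


def meets_requirement(
    capability: Tuple[int, int],
    minimum: Tuple[int, int] = MIN_CAPABILITY,
) -> bool:
    """Return True if a (major, minor) compute capability is at least `minimum`."""
    # Tuples compare element by element, so (8, 6) >= (8, 0) and (7, 5) < (8, 0).
    return tuple(capability) >= tuple(minimum)


def detect_capability(device_index: int = 0) -> Optional[Tuple[int, int]]:
    """Return the (major, minor) capability of a CUDA device, or None if unavailable."""
    try:
        import torch  # optional dependency, imported lazily
    except ImportError:
        return None
    if not torch.cuda.is_available():
        return None
    return torch.cuda.get_device_capability(device_index)


if __name__ == "__main__":
    capability = detect_capability()
    if capability is None:
        print("No CUDA device detected (or PyTorch is not installed).")
    elif meets_requirement(capability):
        print(f"Compute capability {capability[0]}.{capability[1]}: meets the requirement.")
    else:
        print(f"Compute capability {capability[0]}.{capability[1]}: below 8.0.")
```

The comparison is kept separate from the detection so it can be used without a GPU present, for example with a capability value looked up in NVIDIA's published tables.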