Quantin'
-
text-generation-webui Public
Forked from oobabooga/text-generation-webuiA Gradio web UI for Large Language Models. Supports transformers, GPTQ, llama.cpp (GGUF), Llama models.
-
formatron Public
Forked from Dan-wanna-M/formatronFormatron empowers everyone to control the format of language models' output with minimal overhead.
-
lm-format-enforcer Public
Forked from noamgat/lm-format-enforcerEnforce the output format (JSON Schema, Regex etc) of a language model
Python MIT License UpdatedSep 14, 2024 -
tabbyAPI Public
Forked from theroyallab/tabbyAPIAn OAI compatible exllamav2 API that's both lightweight and fast
-
transformers Public
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
-
exllama Public
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
-
alpaca_lora_4bit Public
Forked from johnsmith0031/alpaca_lora_4bit