A prototype repo for hybrid training that combines pipeline parallelism and distributed data parallelism, with comments on core code snippets. Feel free to copy code and launch discussions about the problems you hav…

Python 55 2 Updated Jul 4, 2023

pytorch / PiPPy

Pipeline Parallelism for PyTorch

Python 759 88 Updated Aug 21, 2024

jllllll / flash-attention

Forked from Dao-AILab/flash-attention

Fast and memory-efficient exact attention - Windows wheels

Python 33 4 Updated Mar 3, 2024

theroyallab / tabbyAPI

An OAI-compatible ExLlamaV2 API that's both lightweight and fast

Python 865 104 Updated Mar 19, 2025

Flode-Labs / vid2densepose

Convert your videos to densepose and use it on MagicAnimate

Python 1,092 132 Updated Dec 10, 2023

unslothai / unsloth

Finetune Llama 3.3, DeepSeek-R1, Gemma 3 & Reasoning LLMs 2x faster with 70% less memory! 🦥

Python 35,473 2,724 Updated Mar 22, 2025

yxli2123 / LoftQ

Python 220 20 Updated Jun 11, 2024

turboderp-org / exui

Web UI for ExLlamaV2

JavaScript 487 46 Updated Feb 5, 2025

latitudegames / AIDungeon

Infinite adventures await!

Python 3,204 551 Updated Jul 25, 2020

ChrisHayduk / qlora-multi-gpu

Forked from artidoro/qlora

QLoRA with Enhanced Multi GPU Support

Jupyter Notebook 36 5 Updated Aug 8, 2023

jllllll / bitsandbytes-windows-webui

Windows compile of bitsandbytes for use in text-generation-webui.

HTML 350 38 Updated Nov 18, 2023

FuPeiJiang / VD.ahk

Windows Virtual Desktop, AutoHotkey, Windows 11 support, Windows Server 2022, switch desktop, move window(wintitle) to current desktop; createDesktop, PinWindow, getCount, getDesktopNumOfWindow -> …

AutoHotkey 405 50 Updated Mar 20, 2025

arcee-ai / mergekit

Tools for merging pretrained large language models.

Python 5,460 519 Updated Mar 22, 2025

dvlab-research / LongLoRA

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Python 2,652 282 Updated Aug 14, 2024

turboderp-org / exllamav2

A fast inference library for running LLMs locally on modern consumer-class GPUs

Python 4,056 305 Updated Mar 15, 2025

kaiokendev / alpaca_lora_4bit

Forked from johnsmith0031/alpaca_lora_4bit

Python 8 Updated Jun 2, 2023

s0md3v / ifnude

nudity detector that works

Python 88 18 Updated May 10, 2024

facebookresearch / audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 21,692 2,278 Updated Mar 13, 2025

axolotl-ai-cloud / axolotl

Go ahead and axolotl questions

Python 8,923 982 Updated Mar 22, 2025

eugenepentland / landmark-attention-qlora

Forked from epfml/landmark-attention

Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA

Python 123 6 Updated Jun 16, 2023

Jupyter Notebook 3,264 440 Updated Jun 12, 2024

acpopescu / bitsandbytes

Forked from bitsandbytes-foundation/bitsandbytes

8-bit CUDA functions for PyTorch

Python 44 9 Updated May 29, 2023

grimulkan

Stars

Tencent / HunyuanVideo-I2V

deepseek-ai / open-infra-index

Tencent / HunyuanVideo

vllm-project / vllm

CoinCheung / gdGPT

SparkJiao / llama-pipeline-parallel