Stars
Understanding R1-Zero-Like Training: A Critical Perspective
Vector (and Scalar) Quantization, in PyTorch
Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhang, Yunzhu Li, Hao Peng, Heng Ji.
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
A course on aligning smol models.
A collection of public Korean instruction datasets for training language models.