Change the repository type filter
All
Repositories list
6 repositories
sglang
Public- Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton.
auto-evolution
Publictriton
Publicvllm
PublicUp to 4x faster decoding than vLLM using HiP Attention: https://github.com/DeepAuto-AI/hip-attention