Commit e90bf8b

Update README.md
1 parent 3aabad2 commit e90bf8b

1 file changed: 6 additions, 0 deletions

README.md

@@ -2,6 +2,12 @@
 
 ExLlamaV2 is an inference library for running local LLMs on modern consumer GPUs.
 
+The official and recommended backend server for ExLlamaV2 is [TabbyAPI](https://github.com/theroyallab/tabbyAPI/),
+which provides an OpenAI-compatible API for local or remote inference, with extended features like HF model
+downloading, embedding model support and support for HF Jinja2 chat templates.
+
+See the [wiki](https://github.com/theroyallab/tabbyAPI/wiki/1.-Getting-Started) for help getting started.
+
 
 ## New in v0.1.0+:
 
