
Add LoRA & Multi-LoRA support for V0.7.3 dev by Cherry Pick #700

Merged
2 commits merged into vllm-project:v0.7.3-dev on Apr 28, 2025

Conversation

ZhengJun9

What this PR does / why we need it?

According to this RFC (vllm-project#396) and the vLLM Ascend Roadmap Q2 2025 (vllm-project#448), this PR adds the relevant code to support (1) Multi-LoRA and (2) Multi-LoRA Dynamic Serving.
LoRA reference: https://docs.vllm.ai/en/latest/features/lora.html

Does this PR introduce any user-facing change?

The following OpenAI-compatible HTTP APIs will be supported (see the sketch after the list):
/v1/load_lora_adapter
/v1/unload_lora_adapter

How was this patch tested?

git clone https://github.com/vllm-project/vllm.git
cd vllm/examples/offline_inference/ && python3 multilora_inference.py

Linked issue: [Release]: vLLM Ascend v0.7.3 release checklist

paulyu12 and others added 2 commits April 28, 2025 10:05
…(vllm-project#521)

### What this PR does / why we need it?
According to this RFC ([RFC]: Join the MultiLora and MultiLora Dynammic Serving feature develop, vllm-project#396) and the vLLM Ascend Roadmap Q2 2025 (vllm-project#448), this PR adds the relevant code to support (1) Multi-LoRA and (2) Multi-LoRA Dynamic Serving.

LoRA reference: [LoRA reference](https://docs.vllm.ai/en/latest/features/lora.html)

### Does this PR introduce _any_ user-facing change?

The following OpenAI-compatible HTTP APIs will be supported:
/v1/load_lora_adapter
/v1/unload_lora_adapter

### How was this patch tested?
git clone https://github.com/vllm-project/vllm.git
cd vllm/examples/offline_inference/ && python3 multilora_inference.py

---------

Signed-off-by: paulyu <paulyu0307@gmail.com>
Co-authored-by: paulyu <paulyu0307@gmail.com>

### What this PR does / why we need it?
Fix the import error mentioned in vllm-project#592.

Signed-off-by: paulyu <paulyu0307@gmail.com>
Co-authored-by: paulyu <paulyu0307@gmail.com>
@wangxiyuan
Collaborator

LGTM, it's a cherry-pick from main. The code is very clear.

@ganyi1996ppo ganyi1996ppo merged commit 1791113 into vllm-project:v0.7.3-dev Apr 28, 2025
11 checks passed