Start vLLM serving with dummy weights
# note: set `load-format=dummy` for a lightweight test; we don't need to download real weights
export HF_ENDPOINT="https://hf-mirror.com"
python3 -m vllm.entrypoints.openai.api_server --model Qwen/Qwen2.5-7B-Instruct --tensor-parallel-size 1 --swap-space 16 --disable-log-stats --disable-log-requests --load-format dummy
Install necessary dependencies
pip config set global.index-url https://pypi.tuna.tsinghua.edu.cn/simple
pip install pandas datasets
Run benchmark for online serving
Wait until the vLLM server is ready before starting the benchmark
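Waiting for readiness can be automated with a small poll-until-ready helper. This is a sketch, assuming the server exposes the OpenAI-compatible `/v1/models` endpoint on the default port 8000; the `wait_until_ready` and `http_probe` names are illustrative, not vLLM APIs.

```python
import time
import urllib.request


def wait_until_ready(probe, timeout_s=300.0, interval_s=5.0):
    """Call probe() repeatedly until it returns True or timeout_s elapses."""
    deadline = time.monotonic() + timeout_s
    while True:
        if probe():
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval_s)


def http_probe(url="http://localhost:8000/v1/models"):
    """Return True once the OpenAI-compatible endpoint answers (default port assumed)."""
    try:
        with urllib.request.urlopen(url, timeout=5) as resp:
            return resp.status == 200
    except OSError:
        return False
```

With the server from the previous step running, `wait_until_ready(http_probe)` blocks until it is reachable (or times out), so the benchmark can be launched right after it returns.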
cd /vllm-workspace/vllm/benchmarks
python benchmark_serving.py --model Qwen/Qwen2.5-7B-Instruct --dataset-name random --random-input-len 200 --num-prompts 200 --request-rate 1 --save-result --result-dir ./
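With `--save-result`, the run writes a JSON file of metrics into `--result-dir`. Below is a minimal sketch for pulling out the headline numbers; the metric names (`mean_ttft_ms`, etc.) are assumptions based on recent `benchmark_serving.py` output and may differ across vLLM versions, so check the file you actually get.

```python
import json


def summarize(result):
    """Pick headline serving metrics from a benchmark_serving result dict.

    The key names are assumptions; keys absent from the file are skipped.
    """
    keys = (
        "request_throughput",  # requests per second
        "output_throughput",   # output tokens per second
        "mean_ttft_ms",        # mean time to first token
        "mean_tpot_ms",        # mean time per output token
    )
    return {k: result[k] for k in keys if k in result}


# Usage with a saved result (file name is illustrative):
# with open("result.json") as f:
#     print(summarize(json.load(f)))
```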
Yikun changed the title from "[Usage]: How to quickly run a perf benchmark to determine if performance has improved" to "[Guide]: How to quickly run a perf benchmark to determine if performance has improved" on May 15, 2025.
Your current environment
None
How would you like to use vLLM on Ascend?
Assuming you are using vllm-ascend v0.7.3 and want to know whether a tuning strategy makes sense, the steps above may serve as a reference.
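To decide whether performance actually improved, run the benchmark once on the baseline and once on the tuned build, then compare the two saved JSON result files. A sketch under the assumption that both files use the same metric names; `compare` and its default key list are illustrative.

```python
def compare(baseline, candidate,
            keys=("mean_ttft_ms", "mean_tpot_ms", "request_throughput")):
    """Return the relative change of each metric, candidate vs. baseline.

    Metric names are assumptions based on benchmark_serving.py output;
    adjust them to match the JSON files your runs produced. Metrics
    missing from either file, or zero in the baseline, are skipped.
    """
    deltas = {}
    for k in keys:
        if k in baseline and k in candidate and baseline[k]:
            deltas[k] = (candidate[k] - baseline[k]) / baseline[k]
    return deltas
```

For the latency metrics (TTFT, TPOT) a negative delta is an improvement; for throughput, a positive one.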