[Bug]: CUDA error: unknown error when running vllm serve on WSL2 Ubuntu22.04 #19077
Open
1 task done
Labels
bug
Something isn't working
Your current environment
The output of
python collect_env.py
π Describe the bug
I installed Ubuntu22.04 for WSL2 on Windows10 21H2(19044.3086) and followed the steps in the NVIDA documentation to install the driver.
Then I executed 'vllm serve' (Refer to the output information below for parameters) in python==3.11 version of the conda environment after
pip install vllm==0.7.0
(Tsinghua source) .Then, "CUDA error: unknown error" is thrown before loading module.
The output of run
vllm serve /root/modules/QwQ-32B/ --trust-remote-code --served-model-name qwq --tensor-parallel-size 2 --pipeline-parallel-size 1 --max-model-len 1024 --gpu-memory-utilization 0.95 --max-num-seqs 256 --disable-custom-all-reduce
Before submitting a new issue...
The text was updated successfully, but these errors were encountered: