Skip to content

Commit 00dd03d

Browse files
WoosukKwonweilong.yu
authored and
weilong.yu
committed
[V1] Fix non-cudagraph op name (vllm-project#10166)
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
1 parent 93bc3f4 commit 00dd03d

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

vllm/v1/worker/gpu_model_runner.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -411,7 +411,7 @@ def load_model(self) -> None:
411411
set_compilation_config(
412412
CompilationConfig(
413413
use_cudagraph=True,
414-
non_cudagraph_ops=["vllm.unified_flash_attention"],
414+
non_cudagraph_ops=["vllm.unified_v1_flash_attention"],
415415
use_inductor=True,
416416
))
417417

0 commit comments

Comments
 (0)