Skip to content

Commit bb423d7

Browse files
WoosukKwonJC1DA
authored andcommitted
[V1] Fix non-cudagraph op name (vllm-project#10166)
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu> Signed-off-by: Loc Huynh <jc1da.3011@gmail.com>
1 parent 47a01a9 commit bb423d7

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

vllm/v1/worker/gpu_model_runner.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -411,7 +411,7 @@ def load_model(self) -> None:
411411
set_compilation_config(
412412
CompilationConfig(
413413
use_cudagraph=True,
414-
non_cudagraph_ops=["vllm.unified_flash_attention"],
414+
non_cudagraph_ops=["vllm.unified_v1_flash_attention"],
415415
use_inductor=True,
416416
))
417417

0 commit comments

Comments
 (0)