Skip to content

Commit 8dd0671

Browse files
jinhuang12Jin Huang
and
Jin Huang
authored
[Bugfix][V1] Only get input embeddings w/ multi-modal models if first PP (#17916)
Signed-off-by: Jin Huang <jinhun@amazon.com> Co-authored-by: Jin Huang <jinhun@amazon.com>
1 parent f0d610a commit 8dd0671

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

vllm/v1/worker/gpu_model_runner.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1107,7 +1107,7 @@ def execute_model(
11071107
else:
11081108
mm_embeds = []
11091109

1110-
if self.is_multimodal_model:
1110+
if self.is_multimodal_model and get_pp_group().is_first_rank:
11111111
# NOTE(woosuk): To unify token ids and soft tokens (vision
11121112
# embeddings), we always use embeddings (rather than token ids)
11131113
# as input to the multimodal model, even when the input is text.

0 commit comments

Comments
 (0)