Skip to content

Commit e1684a7

Browse files
authored
[Bugfix] Fix hard-coded value of x in context_attention_fwd (#6373)
Signed-off-by: Thomas Parnell <tpa@zurich.ibm.com>
1 parent a27f87d commit e1684a7

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

vllm/attention/ops/prefix_prefill.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -718,7 +718,7 @@ def context_attention_fwd(q,
718718
b_ctx_len,
719719
alibi_slopes,
720720
v_cache.shape[3],
721-
8,
721+
k_cache.shape[4],
722722
o,
723723
b_loc.stride(0),
724724
b_loc.stride(1),
@@ -768,7 +768,7 @@ def context_attention_fwd(q,
768768
b_seq_len,
769769
b_ctx_len,
770770
v_cache.shape[3],
771-
8,
771+
k_cache.shape[4],
772772
o,
773773
b_loc.stride(0),
774774
b_loc.stride(1),

0 commit comments

Comments
 (0)