Skip to content

Commit 76f491f

Browse files
Isotr0pygarg-amit
authored andcommitted
[Hardware][CPU] Cross-attention and Encoder-Decoder models support on CPU backend (vllm-project#9089)
Signed-off-by: Amit Garg <mitgarg17495@gmail.com>
1 parent 02f3c8d commit 76f491f

File tree

6 files changed

+834
-287
lines changed

6 files changed

+834
-287
lines changed

.buildkite/run-cpu-test.sh

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -23,6 +23,7 @@ docker exec cpu-test-avx2 bash -c "python3 examples/offline_inference.py"
2323
# Run basic model test
2424
docker exec cpu-test bash -c "
2525
pip install pytest matplotlib einops transformers_stream_generator datamodel_code_generator
26+
pytest -v -s tests/models/encoder_decoder/language
2627
pytest -v -s tests/models/decoder_only/language \
2728
--ignore=tests/models/test_fp8.py \
2829
--ignore=tests/models/decoder_only/language/test_jamba.py \

0 commit comments

Comments
 (0)