Skip to content

Commit 7fcac78

Browse files
committed
add quantization tests
1 parent fc871a5 commit 7fcac78

File tree

1 file changed

+2
-0
lines changed

1 file changed

+2
-0
lines changed

tests/basic_correctness/test_cpu_offload.py

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -4,3 +4,5 @@
44
def test_cpu_offload():
55
compare_two_settings("meta-llama/Llama-2-7b-hf", [],
66
["--cpu-offload-gb", "4"])
7+
compare_two_settings("nm-testing/llama7b-one-shot-2_4-w4a16-marlin24-t",
8+
[], ["--cpu-offload-gb", "1"])

0 commit comments

Comments
 (0)