Commit 3d82dbc

ggml : block interleaving support for Q4_K quantization for x86 AVX2 architecture (ggml-org#12332)

* Add block interleaving support for Q4_K quantization
* Remove whitespaces and fix CI/CD issues
* Update pointer of bsums from int16_t to const int16_t
* Add vector version of quantize_q8_K_4x8 function
* Update code formatting based on review comments

1 parent 732b5fb, commit 3d82dbc

1 file changed, +1493 -12 lines changed

ggml/src/ggml-cpu/ggml-cpu-aarch64.cpp

