[WIP] [Transform] Compress, decompress #333

kylesayrs · 2025-05-31T04:46:28Z

idea: submodule structure handles most serialization for us
let's not couple apply_transform_config with apply_quantization_config, otherwise we'd have potential conflicts with the QuantizationMixin

somehow, we need to allow the model_compressor to know the q_config and t_config. In the case of q_config, it's actually built on the fly. That kinda works for q_config, since all the schemes are present (although you lose config_group names). That wouldn't directly work for t_config, since the schemes are still transparent.

A simple solution would be to move towards a pattern where q_config (and as a subfield, t_config) are attached as an attribute to the mode directly, then grabbed by model_compressor. This seems to make sense, I don't see many downsides

Need to decide if we want to keep the weight submodules in the compressed state. The issue is that, without saving them, then there's no way to go from compressed to decompressed. However, saving them requires extra storage and vllm has to ignore those weights

Let's not keep weight transforms, except when trainable. During decompression, let's add activation hooks (these will need to be added by quantization anyways)

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

…orm_status

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

wip: compression

70c2dfe

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

kylesayrs changed the base branch from main to kylesayrs/transform_permutations May 31, 2025 04:46

kylesayrs changed the title ~~[Transform] Apply, serialize, deserialize~~ [WIP] [Transform] Apply, serialize, deserialize May 31, 2025

Merge branch 'kylesayrs/transform_permutations' into kylesayrs/transf…

9f7d298

…orm_status

kylesayrs changed the title ~~[WIP] [Transform] Apply, serialize, deserialize~~ [WIP] [Transform] Apply, compress, decompress May 31, 2025

kylesayrs changed the title ~~[WIP] [Transform] Apply, compress, decompress~~ [WIP] [Transform] Compress, decompress May 31, 2025

kylesayrs added 4 commits June 10, 2025 16:52

Merge branch 'kylesayrs/transform_permutations' into kylesayrs/transf…

e4e3cdc

…orm_status

test compression decompression

78dce63

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

wip update tests

b0b82a1

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

fix weight transform offloading

62fd754

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[WIP] [Transform] Compress, decompress #333

[WIP] [Transform] Compress, decompress #333

Uh oh!

kylesayrs commented May 31, 2025 •

edited

Loading

Uh oh!

Uh oh!

[WIP] [Transform] Compress, decompress #333

Are you sure you want to change the base?

[WIP] [Transform] Compress, decompress #333

Uh oh!

Conversation

kylesayrs commented May 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

kylesayrs commented May 31, 2025 •

edited

Loading