
[WIP] Fix quantization for adapter v2 #314

Closed · wants to merge 1 commit

Conversation

@rasbt (Contributor) commented May 22, 2023

Arg, I noticed that llm.int8() quantization breaks for adapter-v2-finetuned models, failing with the following error:

    F.linear(input, self.weight, self.bias) + self.adapter_bias
    RuntimeError: expected mat1 and mat2 to have the same dtype, but got: c10::BFloat16 != signed char

I tried manually setting the dtype of the new v2 parameters (see this PR), but I still get the same issue. Is there perhaps something I should be doing with

    with fabric.device:
        torch.set_default_tensor_type(...)
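
For reference, here is a hedged reconstruction of the code path from the traceback above (only adapter_bias appears in the error; the function name and everything else is an assumption for illustration). It suggests why setting the dtype of the new v2 parameters alone may not fix the crash: with llm.int8() the layer's weight is stored as int8, so the bfloat16-vs-int8 mismatch happens inside F.linear itself, before adapter_bias is ever involved.

    import torch
    import torch.nn.functional as F

    # Hypothetical sketch of the patched adapter-v2 forward that fails above;
    # only `adapter_bias` is confirmed by the traceback, the rest is illustrative.
    def adapter_v2_linear_forward(self, input: torch.Tensor) -> torch.Tensor:
        # With llm.int8(), `self.weight` is stored as int8 (signed char), so
        # calling F.linear directly mixes a bfloat16 activation with an int8
        # weight and raises the RuntimeError shown above. Changing the dtype
        # of `adapter_bias` does not help, because the failure occurs inside
        # F.linear, before the bias is added.
        return F.linear(input, self.weight, self.bias) + self.adapter_bias

One possible direction (not necessarily what #323 does) would be to route the matmul through the quantized layer's own forward, which knows how to handle the int8 weight, and only add the adapter parameters, cast to the activation dtype, on top of its output.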

ArturK-85 referenced this pull request in ArturK-85/lit-parrot on May 23, 2023
@awaelchli (Contributor) commented

@rasbt This might be the fix we were looking for: #323 :)

@rasbt (Contributor, Author) commented May 25, 2023

Oh yes, this may be it! Will test it out and continue the discussion in the other PR!

@rasbt (Contributor, Author) commented Jun 2, 2023

We can probably close this because of #323
