Clarification for the paper's needle-in-a-haystack section #14

yiyousong · 2025-04-07T03:12:10Z

In the needle-in-a-haystack section of your paper, you mentioned:
"However, linearizing with passkey samples (LoLCATs Llama 3 8B (Passkey)) recovers 100% accuracy."

Does this step involving lora-finetuning with passkey samples? Or only Attention-Transfer with passkey samples?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Clarification for the paper's needle-in-a-haystack section #14

Clarification for the paper's needle-in-a-haystack section #14

yiyousong commented Apr 7, 2025

Clarification for the paper's needle-in-a-haystack section #14

Clarification for the paper's needle-in-a-haystack section #14

Comments

yiyousong commented Apr 7, 2025