stanley-fork / llama3_interpretability_sae Public

forked from PaulPauls/llama3_interpretability_sae

Notifications You must be signed in to change notification settings
Fork 0
Star 1

A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and fully reproducible.

1 star 36 forks Branches Tags Activity

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
README.md		README.md

Repository files navigation

Llama 3 Interpretability with Sparse Autoencoders

This project is currently taken down. My apologies.

About

A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and fully reproducible.

Custom properties

Report repository

Releases

No releases published

Packages

No packages published

Languages

Python 100.0%