GPT2 SAE

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2024 Oct 31 9:45
Editor
Edited
Edited
2025 Aug 10 22:9
Refs
Refs
GPT 2

Jbloo GPT2 SAE with
Ghost Gradient

notion image
 
 
Open Source Sparse Autoencoders for all Residual Stream Layers of GPT2-Small — LessWrong
Browse these SAE Features on Neuronpedia!  …
Open Source Sparse Autoencoders for all Residual Stream Layers of GPT2-Small — LessWrong
jbloom/GPT2-Small-SAEs-Reformatted at main
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
jbloom/GPT2-Small-SAEs-Reformatted at main

Distributed training

 
 
 

Attention SAEs for TGPT2

Attention SAEs Scale to GPT-2 Small — LessWrong
This is an interim report that we are currently building on. We hope this update + open sourcing our SAEs will be useful to related research occurrin…
Attention SAEs Scale to GPT-2 Small — LessWrong
ckkissane/attn-saes-gpt2-small-all-layers · Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
ckkissane/attn-saes-gpt2-small-all-layers · Hugging Face

Induction head
of GPT2

We Inspected Every Head In GPT-2 Small using SAEs So You Don’t Have To — LessWrong
This is an interim report that we are currently building on. We hope this update will be useful to related research occurring in parallel. Produced a…
We Inspected Every Head In GPT-2 Small using SAEs So You Don’t Have To — LessWrong
 
 
 

Recommendations