Matryoshka SAE

Creator
Seonglae Cho
Created
2025 Feb 6 14:57
Editor
Seonglae Cho
Edited
2025 Feb 6 15:26
Refs
Matryoshka Sparse Autoencoders — LessWrong
View trees here. Search through latents with a token-regex language. View individual latents here. See code here (github.com/noanabeshima/matryoshka-sae…
https://www.lesswrong.com/posts/zbebxYCqsryPALh8C/matryoshka-sparse-autoencoders
Learning Multi-Level Features with Matryoshka SAEs — LessWrong
TL;DR: Matryoshka SAEs are a new variant of sparse autoencoders that learn features at multiple levels of abstraction by splitting the dictionary int…
https://www.lesswrong.com/posts/rKM9b6B2LqwSB5ToN/learning-multi-level-features-with-matryoshka-saes

Copyright Seonglae Cho