Matryoshka SAE

Creator

Creator

Seonglae Cho

Created

Created

2025 Feb 6 14:57

Editor

Editor

Seonglae Cho

Edited

Edited

2025 Jul 4 22:47

Refs

Refs

notion image

Matryoshka Sparse Autoencoders — LessWrong

View trees here Search through latents with a token-regex language View individual latents here See code here (github.com/noanabeshima/matryoshka-sae…

Matryoshka Sparse Autoencoders — LessWrong

https://www.lesswrong.com/posts/zbebxYCqsryPALh8C/matryoshka-sparse-autoencoders

Matryoshka Sparse Autoencoders — LessWrong

notion image

Learning Multi-Level Features with Matryoshka SAEs — LessWrong

TL;DR: Matryoshka SAEs are a new variant of sparse autoencoders that learn features at multiple levels of abstraction by splitting the dictionary int…

Learning Multi-Level Features with Matryoshka SAEs — LessWrong

https://www.lesswrong.com/posts/rKM9b6B2LqwSB5ToN/learning-multi-level-features-with-matryoshka-saes

Learning Multi-Level Features with Matryoshka SAEs — LessWrong

Recommendations

////////////