
Embedding SAE

Creator: Seonglae Cho
Created: 2025 Feb 11 1:46
Editor: Seonglae Cho
Edited: 2025 Dec 18 16:25
CompresSAE (recombee), for recommender systems
https://arxiv.org/pdf/2505.11388

For the black-box text-embedding-3-small embedding model
https://arxiv.org/pdf/2408.00657v1
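
As a rough illustration of what an SAE over a black-box embedding model looks like, the sketch below runs a sparse autoencoder forward pass on precomputed embedding vectors, with no access to the embedding model's internals. The top-k activation, the dictionary size, the random weights, and the function name `topk_sae_forward` are all assumptions made for illustration and are not taken from the linked paper; 1536 is the default output size of text-embedding-3-small, and random unit vectors stand in for real embeddings.

```python
import numpy as np

def topk_sae_forward(x, W_enc, b_enc, W_dec, b_dec, k):
    """Encode a batch of embeddings into sparse codes, then reconstruct them."""
    # Encoder: affine map followed by ReLU.
    pre = np.maximum((x - b_dec) @ W_enc + b_enc, 0.0)
    # Keep only the k largest activations in each row, zero out the rest.
    idx = np.argpartition(pre, -k, axis=1)[:, -k:]
    z = np.zeros_like(pre)
    np.put_along_axis(z, idx, np.take_along_axis(pre, idx, axis=1), axis=1)
    # Decoder: linear reconstruction from the sparse code.
    x_hat = z @ W_dec + b_dec
    return z, x_hat

rng = np.random.default_rng(0)
d_model, d_sae, k = 1536, 4096, 32  # d_sae and k are hypothetical choices

# Stand-in for embeddings returned by the black-box model (unit-normalized).
x = rng.normal(size=(4, d_model)).astype(np.float32)
x /= np.linalg.norm(x, axis=1, keepdims=True)

# Untrained, randomly initialized weights; a real SAE would learn these.
W_enc = rng.normal(scale=d_model ** -0.5, size=(d_model, d_sae)).astype(np.float32)
W_dec = W_enc.T.copy()
b_enc = np.zeros(d_sae, dtype=np.float32)
b_dec = np.zeros(d_model, dtype=np.float32)

z, x_hat = topk_sae_forward(x, W_enc, b_enc, W_dec, b_dec, k)
```

Each row of `z` has at most `k` non-zero entries, and those active entries are the candidate interpretable features.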
SAE Probing for label prediction. Using the raw embeddings alone gives higher predictive performance, but the SAE provides interpretable NLP features, which is a notable advantage.
https://arxiv.org/pdf/2502.04382
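
A minimal sketch of the probing idea, assuming a plain logistic-regression probe trained on sparse feature activations. The data is synthetic (the label is defined by whether hypothetical feature 3 is active), and `train_logistic_probe` is an illustrative helper rather than the method of the linked paper. The point is that the probe's largest weight identifies a single feature that can then be inspected and named.

```python
import numpy as np

def train_logistic_probe(Z, y, lr=0.2, steps=2000):
    """Fit a logistic-regression probe on features Z with gradient descent."""
    n, m = Z.shape
    w, b = np.zeros(m), 0.0
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-(Z @ w + b)))  # predicted probabilities
        grad = p - y                            # gradient of the log loss w.r.t. logits
        w -= lr * (Z.T @ grad) / n
        b -= lr * grad.mean()
    return w, b

rng = np.random.default_rng(0)
n, m = 400, 20

# Synthetic sparse, non-negative activations: each feature fires ~30% of the time.
active = rng.random((n, m)) < 0.3
Z = active * (1.0 + np.abs(rng.normal(size=(n, m))))

# Synthetic label: positive exactly when (hypothetical) feature 3 is active.
y = active[:, 3].astype(float)

w, b = train_logistic_probe(Z, y)
accuracy = float((((Z @ w + b) > 0) == (y == 1)).mean())
top_feature = int(np.argmax(np.abs(w)))  # feature the probe relies on most
```

With real SAE features, `top_feature` would index a dictionary entry whose top-activating texts can be read directly, which is the interpretability benefit the note refers to.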
Token embedding → text embedding by pooling
https://www.arxiv.org/pdf/2506.04373
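
The note only says "pooling"; the sketch below assumes masked mean pooling, one common way to turn per-token embeddings into a single text embedding. The function name `mean_pool` and the toy inputs are illustrative.

```python
import numpy as np

def mean_pool(token_embeddings, attention_mask):
    """Average token embeddings over real (non-padding) tokens.

    token_embeddings: array of shape (batch, seq_len, dim)
    attention_mask:   array of shape (batch, seq_len), 1 = real token, 0 = padding
    """
    mask = attention_mask[..., None].astype(token_embeddings.dtype)
    summed = (token_embeddings * mask).sum(axis=1)
    counts = np.clip(mask.sum(axis=1), 1e-9, None)  # avoid division by zero
    return summed / counts

# One text with two real tokens and one padding token.
tokens = np.array([[[1.0, 2.0], [3.0, 4.0], [100.0, 100.0]]])
mask = np.array([[1, 1, 0]])
text_embedding = mean_pool(tokens, mask)
```

The padding token is ignored, so the pooled vector is the average of the first two token embeddings only.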
 
 

Copyright Seonglae Cho