BatchTopK SAE

Creator
Seonglae Cho
Created
2025 Jan 20 15:5
Editor
Seonglae Cho
Edited
2025 Oct 29 23:57
Refs
iid
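Instead of selecting the top k activations for each individual sample, we select the top n × k activations across the entire batch of n samples. The number of active latents can therefore vary from sample to sample, while the average over the batch stays at k per sample.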
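The batch-level selection can be illustrated with a minimal sketch in plain Python. This is not the authors' implementation: the function names `batch_topk` and `per_sample_topk`, the list-of-lists layout, and the toy activation values are all assumptions made for illustration, and the encoder and decoder of the SAE are left out.

```python
from typing import List


def batch_topk(acts: List[List[float]], k: int) -> List[List[float]]:
    """Keep the n * k largest activations across a batch of n samples; zero the rest.

    Hypothetical helper for illustration, not taken from the paper's code.
    """
    n = len(acts)
    # Flatten the batch into (value, sample index, latent index) triples.
    flat = [(v, i, j) for i, row in enumerate(acts) for j, v in enumerate(row)]
    # Sort by activation value, largest first.
    flat.sort(key=lambda t: t[0], reverse=True)
    out = [[0.0] * len(row) for row in acts]
    # Copy back only the n * k largest values, wherever in the batch they occur.
    for v, i, j in flat[: n * k]:
        out[i][j] = v
    return out


def per_sample_topk(acts: List[List[float]], k: int) -> List[List[float]]:
    """Ordinary TopK for comparison: each sample keeps exactly its own top k."""
    return [batch_topk([row], k)[0] for row in acts]


# Toy batch (assumed values): n = 2 samples, 4 latents each, k = 2.
acts = [
    [0.9, 0.8, 0.7, 0.1],
    [0.2, 0.05, 0.0, 0.3],
]
batch_out = batch_topk(acts, k=2)
sample_out = per_sample_topk(acts, k=2)
# Count how many latents stay active in each sample under batch-level selection.
active_per_sample = [sum(1 for v in row if v != 0.0) for row in batch_out]
print(batch_out)
print(sample_out)
print(active_per_sample)
```

In this toy batch the first sample keeps three latents and the second keeps one, so the batch still averages k = 2 per sample, whereas per-sample TopK forces exactly two in each. A tensor-based version would more likely flatten the batch and call a top-k routine once; that is a design choice this sketch does not cover.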
 
 
 
BatchTopK
arxiv.org
https://arxiv.org/pdf/2412.06410
openreview.net
https://openreview.net/pdf?id=9ca9eHNrdH
 
 

Copyright Seonglae Cho