BatchTopK SAE

Creator

Seonglae Cho

Created

2025 Jan 20 15:5

Editor

Seonglae Cho

Edited

2025 Oct 29 23:57

Refs

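Instead of selecting the top k activations for each individual sample, BatchTopK selects the top n × k activations across the entire batch of n samples. Individual samples can therefore have more or fewer than k active latents, while the batch still averages k per sample.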
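The difference between the two selection rules can be illustrated with a minimal pure-Python sketch. It is not the reference implementation from the linked paper: the function names `batch_topk` and `per_sample_topk`, the toy activation values, and the tie-breaking by position are all assumptions made for illustration, and activations are assumed to be non-negative.

```python
from typing import List


def batch_topk(acts: List[List[float]], k: int) -> List[List[float]]:
    """Keep the n * k largest activations across the whole batch; zero the rest."""
    n = len(acts)
    budget = n * k  # total number of activations allowed for the batch
    # Flatten to (value, row, col) so the ranking covers the entire batch.
    flat = [(v, i, j) for i, row in enumerate(acts) for j, v in enumerate(row)]
    # Largest values first; ties broken by position (an arbitrary choice here).
    flat.sort(key=lambda t: (-t[0], t[1], t[2]))
    out = [[0.0] * len(row) for row in acts]
    for v, i, j in flat[:budget]:
        out[i][j] = v
    return out


def per_sample_topk(acts: List[List[float]], k: int) -> List[List[float]]:
    """Baseline for comparison: keep the k largest activations in each row."""
    out = []
    for row in acts:
        keep = set(sorted(range(len(row)), key=lambda j: (-row[j], j))[:k])
        out.append([v if j in keep else 0.0 for j, v in enumerate(row)])
    return out


# Toy batch of n = 2 samples with 4 latents each (made-up values).
acts = [
    [0.9, 0.8, 0.7, 0.1],
    [0.2, 0.1, 0.0, 0.0],
]
sparse = batch_topk(acts, k=2)
# Number of non-zero latents kept for each sample.
active_per_sample = [sum(1 for v in row if v != 0.0) for row in sparse]
print(sparse)
print(active_per_sample)
```

With k = 2 the batch budget is 4 activations. In this toy batch the first sample keeps three latents and the second keeps one, whereas `per_sample_topk` would force exactly two in each row.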

BatchTopK

https://arxiv.org/pdf/2412.06410

https://openreview.net/pdf?id=9ca9eHNrdH
