Skip Transcoder

Creator: Seonglae Cho
Created: 2025 Nov 1 13:16
Edited: 2025 Nov 1 13:18

Transcoders Beat Sparse Autoencoders for Interpretability

  • Compared with SAE features, transcoder feature interpretations are more narrowly distributed and more strongly monosemantic (each feature activates for a single meaning).
  • Sparse probing performance is similar to, or slightly better than, that of an SAE.
A Skip Transcoder can replace an SAE on the residual stream (when an identity skip is added).
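
To make the identity-skip idea concrete, here is a minimal pure-Python sketch of a skip transcoder forward pass. It assumes the common form y = W_dec · f(W_enc · x + b_enc) + W_skip · x + b_dec, with a TopK-style sparsity function for f and W_skip fixed to the identity. The function names, the toy weights, and the TopK choice are illustrative assumptions, not details taken from this note, and the note does not specify which activations serve as input and target.

```python
def matvec(W, x):
    """Multiply matrix W (a list of rows) by vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def top_k_relu(values, k):
    """Keep the k largest entries (clamped at zero) and zero out the rest.

    TopK sparsity is an assumption made for this sketch.
    """
    order = sorted(range(len(values)), key=lambda i: values[i], reverse=True)
    keep = set(order[:k])
    return [max(v, 0.0) if i in keep else 0.0 for i, v in enumerate(values)]


def identity(n):
    """n x n identity matrix, used here as the fixed skip weight."""
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]


def skip_transcoder(x, W_enc, b_enc, W_dec, b_dec, W_skip, k):
    """Return (output, sparse latents) for one input vector x."""
    pre = [p + b for p, b in zip(matvec(W_enc, x), b_enc)]
    z = top_k_relu(pre, k)              # sparse feature activations
    sparse_out = matvec(W_dec, z)       # interpretable sparse path
    skip_out = matvec(W_skip, x)        # linear skip path
    y = [s + l + b for s, l, b in zip(sparse_out, skip_out, b_dec)]
    return y, z


# Toy example: 3-dimensional residual stream, 4 latents (hypothetical weights).
x = [1.0, -2.0, 0.5]
W_enc = [[1.0, 0.0, 0.0],
         [0.0, 1.0, 0.0],
         [0.0, 0.0, 1.0],
         [1.0, 1.0, 1.0]]
b_enc = [0.0, 0.0, 0.0, 0.0]
W_dec = [[0.1, 0.0, 0.0, 0.0],
         [0.0, 0.1, 0.0, 0.0],
         [0.0, 0.0, 0.1, 0.0]]
b_dec = [0.0, 0.0, 0.0]

y, z = skip_transcoder(x, W_enc, b_enc, W_dec, b_dec, identity(3), k=2)
```

With the identity skip, the input passes straight through to the output, so in this sketch the sparse path only has to account for the difference between input and target. If the decoder weights are all zero, the output equals the input.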
 
 
 
