Transcoder

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2025 Apr 6 17:39
Editor
Edited
Edited
2025 Nov 1 13:18
Refs

A neural network component that projects input to a sparse dimensional space and reconstructs the output

Per-layer transcoder
Transcoders
 
 
 

Transcoder

Transcoders Beat Sparse Autoencoders for Interpretability

  • Narrower interpretation distribution and stronger monosemantic (single-meaning feature activation) characteristics.
  • Sparse Probing performance similar to or slightly better than SAE.
Skip Transcoder
can replace SAE for Residual Stream (when Identity skip is added).
 

Recommendations