Neuronpedia Circuit Tracing

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2025 May 30 12:57
Editor
Edited
Edited
2025 Aug 5 23:55
Refs
Refs
 
 
 

Neuronpedia
Research
circuit-tracer
safety-researchUpdated 2025 Aug 15 3:34

Two-step reasoning (e.g., Dallas→Texas→Austin) actually uses intermediate holes. Language-agnostic reasoning followed by language-specific feature combination. CLT shows better replacement score/sparsity tradeoff compared to PLT, while skip PLT generally offers fewer benefits.
 
 
 

Recommendations