Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Risk/AI Alignment/Explainable AI/Interpretable AI/Mechanistic interpretability/AI Circuit/
Circuit Discovery
Search

Circuit Discovery

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2024 Oct 24 11:15
Editor
Editor
Seonglae ChoSeonglae Cho
Edited
Edited
2025 Nov 26 18:25
Refs
Refs
SAE Feature Circuit
auto-circuit
UFO-101 • Updated 2025 Mar 28 5:54
RAG Interpretability

Causal abstraction

Circuit Discovery Methods
Circuit Tracing
AC DC
Sparse Feature Circuit
Feature Cluster Resampling
Attribution Patching
L3D
 
 
Circuit Discovery Usage
Attribution Graph
Circuit Performance Ratio
Circuit-Model Distance
InterpBench
Circuit Stability
 
 
 
 
 
Zoom In: An Introduction to Circuits
By studying the connections between neurons, we can find meaningful algorithms in the weights of neural networks.
https://distill.pub/2020/circuits/zoom-in/
Zoom In: An Introduction to Circuits
curve circuit (2020)
Curve Circuits
Reverse engineering the curve detection algorithm from InceptionV1 and reimplementing it from scratch.
https://distill.pub/2020/circuits/curve-circuits/
 
 

 

Backlinks

Causal abstractionFaithfulness InterpretabilityRAGWeight-sparse Transformers

Recommendations

Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Risk/AI Alignment/Explainable AI/Interpretable AI/Mechanistic interpretability/AI Circuit/
Circuit Discovery
Copyright Seonglae Cho