Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Problem/AI Alignment/Explainable AI/Interpretable AI/Mechanistic interpretability/AI Circuit/Circuit Discovery/
Attribution Patching
Search

Attribution Patching

Creator
Creator
Seonglae Cho
Created
Created
2025 Apr 14 16:55
Editor
Editor
Seonglae Cho
Edited
Edited
2025 Apr 14 16:57
Refs
Refs
Activation Patching
 
 
 
 

Edge Attribution Patching (EAP)

as an approximation to
Activation Patching
aclanthology.org
https://aclanthology.org/2024.blackboxnlp-1.25.pdf
 
 
 

Recommendations

Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Problem/AI Alignment/Explainable AI/Interpretable AI/Mechanistic interpretability/AI Circuit/Circuit Discovery/
Attribution Patching
Copyright Seonglae Cho