The decomposition should identify a set of components that sum to the parameters of the original network aclanthology.orghttps://aclanthology.org/2022.acl-long.345.pdfComprehensiveness & Plausibilityaclanthology.orghttps://aclanthology.org/2020.acl-main.408.pdfCircuit Discovery Transformer Circuit Evaluation Metrics Are Not RobustMechanistic interpretability work attempts to reverse engineer the learned algorithms present inside neural networks. One focus of this work has been to discover 'circuits' - subgraphs of the full...https://openreview.net/forum?id=zSf8PJyQb2#discussion