AI Neural Circuit history

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2024 Oct 14 11:19
Editor
Edited
Edited
2024 Oct 14 11:22
Refs
Refs
 
 
An Introduction to Circuits (OpenAI 2020) Chris Olah
An Introduction to Circuits (OpenAI 2020) Chris Olah
OpenAI claims
Superposition
,
AI Neural Circuit
in above paper with restating
Universality Hypothesis
 
A Mathematical Framework for Transformer Circuits (Anthropic 2021) Nelson Elhage
A Mathematical Framework for Transformer Circuits (Anthropic 2021) Nelson Elhage
 
 
Toward Transparent AI (2022 July) Tilman Rauker
Toward Transparent AI (2022 July) Tilman Rauker
 
INTERPRETABILITY IN THE WILD (2022 Nov) Kevin Wang
INTERPRETABILITY IN THE WILD (2022 Nov) Kevin Wang
  1. Identify all previous names in the sentence (Mary, John, John).
  1. Remove all names that are duplicated (in the example above: John).
  1. Output the remaining name (Mary).
 
 
 

GPT2 circuit analysis

Anthropic

OpenAI

One-layer skip trigram

 
 
 

Recommendations