
Anthropic Circuit Thread

Creator: Seonglae Cho
Created: 2025 Feb 4 15:27
Editor: Seonglae Cho
Edited: 2025 Jul 12 21:44

Alignment Science Blog
We are Anthropic's Alignment Science team. We do machine learning research on the problem of steering and controlling future powerful AI systems, as well as understanding and evaluating the risks that they pose. Welcome to our blog!
https://alignment.anthropic.com/

Transformer Circuits Thread
Can we reverse engineer transformer language models into human-understandable computer programs?
https://transformer-circuits.pub/

Backlinks

Academic Paper Template

Copyright Seonglae Cho