AI Alignment

Creator
Creator
Seonglae Cho
Created
Created
2020 Aug 23 9:36
Editor
Edited
Edited
2025 Apr 22 1:42

Alignment Problem

A Maximally Curious AI Would Not Be Safe For Humanity while I don’t think so
Alignment must occur faster than the model's capabilities grow. Also, Aligned doesn’t mean perfect (Controllability, reliability). We will need another neural network to observe and interpret the internal workings of neural networks.
AI Alignment is Alignment between taught behaviors and actual behaviors. AI is aligned with an operator - AI is trying to do what operator wants to do.
AI Alignment Notion
 
 
AI Alignment Externals
 
 
 
 

What is AI alignment

AI Control names

 
 

Recommendations