AI Alignment

Creator
Creator
Seonglae Cho
Created
Created
2020 Aug 23 9:36
Editor
Edited
Edited
2025 May 25 17:10

Alignment Problem

A Maximally Curious AI Would Not Be Safe For Humanity while I don’t think so
Alignment must occur faster than the model's capabilities grow. Also, Aligned doesn’t mean perfect (Controllability, reliability). We will need another neural network to observe and interpret the internal workings of neural networks.
AI Alignment is Alignment between taught behaviors and actual behaviors. AI is aligned with an operator - AI is trying to do what operator wants to do.
 
 
 
 
 
 

What is AI alignment

AI Control names

 
 

Recommendations