AI Safety

Creator
Seonglae Cho
Created
2023 Jun 13 11:43
Edited
2025 Mar 8 15:45

The (induced) incentive is key to safety.

Risks include generating illicit advice, choosing stereotyped responses, and succumbing to known jailbreaks.

Communities & Forums

  • OpenAI
AI Safety Notion
 
 
 

Problem statements

 
 

 

Recommendations