Control RL Presentation

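Ghibli-style characters are controversial right now. Setting aside whether that is right or wrong, if a Ghibli feature exists, explicit interpretability makes regulation simple: block that feature. In other words, safety has historically been achieved through explicit rules, which is why interpretability matters.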
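As a rough illustration of what "blocking a feature" could mean in practice, the sketch below removes one feature direction from a hidden-state vector by projecting it out (directional ablation). It assumes an interpretability method has already identified a direction for the Ghibli feature. The direction, the example vectors, and the `block_feature` helper are all hypothetical and are not taken from the presentation.

```python
# Hypothetical sketch: block one interpretable feature by projecting it out
# of a hidden-state vector. All names and numbers here are illustrative.

def dot(u, v):
    """Plain dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))

def block_feature(hidden, direction):
    """Remove the component of `hidden` that lies along `direction`."""
    norm_sq = dot(direction, direction)
    if norm_sq == 0.0:
        raise ValueError("feature direction must be non-zero")
    coeff = dot(hidden, direction) / norm_sq
    return [h - coeff * d for h, d in zip(hidden, direction)]

# Assumed: an interpretability method has already identified this direction.
ghibli_direction = [0.0, 1.0, 0.0, 0.0]
hidden_state = [0.5, 2.0, -1.0, 0.25]

blocked = block_feature(hidden_state, ghibli_direction)

# The explicit rule: the blocked state has no activation along the feature.
feature_activation = dot(blocked, ghibli_direction)
```

The point of the sketch is that the rule is explicit and checkable: after blocking, the activation along the named feature is zero, while the other components of the hidden state are left unchanged.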

LLM - Human Brain

Consciousness (Working Memory) - Context Window (Tokens)

Unconsciousness - Hidden Dimension

(We usually increase test-time compute on the consciousness side; how about controlling the unconsciousness?)

