Agent Interpretability

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2025 Nov 1 16:11
Editor
Edited
Edited
2026 Mar 6 15:50

Action Interpretability, Decision Interpretability

Agent Interpretability Types
 
 
 
 
 

RNN transition model Interpretability RL

We confirmed that mechanisms very similar to the main components seen in classical search algorithms (plan/search) exist inside the RNN: plan representation, state transition model, and value function.
arxiv.org
 
 
 

Recommendations