OMD

Creator
Creator
Seonglae Cho
Created
Created
2025 Apr 6 17:48
Editor
Edited
Edited
2025 Apr 6 17:49
Refs

Optimal Model Design

notion image
OMD optimizes expected rewards by directly updating model parameters, where the Q-function is implicitly differentiated with respect to model parameters through Implicit Differentiation
 
 
 
 
 
 

Recommendations