AWR

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2024 May 1 2:24
Editor
Edited
Edited
2024 May 1 2:33

Advantage-Weighted Regression

Imitating only good transitions based on how good the actions are, with weighting each transition depending how good the action is
notion image
 
Can show that advantage-weighted objective approximates KL-constrained objective.
notion image
 
 
 
 
 
 
 
 

Recommendations