AWR

Creator

Created

2024 May 1 2:24

Editor

Edited

2024 May 1 2:33

Refs

Imitate mainly the good transitions: each transition is weighted by how good its action is (its advantage), so better actions contribute more to the imitation loss.
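A minimal sketch of the weighting idea, assuming the exponential weight exp(A / beta) with a temperature `beta` and a clipping value `max_weight`. The function names and default values here are illustrative assumptions, not taken from these notes.

```python
import numpy as np

def awr_weights(advantages, beta=1.0, max_weight=20.0):
    """Per-transition weights exp(A / beta), clipped for stability.

    beta and max_weight are assumed hyperparameters for this sketch.
    """
    advantages = np.asarray(advantages, dtype=float)
    return np.minimum(np.exp(advantages / beta), max_weight)

def awr_loss(log_probs, advantages, beta=1.0, max_weight=20.0):
    """Advantage-weighted negative log-likelihood over a batch.

    log_probs[i] is log pi(a_i | s_i) under the policy being trained;
    advantages[i] is an estimate of A(s_i, a_i) from the data.
    """
    weights = awr_weights(advantages, beta, max_weight)
    return -np.mean(weights * np.asarray(log_probs, dtype=float))
```

With this weighting, a transition with a higher advantage gets a larger weight, and lowering `beta` makes the policy concentrate more sharply on the best actions in the data.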

One can show that the advantage-weighted objective approximates a KL-constrained policy-improvement objective.
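A sketch of that connection, under assumed notation that does not appear in these notes: $\mu$ is the behavior policy that collected the data, $A^{\mu}$ its advantage function, $\epsilon$ the KL budget, and $\beta$ the Lagrange multiplier (temperature).

```latex
% KL-constrained policy improvement
\max_{\pi}\; \mathbb{E}_{s \sim d_{\mu}}\, \mathbb{E}_{a \sim \pi(\cdot \mid s)}
  \big[ A^{\mu}(s, a) \big]
\quad \text{s.t.} \quad
\mathbb{E}_{s \sim d_{\mu}} \big[ D_{\mathrm{KL}}\big( \pi(\cdot \mid s) \,\|\, \mu(\cdot \mid s) \big) \big] \le \epsilon

% Optimal (non-parametric) solution of the Lagrangian
\pi^{*}(a \mid s) = \frac{1}{Z(s)}\, \mu(a \mid s)\, \exp\!\Big( \tfrac{1}{\beta} A^{\mu}(s, a) \Big)

% Projecting \pi^{*} onto a parametric policy gives the advantage-weighted objective
\arg\max_{\pi}\; \mathbb{E}_{(s, a) \sim \mathcal{D}}
  \Big[ \log \pi(a \mid s)\, \exp\!\Big( \tfrac{1}{\beta} A^{\mu}(s, a) \Big) \Big]
```

The last step is where the approximation enters: the partition function $Z(s)$ is typically dropped, and the expectation is taken over logged data $\mathcal{D}$ sampled from $\mu$.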
