Advantage function

Created: 2024 Mar 20 2:8

Edited: 2024 Nov 21 10:21

How much better is an action than the policy's average behavior in that state?

A(s, a) = Q(s, a) - V(s)

The advantage can be positive (the action is better than the policy's average) or negative (the action is worse).
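A minimal tabular sketch of the definition, to make the sign concrete. The Q-table, the policy, and the helper name `advantage` are made up for illustration and are not from the note; it assumes V(s) is the policy-weighted average of Q(s, a).

```python
import numpy as np

def advantage(Q, pi):
    """Return A(s, a) = Q(s, a) - V(s), with V(s) = sum_a pi(a|s) * Q(s, a)."""
    V = (pi * Q).sum(axis=1)   # state values under the policy, shape (n_states,)
    return Q - V[:, None]      # broadcast V across the action axis

# Hypothetical example: 2 states, 3 actions.
Q = np.array([[1.0, 2.0, 3.0],
              [0.0, 0.0, 4.0]])
pi = np.array([[0.5, 0.25, 0.25],
               [0.25, 0.25, 0.5]])

A = advantage(Q, pi)
print(A)                       # positive entries beat the policy's average
print((pi * A).sum(axis=1))    # expected advantage under pi is zero per state
```

Because V is the average of Q under the policy, the advantages in each state always average to zero under that same policy: some actions must be at or above average and some at or below.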

Advantage estimations

One can show that maximizing an advantage-weighted objective approximates solving a KL-constrained policy improvement objective.
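A sketch of the usual argument behind that statement, written in the style of advantage-weighted regression. The symbols are assumptions not in the note: \mu is the current (data-collecting) policy, d_\mu its state distribution, \epsilon the KL budget, \beta the temperature arising from the Lagrange multiplier, and \pi_\theta a parametric policy.

```latex
\begin{aligned}
&\max_{\pi}\; \mathbb{E}_{s \sim d_\mu}\,\mathbb{E}_{a \sim \pi(\cdot\mid s)}\big[A^{\mu}(s,a)\big]
\quad \text{s.t.}\quad
\mathbb{E}_{s \sim d_\mu}\big[D_{\mathrm{KL}}\big(\pi(\cdot\mid s)\,\|\,\mu(\cdot\mid s)\big)\big] \le \epsilon \\[4pt]
&\Rightarrow\quad \pi^{*}(a\mid s) \;\propto\; \mu(a\mid s)\,\exp\!\big(A^{\mu}(s,a)/\beta\big) \\[4pt]
&\Rightarrow\quad \max_{\theta}\; \mathbb{E}_{s \sim d_\mu,\; a \sim \mu(\cdot\mid s)}
\big[\exp\!\big(A^{\mu}(s,a)/\beta\big)\,\log \pi_\theta(a\mid s)\big]
\end{aligned}
```

The first line is the constrained problem, the second is its closed-form solution from the Lagrangian, and the third projects that solution onto \pi_\theta by minimizing the KL divergence from \pi^{*} to \pi_\theta. The result is only approximate because the per-state normalizer of \pi^{*} is typically dropped and the expected-advantage objective is itself a local surrogate for policy improvement.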
