Most commonly used in practiceHow good advantage is an action compared to the policy? Q - Vpositive or negativeusually average is near 0Advantage estimationsGAE N-step return Can show that advantage-weighted objective approximates KL-constrained objective.