Tradeoff between computational efficiency Sample efficient methods often requires heavy computation.Achieve better or comparable results with smaller training dataReuse same data from off-policy training includes this