Tradeoff between computational efficiency Sample efficient methods often requires heavy computation.Achieve better or comparable results with smaller training dataReuse same data from off-policy training includes this LeveragingWorld Model for Offline Learning to Online Learning RL forDistribution Shift groundingopenreview.nethttps://openreview.net/pdf?id=oBXfPyi47m