Expert policy가 필요한 단점Compounding Error 에러에 도움은 된다 to address Distribution Shift Roll out learned policy Query expert action at visited states Aggregate corrections with existing data Update policy human gated DAggerExpert intervenes at time when policy makes mistake