Limitations
- Collecting expert demonstrations can be difficult or impossible in some scenarios
- Learned behavior will never be better than expert
- Does not provide a framework for learning from experience, indirect feedback
it has limitations in that it cannot exceed supervised performance through environmental interactions, which is an advantage of RL. Trying to overcome this through Self Play
We cannot resolve Compounding Error so we only predict single chunk.
Imitation Learnings
Imitation Learning Models