Latent dynamics model
Too much information → slow & inaccurate, so latent dynamics model only focused on predictive of reward by only retaining information related to reward. Specifically, It does not retain initial state and it only utilize latent state.
We fit the latent dynamics model by using