CEM Iteration

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2024 May 10 7:34
Editor
Edited
Edited
2024 Jun 17 4:28

Cross-Entropy Method

Sampling-based optimization
Sample mean and variance of elite samples are minimizing CE between the current sampling distribution and target distribution
CEM Iteration (more planning → better performance) 5 is a good point.
Set action distribution and update distribution parameters re-fit distribution using top-K elite actions
notion image
CEM can generate the action only by planning through the model, without any explicit policy.
 
 

Random shooting

Guess & check without elite fitting. We improved this distribution by the selected elites.
 
 
 
 
 
 

Recommendations