Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/Machine Learning/Reinforcement Learning/Reinforcement Learning Term/
Policy Rollout
Search

Policy Rollout

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2024 Mar 7 12:34
Editor
Editor
Seonglae ChoSeonglae Cho
Edited
Edited
2024 Oct 22 23:53
Refs
Refs
Importance sampling

Episode, Trajectory Rollout

Policy Rollout Techniques
Replay Buffer
Epsilon Greedy
RL Frame skip
RL Target Network
Entropy Bonus
Maximum Entropy Objective
UCB exploration
 
 

Trajectory can end in two ways

  • catastrophic failure, like crashing
  • truncation like exceeding the maximum episode length
 
 
 
 
Rollout policy
AI에 관련된 논문과 지식을 포스팅한 블로그입니다.
Rollout policy
https://ai-information.blogspot.com/2019/03/rollout-policy.html
Rollout policy
 
 

Backlinks

Reinforcement Learning TermModel based RL

Recommendations

Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/Machine Learning/Reinforcement Learning/Reinforcement Learning Term/
Policy Rollout
Copyright Seonglae Cho