Texonom
/
Engineering
/
Data Engineering
/
Artificial Intelligence
/
Machine Learning
/
Machine Learning Theory
/
Reinforcement Learning
/
Reinforcement Learning Method
/
Actor Critic
/
TRPO
Search
TRPO
Creator
Creator
Seonglae Cho
Created
Created
2023 Jul 15 17:9
Editor
Editor
Seonglae Cho
Edited
Edited
2024 Apr 30 11:32
Refs
Refs
PPO
Importance sampling
Trust Region Policy Optimization
KL divergence로 implementation이 어렵고 느려서 ppo가 선호된다
Recommendations
Texonom
/
Engineering
/
Data Engineering
/
Artificial Intelligence
/
Machine Learning
/
Machine Learning Theory
/
Reinforcement Learning
/
Reinforcement Learning Method
/
Actor Critic
/
TRPO