Texonom
/
Engineering
/
Data Engineering
/
Artificial Intelligence
/
Machine Learning
/
Machine Learning Theory
/
Reinforcement Learning
/
Policy Gradient Theorem
/
Advantage function
/
N-step return
Search
N-step return
Created
Created
2024 Apr 30 7:58
Editor
Editor
Seonglae Cho
Creator
Creator
Seonglae Cho
Edited
Edited
2024 Apr 30 8:4
Refs
Refs
Hard to find optimized N so we use
GAE
Recommendations
Texonom
/
Engineering
/
Data Engineering
/
Artificial Intelligence
/
Machine Learning
/
Machine Learning Theory
/
Reinforcement Learning
/
Policy Gradient Theorem
/
Advantage function
/
N-step return