State-value function

Creator
Seonglae Cho
Created
2023 Jul 18 9:0
Edited
2024 May 1 2:14

V-function

Expected return starting from a particular state under a given policy
$V^\pi(s_t) = E[Q^\pi(s_t, a_t)]$
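As a rough expansion of this identity, the expectation can be written out explicitly. This sketch assumes a discrete action space and a discount factor $\gamma \in [0, 1]$ with rewards $r_t$; neither is stated in the note, so treat the symbols $G_t$, $\gamma$, and $r_t$ as illustrative notation.

$$
V^\pi(s) = E_\pi\left[ G_t \mid s_t = s \right], \qquad G_t = \sum_{k=0}^{\infty} \gamma^k r_{t+k+1}
$$

$$
V^\pi(s_t) = E_{a_t \sim \pi(\cdot \mid s_t)}\left[ Q^\pi(s_t, a_t) \right] = \sum_{a} \pi(a \mid s_t)\, Q^\pi(s_t, a)
$$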
State-value estimations
 
 
 
The state-value function is the expectation of the action-value function (Q-function), taken over the actions that the policy selects in that state.
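A minimal sketch of this relationship for a tabular case, assuming a small discrete state and action space. The function name `state_values_from_q` and the example tables are hypothetical and only illustrate weighting each Q-value by the policy's action probability.

```python
import numpy as np


def state_values_from_q(q_table, policy):
    """Return V(s) = sum_a pi(a|s) * Q(s, a) for every state.

    q_table: array of shape (n_states, n_actions) with Q-values.
    policy:  array of the same shape; row s holds pi(.|s).
    """
    q_table = np.asarray(q_table, dtype=float)
    policy = np.asarray(policy, dtype=float)
    if q_table.shape != policy.shape:
        raise ValueError("q_table and policy must have the same shape")
    if not np.allclose(policy.sum(axis=1), 1.0):
        raise ValueError("each policy row must sum to 1")
    # Weight each Q-value by the probability of taking that action,
    # then sum over the action axis.
    return (policy * q_table).sum(axis=1)


# Illustrative values only (2 states, 2 actions).
q_table = np.array([[1.0, 3.0],
                    [0.0, 2.0]])
policy = np.array([[0.5, 0.5],
                   [0.25, 0.75]])

v = state_values_from_q(q_table, policy)
print(v)
```

For a deterministic policy, each row of `policy` is one-hot, so V(s) reduces to the Q-value of the chosen action.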
 
 
 
 
