State-value function

Creator

Creator

Seonglae Cho

Created

Created

2023 Jul 18 9:0

Editor

Editor

Seonglae Cho

Edited

Edited

2026 Feb 23 23:14

Refs

Refs

Action-value function

Value-Based Learning

V-function

Expectation of Q-function (

Action-value function); Expected return starting from a particular state under a given policy.

notion image

notion image

State-value estimations

Policy Gradient Baseline

Value function is distributional expectation of State-value-action function

notion image

(3) 가치함수와 벨만방정식

앞 장에서 문제를 MDP로 정의하는 방식에 대해 살펴보았다. 이제 본격적으로 가치함수와 큐함수, 벨만 기대 방정식과 벨만 최적 방정식에 대해 톺아보자.

https://jang-inspiration.com/bellman-equation

(3) 가치함수와 벨만방정식

Recommendations

///////