Temporal difference learning

Creator

Creator

Seonglae Cho

Created

Created

2025 Mar 6 23:23

Editor

Editor

Seonglae Cho

Edited

Edited

2025 Mar 6 23:26

Refs

Refs

State-value function updating method based on immediate feedback before the episode ends

Andrew Barto and Richard Sutton

Andrew Barto and Richard Sutton are the recipients of the 2024 ACM A.M. Turing Award for developing the conceptual and algorithmic foundations of reinforcement learning.

For developing the conceptual and algorithmic foundations of reinforcement learning.

https://awards.acm.org/about/2024-turing

Andrew Barto and Richard Sutton are the recipients of the 2024 ACM A.M. Turing Award for developing the conceptual and algorithmic foundations of reinforcement learning.

Recommendations

///////