A state-value function updating method based on immediate feedback received before the episode ends.

Andrew Barto and Richard Sutton are the recipients of the 2024 ACM A.M. Turing Award for developing the conceptual and algorithmic foundations of reinforcement learning. https://awards.acm.org/about/2024-turing