REINFORCE
Model-free Monte Carlo policy gradient method: the policy parameters are updated from the returns of complete sampled episodes.
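
As a rough illustration of the idea, the sketch below applies the Monte Carlo policy gradient update (commonly called REINFORCE) to a tiny corridor task. The environment, the tabular softmax policy, and every name and hyperparameter here (`N_STATES`, `run_episode`, `train`, `alpha`, `gamma`) are assumptions made up for this example, not taken from the linked sources. The learner only sees sampled episodes, which is what makes it model-free.

```python
import math
import random

# Hypothetical toy task (an assumption for illustration): corridor of states 0..3.
N_STATES = 4          # state 3 is the terminal goal
LEFT, RIGHT = 0, 1


def softmax(prefs):
    """Turn a list of action preferences into probabilities."""
    m = max(prefs)
    exps = [math.exp(p - m) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]


def discounted_returns(rewards, gamma):
    """Monte Carlo return G_t = r_t + gamma * G_{t+1}, computed backwards."""
    returns = [0.0] * len(rewards)
    g = 0.0
    for t in reversed(range(len(rewards))):
        g = rewards[t] + gamma * g
        returns[t] = g
    return returns


def run_episode(theta, rng, max_steps=50):
    """Sample one episode with the current policy (reward 1 on reaching the goal)."""
    state, states, actions, rewards = 0, [], [], []
    for _ in range(max_steps):
        probs = softmax(theta[state])
        action = RIGHT if rng.random() < probs[RIGHT] else LEFT
        if action == RIGHT:
            next_state = min(state + 1, N_STATES - 1)
        else:
            next_state = max(state - 1, 0)
        reward = 1.0 if next_state == N_STATES - 1 else 0.0
        states.append(state)
        actions.append(action)
        rewards.append(reward)
        state = next_state
        if state == N_STATES - 1:
            break
    return states, actions, rewards


def train(episodes=5000, alpha=0.1, gamma=0.9, seed=0):
    """Update theta[s][a] by alpha * G_t * grad log pi(a_t | s_t) after each episode."""
    rng = random.Random(seed)
    theta = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        states, actions, rewards = run_episode(theta, rng)
        returns = discounted_returns(rewards, gamma)
        for s, a, g in zip(states, actions, returns):
            probs = softmax(theta[s])
            for b in (LEFT, RIGHT):
                # Gradient of log softmax w.r.t. theta[s][b]: 1[b == a] - pi(b | s)
                grad_log_pi = (1.0 if b == a else 0.0) - probs[b]
                theta[s][b] += alpha * g * grad_log_pi
    return theta


if __name__ == "__main__":
    learned = train()
    for s in range(N_STATES - 1):
        print(s, softmax(learned[s]))
```

The update uses the full sampled return `G_t` with no baseline and no bootstrapped value estimate, so it is simple but can have high variance; subtracting a baseline is a common refinement that this sketch leaves out.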

Policy Gradient Learning Methods
link.springer.com
https://link.springer.com/article/10.1007/BF00992696
[HUFS RL] Reinforcement Learning: Policy Gradient (REINFORCEMENT)
Definition of reinforcement learning: learning a policy that lets an agent take the actions that earn the maximum reward in a given environment. Environment: the setting in which the agent takes actions. Super Mari...
https://velog.io/@uonmf97/Reinforcement-Learning-Policy-Gradient-REINFORCEMENT
![[HUFS RL] Reinforcement Learning: Policy Gradient (REINFORCEMENT)](https://velog.velcdn.com/images/uonmf97/post/91d613b1-d2bb-4101-8139-1c1694120afe/Screen%20Shot%202022-02-23%20at%207.54.41%20PM.png)

Seonglae Cho