VAIL

Creator
Seonglae Cho
Created
2024 Jun 29 13:15
Editor
Seonglae Cho
Edited
2024 Jun 29 13:18
Refs
Variational Inference
An approach that learns by imitating the expert's behavior patterns rather than directly inferring the reward function.
Variational Adversarial Imitation Learning (VAIL) paper review
Review by Kim Hangyeol (김한결) / Master's student (gksruf621@postech.ac.kr). The paper that introduces Variational Adversarial Imitation Learning is actually titled Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow. Because the Variational Discriminator Bottleneck (VDB) paper proposes a technique for adversarial learning methods such as GANs, it covers not only imitation learning but also other tasks such as image generation. As for us, Imitation Le..
https://rlwithme.tistory.com/7
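The review excerpt above says VAIL comes from the Variational Discriminator Bottleneck, which constrains information flow into the discriminator. A minimal sketch of that idea follows, assuming a diagonal Gaussian encoder with a standard normal prior and a dual-ascent update for the multiplier. The function names, the information budget `i_c`, and the step size are hypothetical choices for illustration and are not taken from this page.

```python
import math

def gaussian_kl_to_standard_normal(mu, log_var):
    """KL( N(mu, diag(exp(log_var))) || N(0, I) ) for one encoded sample."""
    return 0.5 * sum(m * m + math.exp(lv) - lv - 1.0
                     for m, lv in zip(mu, log_var))

def vdb_discriminator_loss(expert_probs, policy_probs, kl_values, beta, i_c):
    """Binary cross-entropy plus beta * (mean KL - i_c).

    expert_probs / policy_probs: discriminator outputs D(z) in (0, 1)
    for expert and policy samples (assumed already computed elsewhere).
    kl_values: per-sample KL of the encoder against the prior.
    Returns (loss, mean_kl).
    """
    bce_expert = -sum(math.log(p) for p in expert_probs) / len(expert_probs)
    bce_policy = -sum(math.log(1.0 - p) for p in policy_probs) / len(policy_probs)
    mean_kl = sum(kl_values) / len(kl_values)
    loss = bce_expert + bce_policy + beta * (mean_kl - i_c)
    return loss, mean_kl

def update_beta(beta, mean_kl, i_c, step_size):
    """Dual ascent on the multiplier, projected so that beta stays >= 0."""
    return max(0.0, beta + step_size * (mean_kl - i_c))

# Example: one step with toy numbers (all values are made up).
kls = [gaussian_kl_to_standard_normal([1.0, 0.0], [0.0, 0.0])]
loss, mean_kl = vdb_discriminator_loss([0.5], [0.5], kls, beta=0.1, i_c=0.5)
beta = update_beta(0.1, mean_kl, i_c=0.5, step_size=1e-2)
```

When the mean KL exceeds the budget, `beta` grows and the encoder is pushed to pass less information to the discriminator; when it is under budget, `beta` shrinks toward zero.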
 
 

Copyright Seonglae Cho