7 HW, only count top 6
Discussion is fine, copy is not allowed (list up all names of collaborators)
Plan
- Imitation learning
- Actor critic algorithms
- Q learning algorithms
- On policy RL algorithm
- Model-based RL
- Offline RL
- RLHF
Changed detail
- Behavior cloning & Dagger (Imitation learning)
- Policy Gradient & PPO (Policy Gradient & Actor critic algorithms)
- DQN (Q learning)
- SAC (Soft actor critic)
- CQL (Offline-RL)
- (Model-based) Dreamer
YSU RL HWs
Render
export DEBIAN_FRONTEND=noninteractive apt-get update apt-get install -y libosmesa6-dev libgl1-mesa-glx libglfw3 libgl1-mesa-dev libglew-dev patchelf ffmpeg conda install -c conda-forge gcc=12.1.0 conda install pyopengl
Seonglae Cho