make environment
Ant-V4
- 8 action space
CartPole-v1
HalfCheetah-v4
- can have negative reward
Reacher-v4
Mujoco envs
Classic envs
CartPole-v1
CartPole-v0
the agent can choose to push the cart to the left or to the right. The goal of the agent is to keep the pole balanced on the cart for as long as possible
Toy Text
FrozenLake-v1