Cost inefficientHard to parallelizeLack of reward signalSome trajectories are only reproduce possible in simulationRL for real-world termsSim2real Gap RL methods for real-worldRMA