Balance between Exploration and Exploitation
With probability ε, select a random action; otherwise, select the greedy action.
Entropy: usually used for discontinuous spaces.
Epsilon Scheduling: change epsilon over training.
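
A minimal sketch of random action selection and epsilon scheduling, assuming a discrete action set with one estimated value per action. The function names, the linear decay, and the default values (start at 1.0, end at 0.05, over 1000 steps) are illustrative assumptions and are not taken from the notes.

```python
import random


def linear_epsilon(step, eps_start=1.0, eps_end=0.05, decay_steps=1000):
    """Hypothetical schedule: anneal epsilon linearly, then hold it at eps_end."""
    # Clamp training progress to [0, 1] so epsilon never leaves the chosen range.
    fraction = min(max(step / decay_steps, 0.0), 1.0)
    return eps_start + fraction * (eps_end - eps_start)


def epsilon_greedy(q_values, epsilon, rng=random):
    """With probability epsilon pick a random action, otherwise the greedy one."""
    if rng.random() < epsilon:
        # Explore: uniform random action index.
        return rng.randrange(len(q_values))
    # Exploit: index of the highest estimated value (first one on ties).
    return max(range(len(q_values)), key=lambda a: q_values[a])


if __name__ == "__main__":
    q_values = [0.1, 0.9, 0.3]  # made-up value estimates for three actions
    for step in (0, 500, 1000):
        eps = linear_epsilon(step)
        action = epsilon_greedy(q_values, eps)
        print(f"step={step} epsilon={eps:.3f} action={action}")
```

Early in training epsilon is high, so most actions are random (exploration); as epsilon decays, the greedy action is chosen more often (exploitation). Other schedules, such as exponential decay, fit the same interface.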