When you have multiple options (e.g., several slot machines, advertisements, types of medicine, etc.) and don't know the success probability of each, this is a method to experimentally find "which one works best?"
It manages each option's success probability as a "probability distribution" (e.g., beta distribution), then randomly samples one probability from each option's distribution. Based on the results, it updates the distribution for that option.