Good–Turing frequency estimation
Good–Turing frequency estimation is a statistical technique for estimating the probability of encountering an object of a hitherto unseen species, given a set of past observations of objects from different species. In drawing balls from an urn, the 'objects' would be balls and the 'species' would be the distinct colours of the balls (finite but unknown in number). After drawing
R
red
{\displaystyle R_{\text{red}}}
red balls,
R
black
{\displaystyle R_{\text{black}}}
black balls and
R
green
{\displaystyle R_{\text{green}}}
green balls, we would ask what is the probability of drawing a red ball, a black ball, a green ball or one of a previously unseen colour.
https://en.wikipedia.org/wiki/Good%E2%80%93Turing_frequency_estimation