t-Stochastic Neighbor Embedding
most popular tool for visualization
because for visualization, the goal is similarity preserving rather than information preserving
The locations of the points in the map are determined by minimizing the KL divergence
of the distance distributions
P calculates the similarity x based on Gaussian distribution
Q calculates the similarity between y based on t-distribution