K-means

Creator
Created
Created
2019 Nov 5 3:14
Editor
Edited
Edited
2023 Jun 13 9:15

Optimize minimize sum of similarity measure (
Euclidean Distance
)

notion image
notion image
K is hyper-parameter and K center, N number of data point
  1. Starting from randomly chosen K centroids
  1. Given μ, find optimal assignment variable z - calculate distance based on dimension
  1. Given z, find the optimal centroids (mean) μ
  1. Until Convergence (proved always)
K-means Notion
 
 

Limitation

It is sensitive to initial centroids, data outlier
K-means Substitutes
 
 
 
 
 

Recommendations