PCA

Created
Created
2023 May 4 4:50
Editor
Creator
Creator
Seonglae Cho
Edited
Edited
2025 Mar 25 2:16
Refs
Refs
SVD
LDA

Principal Component Analysis (PCA)

Principal Component Analysis (PCA) is an unsupervised machine learning technique that analyzes and transforms high-dimensional data by exploring the covariance relationships between features. It projects data into a lower-dimensional space while preserving maximum variance of the original data.

Key Characteristics

  • Operates by dual optimization an be written in its Lagrange form:
    • Minimizes projection error
    • Maximizes projection variance to retain original information
  • Maintains data variance without normalizing principal components to unit length

Mathematical Foundation

  • The components can be obtained by performing an eigen-decomposition of the covariance matrix with standards solvers.
  • Solution involves selecting:
    • M-largest eigenvalues for dimension retention
    • (D-M)-smallest eigenvalues for dimension reduction

Applications

  • Feature extraction and dimensionality reduction
  • Data compression
  • Data visualization

Computational Efficiency

For high-dimensional data, computational complexity can be reduced from O(D3D^3) to O(N3N^3) using
Kernel PCA
notion image
notion image
PCA Notion
 
 
 
 
 
 

Recommendations