Select a subset of features that are relevant for the model
- Relative importance only meaningful after scaling
- Correlation among features might make coefficients uninterpretable
- L1 regularization might pick up a random feature from a correlated group
- Regularization can hugely impact the interpretation of linear models
Sparse Models