Model Generalization

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2023 May 2 1:41
Editor
Edited
Edited
2024 Oct 31 11:18

Central goal of machine learning

To predict un-seen data and model’s generalization ability is model’s capability to adapt properly to new data.
Bias-Variance Trade-off
to minimize complexity and variance to improve model generalization.
Model Generalization Notion
 
 
AI Generalization Methods
 
 
 
 
OOD generalisation is crucial given the wide range of real-world scenarios in which these models are being used, while output diversity refers to the model’s ability to generate varied outputs and is important for a variety of use cases
RLHF generalises better than SFT to new inputs, particularly as the distribution shift between train and test becomes larger. However, RLHF significantly reduces output diversity compared to SFT across a variety of measures, implying a tradeoff in current LLM fine-tuning methods between generalisation and diversity.
 

Recommendations