Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Industry/AI Scaling/Model Merging/
Model soups
Search

Model soups

Creator
Creator
Seonglae Cho
Created
Created
2024 Mar 31 5:40
Editor
Editor
Seonglae Cho
Edited
Edited
2024 Mar 31 5:41
Refs
Refs

Linear merge

Weight average
 
 
 
 
 
 
Model soups: averaging weights of multiple fine-tuned models...
The conventional recipe for maximizing model accuracy is to (1) train multiple models with various hyperparameters and (2) pick the individual model which performs best on a held-out validation...
Model soups: averaging weights of multiple fine-tuned models...
https://arxiv.org/abs/2203.05482
Model soups: averaging weights of multiple fine-tuned models...
 
 

Recommendations

Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Industry/AI Scaling/Model Merging/
Model soups
Copyright Seonglae Cho