Model soups: averaging weights of multiple fine-tuned models...
The conventional recipe for maximizing model accuracy is to (1) train multiple models with various hyperparameters and (2) pick the individual model which performs best on a held-out validation...
https://arxiv.org/abs/2203.05482