rAnked FineTuning
both large language models and diffusion models.
selects the high-quality samples, discarding those that exhibit undesired behavior, and subsequently enhancing the model by fine-tuning on these filtered samples
arxiv.org
https://arxiv.org/pdf/2304.06767.pdf

Seonglae Cho