rAnked FineTuning
both large language models and diffusion models.
selects the high-quality samples, discarding those that exhibit undesired behavior, and subsequently enhancing the model by fine-tuning on these filtered samples
Seonglae Cho
Seonglae Cho