RAFT

Creator

Seonglae Cho

Created

2023 Sep 9 17:12

Editor

Seonglae Cho

Edited

2023 Sep 9 17:17

Refs

rAnked FineTuning

both large language models and diffusion models.

selects the high-quality samples, discarding those that exhibit undesired behavior, and subsequently enhancing the model by fine-tuning on these filtered samples

arxiv.org

https://arxiv.org/pdf/2304.06767.pdf

Recommendations

///////