Finetuned Language Models are Zero-Shot Learners
A method that fine-tunes a language model on instruction datasets to improve its zero-shot performance on unseen tasks.
There is a mismatch between the language-modeling objective and human preferences; RLHF later addresses this gap.
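A minimal sketch of the core idea, assuming an illustrative NLI example and made-up templates (the actual FLAN paper uses around ten templates per dataset, with different wording): each labeled dataset is rephrased into natural-language instructions, and the model is then fine-tuned to generate the answer text.

```python
# Sketch of FLAN-style instruction formatting: a labeled NLI example is
# rephrased via templates into (input, target) text pairs, which a causal
# LM can then be fine-tuned on. Templates and label names are illustrative,
# not the exact ones from the FLAN paper.

NLI_TEMPLATES = [
    "Premise: {premise}\nHypothesis: {hypothesis}\n"
    "Does the premise entail the hypothesis? Answer yes, no, or maybe.",
    "{premise}\nBased on the paragraph above, can we conclude that "
    '"{hypothesis}"? Answer yes, no, or maybe.',
]

# entailment / neutral / contradiction, mapped to answer strings
LABELS = {0: "yes", 1: "maybe", 2: "no"}

def to_instruction(example: dict, template: str) -> dict:
    """Turn one NLI example into an (input, target) instruction pair."""
    return {
        "input": template.format(
            premise=example["premise"], hypothesis=example["hypothesis"]
        ),
        "target": LABELS[example["label"]],
    }

example = {
    "premise": "A dog is running through the snow.",
    "hypothesis": "An animal is outside.",
    "label": 0,
}
for t in NLI_TEMPLATES:
    pair = to_instruction(example, t)
    print(pair["input"], "->", pair["target"], sep="\n")
```

Mixing many tasks and templates this way is what lets the model generalize: at evaluation time it sees a held-out task type phrased as an instruction it has never been trained on.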
Introducing FLAN: More generalizable Language Models with Instruction Fine-Tuning
https://ai.googleblog.com/2021/10/introducing-flan-more-generalizable.html?m=1

What is Instruction Tuning?
An overview of instruction tuning as used in LLMs
https://velog.io/@nellcome/Instruction-Tuning이란

Finetuned Language Models Are Zero-Shot Learners
This paper explores a simple method for improving the zero-shot learning abilities of language models. We show that instruction tuning -- finetuning language models on a collection of tasks...
https://arxiv.org/abs/2109.01652

The Flan Collection: Designing Data and Methods for Effective...
We study the design decisions of publicly available instruction tuning methods, and break down the development of Flan 2022 (Chung et al., 2022). Through careful ablation studies on the Flan...
https://arxiv.org/abs/2301.13688

Scaling Instruction-Finetuned Language Models
Finetuning language models on a collection of datasets phrased as instructions has been shown to improve model performance and generalization to unseen tasks. In this paper we explore instruction...
https://arxiv.org/abs/2210.11416


Seonglae Cho