Function vector, ICLR 2024
LLMs represent the task demonstrated in an in-context learning (ICL) prompt as a vector, and inserting that vector into the hidden states lets the model perform the task zero-shot. A small number of mid-layer attention heads carry this task information; summing their outputs forms the function vector (FV), and causal mediation analysis confirms the effect. Some FVs can be composed through vector addition to trigger new composite tasks.
https://openreview.net/pdf?id=AwyxtyMwaG#:~:text=We%20characterize%20a%20key%20mechanism,transformer%20hidden%20states%20during%20ICL
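
The summary above can be illustrated with a toy sketch in plain Python. It is not the paper's code: the names `function_vector`, `inject`, and `add_vectors`, the `(layer, head)` keys, and every number are hypothetical, and real FVs come from averaging attention head outputs of an actual model over many ICL prompts. The sketch only shows the arithmetic: sum the mean outputs of a few selected heads to get an FV, add FVs to compose tasks, and add the result to a hidden state.

```python
from typing import Dict, List, Sequence, Tuple

Vector = List[float]
HeadId = Tuple[int, int]  # (layer, head index); hypothetical naming


def add_vectors(a: Sequence[float], b: Sequence[float]) -> Vector:
    """Element-wise sum of two vectors of equal length."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same dimension")
    return [x + y for x, y in zip(a, b)]


def function_vector(
    mean_head_outputs: Dict[HeadId, Vector],
    selected_heads: Sequence[HeadId],
) -> Vector:
    """Sum the task-averaged outputs of the selected attention heads."""
    if not selected_heads:
        raise ValueError("at least one head must be selected")
    fv = [0.0] * len(mean_head_outputs[selected_heads[0]])
    for head in selected_heads:
        fv = add_vectors(fv, mean_head_outputs[head])
    return fv


def inject(hidden_state: Sequence[float], fv: Sequence[float]) -> Vector:
    """Add a function vector to a hidden state (assumed: one token, one layer)."""
    return add_vectors(hidden_state, fv)


# Toy mean head outputs for two tasks (made-up numbers, dimension 3).
task_a_outputs = {(9, 3): [1.0, 0.0, 2.0], (11, 0): [0.5, 0.5, 0.0], (2, 1): [9.0, 9.0, 9.0]}
task_b_outputs = {(9, 3): [0.0, 1.0, 0.0], (11, 0): [0.0, 0.0, 1.0], (2, 1): [9.0, 9.0, 9.0]}

# Assumed: a small set of mid-layer heads was already identified as causally important.
selected = [(9, 3), (11, 0)]

fv_a = function_vector(task_a_outputs, selected)
fv_b = function_vector(task_b_outputs, selected)
fv_composite = add_vectors(fv_a, fv_b)  # composition by vector addition

hidden = [0.25, -1.0, 0.0]  # hidden state of a zero-shot prompt (made up)
steered = inject(hidden, fv_composite)
print(fv_a, fv_b, fv_composite, steered)
```

Head `(2, 1)` is deliberately left out of `selected` to mirror the claim that only a few heads contribute; how those heads are chosen, and at which layer the vector is added, is outside this sketch.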

Seonglae Cho