- I2CL compresses few-shot examples into a single context vector in activation space instead of tokens.
- Achieves few-shot performance at zero-shot cost using only linear combinations without attention or KV cache.
- Robust to example selection and ordering; learned coefficients are reusable for task-id/transfer learning.
ICLR 2025
arxiv.org
https://arxiv.org/pdf/2405.14660

Seonglae Cho