- I2CL (Implicit In-Context Learning) compresses few-shot demonstrations into a single context vector in activation space instead of supplying them as prompt tokens.
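- Achieves few-shot-level performance at zero-shot inference cost: the context vector is injected through linear combinations with the query's activations, so no attention over demonstration tokens and no KV cache for them is needed.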
- Robust to the selection and ordering of demonstrations; the learned coefficients can be reused as a task ID and for transfer learning.
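
The idea in the bullets above can be sketched in a few lines. This is a minimal illustration only, not the authors' implementation: it assumes the context vector is an element-wise mean of per-demonstration activation vectors and that injection is a single linear combination `beta * query + lam * context`. The names `context_vector`, `inject`, `lam`, and `beta` are hypothetical, and plain Python lists stand in for real model activations.

```python
from typing import List

Vector = List[float]


def context_vector(demo_activations: List[Vector]) -> Vector:
    """Average per-demonstration activation vectors element-wise.

    Assumption: mean aggregation. The result does not depend on the
    number of prompt tokens, only on the activation dimension.
    """
    n = len(demo_activations)
    dim = len(demo_activations[0])
    return [sum(vec[i] for vec in demo_activations) / n for i in range(dim)]


def inject(query_activation: Vector, context: Vector, lam: float, beta: float) -> Vector:
    """Return beta * query_activation + lam * context (a linear combination).

    lam and beta stand in for the learned scalar coefficients (hypothetical names).
    """
    return [beta * q + lam * c for q, c in zip(query_activation, context)]


# Toy activations for two demonstrations (made-up numbers).
demos = [[1.0, 2.0], [3.0, 4.0]]
ctx = context_vector(demos)

# Zero-shot query activation, steered by the context vector.
steered = inject([1.0, 1.0], ctx, lam=0.5, beta=1.0)
print(ctx, steered)
```

In this sketch the mean aggregation returns the same vector when the demonstrations are reordered, and the only task-specific state kept at inference is the context vector plus the scalars `lam` and `beta`.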

Seonglae Cho