An implicit CoT framework (CODI) that jointly trains an explicit natural-language CoT task (teacher) and an implicit CoT task (student) within the same model. It aligns the hidden state of the token immediately preceding answer generation (e.g., the last token of "The answer is:") between the two tasks with an L1 self-distillation loss, compressing the explicit CoT into a small number of continuous latent thoughts that replace it.
https://arxiv.org/pdf/2502.21074
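
A minimal PyTorch sketch of this objective, assuming a GPT-2-style decoder-only LM. The probe string, the three latent thoughts, the last-layer-only alignment, and the loss masking are simplifications for illustration (the paper distills hidden states across layers and masks the teacher CE loss to the CoT and answer spans):

```python
# CODI-style self-distillation sketch (hypothetical simplification of
# arXiv:2502.21074): one shared model plays teacher (explicit CoT) and
# student (continuous latent thoughts); an L1 loss aligns their hidden
# states at the token just before the answer is generated.
import torch
import torch.nn.functional as F
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tok = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")   # shared teacher/student model

question = "Q: 2 + 3 * 4 = ?"
cot      = " 3 * 4 = 12, then 2 + 12 = 14."
probe    = " The answer is:"                      # aligned at its last token
answer   = " 14"
n_ans    = tok(answer, return_tensors="pt").input_ids.shape[1]

# ---- Teacher pass: question + explicit CoT + probe + answer --------------
t_ids = tok(question + cot + probe + answer, return_tensors="pt").input_ids
t_out = model(t_ids, labels=t_ids, output_hidden_states=True)  # CE over full seq (paper masks it)
t_pos = t_ids.shape[1] - n_ans - 1                             # last probe token
t_hidden = t_out.hidden_states[-1][0, t_pos]                   # last layer only (paper uses all layers)

# ---- Student pass: question + latent thoughts + probe + answer -----------
emb = model.get_input_embeddings()
seq = emb(tok(question, return_tensors="pt").input_ids)
for _ in range(3):  # 3 latent thoughts; re-encodes each step for clarity
    h = model(inputs_embeds=seq, output_hidden_states=True).hidden_states[-1][:, -1:]
    seq = torch.cat([seq, h], dim=1)              # feed hidden state back as next input

pa_ids = tok(probe + answer, return_tensors="pt").input_ids
s_emb = torch.cat([seq, emb(pa_ids)], dim=1)
s_labels = torch.full(s_emb.shape[:2], -100)      # supervise only probe + answer tokens
s_labels[0, -pa_ids.shape[1]:] = pa_ids[0]
s_out = model(inputs_embeds=s_emb, labels=s_labels, output_hidden_states=True)
s_hidden = s_out.hidden_states[-1][0, s_emb.shape[1] - n_ans - 1]

# ---- L1 self-distillation: pull student's probe state to teacher's -------
distill = F.l1_loss(s_hidden, t_hidden.detach())  # teacher detached: one-way flow
loss = t_out.loss + s_out.loss + distill
loss.backward()
```

Detaching the teacher's hidden state makes the distillation one-directional: the latent thoughts are pushed to reproduce the reasoning state that the explicit CoT would have produced at the answer position, which is what lets the latents replace the CoT at inference time.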

Seonglae Cho