671B parameter DeepSeek R1
- Backtracking feature
- Answer Quickly feature which cuts thought trace
- Self-correction feature
- Attention Sink feature
DeepSeek R1 Qwen 1.5B, every-layer MLP
- Transcoder
- SAE
Manifold Steering
Manifold_Steering (Aries-iai, updated 2025 Dec 3 9:40)
LLM overthinking lies on a low-dimensional manifold of the activation space; by aligning with that manifold and intervening along it, token counts can be reduced significantly while accuracy is maintained. Manifold Steering: estimate the low-dimensional subspace of reasoning activations using PCA, and steer only within it. Overthinking is not a single direction but a phenomenon bound to a low-dimensional manifold. Results: token reduction of up to ~71% across math, code, and QA tasks, with accuracy maintained or slightly improved.
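The PCA-then-steer idea can be sketched roughly as follows. This is a minimal illustration, not the repository's implementation. It assumes activations are already collected as NumPy arrays, that the raw steering direction is a difference of means between "overthinking" and "concise" activations, and that steering means subtracting the component along the projected direction. The function names (`pca_basis`, `manifold_direction`, `steer`) and the parameters `k` and `alpha` are hypothetical.

```python
import numpy as np


def pca_basis(acts: np.ndarray, k: int) -> np.ndarray:
    """Return the top-k principal directions of acts (n, d) as a (k, d) matrix."""
    centered = acts - acts.mean(axis=0, keepdims=True)
    # Rows of vt are orthonormal principal directions, sorted by
    # decreasing singular value.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return vt[:k]


def manifold_direction(overthink_acts: np.ndarray,
                       concise_acts: np.ndarray,
                       basis: np.ndarray) -> np.ndarray:
    """Difference-of-means direction, projected onto the PCA subspace.

    Assumption: difference of means is used as the raw direction; the
    projection discards any component outside the low-dimensional subspace.
    """
    raw = overthink_acts.mean(axis=0) - concise_acts.mean(axis=0)
    projected = basis.T @ (basis @ raw)
    return projected / np.linalg.norm(projected)


def steer(hidden: np.ndarray, direction: np.ndarray, alpha: float = 1.0) -> np.ndarray:
    """Subtract alpha times each row's component along the unit direction."""
    coeff = hidden @ direction              # shape (n,)
    return hidden - alpha * np.outer(coeff, direction)


# Synthetic demo: activations that live in a 4-dimensional subspace of R^64.
rng = np.random.default_rng(0)
mixing = rng.normal(size=(4, 64))
concise = rng.normal(size=(200, 4)) @ mixing
overthink = rng.normal(size=(200, 4)) @ mixing + 3.0 * mixing[0]

basis = pca_basis(np.vstack([overthink, concise]), k=4)
direction = manifold_direction(overthink, concise, basis)
steered = steer(overthink, direction, alpha=1.0)
```

With `alpha=1.0` the steered activations have no remaining component along `direction`; smaller values give a partial intervention. In a real model the same subtraction would be applied to hidden states at chosen layers during generation, which this sketch does not cover.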

Seonglae Cho
