Steering Along Manifolds to Control Neural Networks
Concept geometry provides a blueprint for controlling the behavior of neural networks—if you know how to look.
https://www.goodfire.ai/research/manifold-steering

The Confidence Manifold: Geometric Structure of Correctness...
When a language model asserts that "the capital of Australia is Sydney," does it know this is wrong? We characterize the geometry of correctness representations across 9 models from 5 architecture...
https://arxiv.org/abs/2602.08159


Seonglae Cho