AI Control Notion
AI Control Benchmarks
arxiv.org
https://arxiv.org/pdf/2312.06942
Concordance AI Control
Token injection likeSteering Vector https://github.com/concordance-co/quote
Token Injection as a Steering Mechanism for Large Language Models
Lightweight steering of LLMs through token injection at inference time
https://www.concordance.co/blog/token-injection-steering-llms


Seonglae Cho