CRL Paper

Creator

Creator

Seonglae Cho

Created

Created

2025 Mar 17 15:38

Editor

Editor

Seonglae Cho

Edited

Edited

2025 Sep 11 16:34

Refs

Refs

Steering on correlated sae features improve benchmmakrs not only probing

CRL (Control model training with RL), CTRL (Control model Training with RL)

SPOT (Sparse Policy Optimization for Circuit control)

OSCAR (Optimizing Sparse Circuits via Autoencoder Reinforcement)

declartion file after acc knolwedgement

Connected papers

Connected Papers | Find and explore academic papers

A unique, visual tool to help researchers and applied scientists find and explore papers relevant to their field of work.

https://www.connectedpapers.com/

Connected Papers | Find and explore academic papers

Control RL Papers

CRL Introduciton

CRL Background & Relvent works

Steer RL Experiment

Control RL Future Work

Control RL Appendix

Control RL Presentation

Recommendations

/////