CAST (Conditional Activation Steering)
activation-steering
IBM • Updated 2025 Jun 26 23:43
- Conditional SAE clamping
- Conditional SAE steering
- Constant SAE clamping
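A minimal sketch of how the three variants above might differ, using plain Python lists to stay dependency-free. The notes only name the variants, so the definitions here are assumptions: "constant" always forces a feature to a fixed value, "conditional" acts only when the feature fires above a threshold, and "steering" adds the decoder direction instead of clamping. `ToySAE` and all function names are hypothetical, not taken from any library.

```python
from dataclasses import dataclass

@dataclass
class ToySAE:
    # Hypothetical toy sparse autoencoder: one encoder row and one decoder row per feature.
    W_enc: list  # [n_features][d_model]
    b_enc: list  # [n_features]
    W_dec: list  # [n_features][d_model]

def encode(x, sae):
    # ReLU(W_enc @ x + b_enc), written out for plain lists.
    return [max(0.0, sum(w * xi for w, xi in zip(row, x)) + b)
            for row, b in zip(sae.W_enc, sae.b_enc)]

def shift_along(x, direction, amount):
    # Move the activation x by `amount` along a decoder direction.
    return [xi + amount * di for xi, di in zip(x, direction)]

def constant_clamp(x, sae, idx, value):
    # Assumed meaning: always set feature idx to `value`, whether or not it fired.
    f = encode(x, sae)[idx]
    return shift_along(x, sae.W_dec[idx], value - f)

def conditional_clamp(x, sae, idx, value, threshold):
    # Assumed meaning: set feature idx to `value` only when it fires above `threshold`.
    f = encode(x, sae)[idx]
    if f <= threshold:
        return list(x)
    return shift_along(x, sae.W_dec[idx], value - f)

def conditional_steer(x, sae, idx, alpha, threshold):
    # Assumed meaning: add alpha * decoder direction only when the feature fires.
    f = encode(x, sae)[idx]
    if f <= threshold:
        return list(x)
    return shift_along(x, sae.W_dec[idx], alpha)

# Identity-like SAE over a 2-d activation, purely for illustration.
sae = ToySAE(W_enc=[[1.0, 0.0], [0.0, 1.0]],
             b_enc=[0.0, 0.0],
             W_dec=[[1.0, 0.0], [0.0, 1.0]])
x = [3.0, 1.0]
clamped = conditional_clamp(x, sae, idx=0, value=0.0, threshold=1.0)
steered = conditional_steer(x, sae, idx=0, alpha=-2.0, threshold=1.0)
```

Each edit is applied as a shift along the decoder direction rather than a full decode, so whatever the SAE fails to reconstruct is left untouched. That is a design choice of this sketch, not something the notes specify.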
Sieve (2024.12)
Targets code generation, specifically steering the model away from using regex (a very simple, naive task)
Compare Alpaca dataset / SORRY-Bench
- AI condition vector (extracted from the prompt)
- Refusal vector (applied to the response)
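The two vectors above can be sketched as a gate plus a shift: the condition vector is checked against the prompt's hidden state, and the refusal vector is added to the response's hidden state only if the check passes. This is a simplified sketch under several assumptions: the gate is a cosine similarity between the hidden state and its projection onto the condition vector, compared to a threshold `theta`; vectors are extracted with a plain difference of means between two prompt sets (a stand-in chosen for brevity, not necessarily how the activation-steering library does it); and all function names are hypothetical.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cosine(a, b):
    denom = math.sqrt(dot(a, a)) * math.sqrt(dot(b, b))
    return dot(a, b) / denom if denom else 0.0

def mean_difference(pos, neg):
    # Stand-in extraction: mean activation of one prompt set minus the other.
    mean = lambda rows: [sum(col) / len(rows) for col in zip(*rows)]
    return [p - q for p, q in zip(mean(pos), mean(neg))]

def project(h, c):
    # Projection of hidden state h onto the condition vector c.
    scale = dot(h, c) / dot(c, c)
    return [scale * ci for ci in c]

def condition_met(prompt_h, c, theta):
    # Gate: does the prompt's hidden state align with the condition direction?
    return cosine(prompt_h, project(prompt_h, c)) > theta

def cast_step(prompt_h, response_h, c, v, theta, alpha):
    # Add alpha * refusal vector v to the response state only if the gate fires.
    if condition_met(prompt_h, c, theta):
        return [hi + alpha * vi for hi, vi in zip(response_h, v)]
    return list(response_h)

# Toy 2-d example: c marks the "condition" direction, v the "refusal" direction.
c = [1.0, 0.0]
v = [0.0, 1.0]
flagged = cast_step([2.0, 0.1], [1.0, 1.0], c, v, theta=0.9, alpha=2.0)
benign = cast_step([0.1, 2.0], [1.0, 1.0], c, v, theta=0.9, alpha=2.0)
```

In a real model the hidden states would come from a chosen layer (for example via forward hooks), and `theta`, `alpha`, and the layer choice would all need tuning; none of those values are given in these notes.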
