CAST (Conditional Activation Steering)
activation-steering (IBM) • Updated 2025 Oct 30 12:7
Compare Alpaca Dataset / Sorry Bench AI
Condition vector (extracted from the prompt)
Refusal vector (applied to the response)
https://arxiv.org/pdf/2409.05907
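
To make the two vectors in this note concrete, here is a minimal sketch of conditional steering in pure Python. It is not the activation-steering library's API. Every function name, the toy 3-dimensional hidden states, and the `theta` and `alpha` values are assumptions made for illustration. The condition check is also simplified to a plain cosine similarity between the prompt's hidden state and the condition vector, which may differ from the exact condition function in the paper.

```python
import math


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    na, nb = math.sqrt(dot(a, a)), math.sqrt(dot(b, b))
    if na == 0.0 or nb == 0.0:
        return 0.0
    return dot(a, b) / (na * nb)


def condition_met(h_prompt, condition_vec, theta):
    """Hypothetical condition check: does the prompt's hidden state point
    in the direction of the condition vector strongly enough?"""
    return cosine(h_prompt, condition_vec) > theta


def steer(h_response, refusal_vec, alpha):
    """Add the scaled refusal vector to a response-time hidden state."""
    return [x + alpha * r for x, r in zip(h_response, refusal_vec)]


def cast_step(h_prompt, h_response, condition_vec, refusal_vec,
              theta=0.5, alpha=1.0):
    """Steer the response only when the prompt satisfies the condition;
    otherwise return the hidden state unchanged."""
    if condition_met(h_prompt, condition_vec, theta):
        return steer(h_response, refusal_vec, alpha)
    return list(h_response)


# Toy vectors (assumed values, not taken from any real model).
condition_vec = [1.0, 0.0, 0.0]      # e.g. derived from Sorry Bench vs. Alpaca prompts
refusal_vec = [0.0, 0.0, 1.0]        # e.g. derived from refusing vs. complying responses
harmful_prompt = [0.9, 0.1, 0.0]     # aligned with the condition direction
harmless_prompt = [0.0, 1.0, 0.2]    # orthogonal to the condition direction
h_response = [0.2, 0.2, 0.2]

steered = cast_step(harmful_prompt, h_response, condition_vec, refusal_vec, alpha=2.0)
untouched = cast_step(harmless_prompt, h_response, condition_vec, refusal_vec, alpha=2.0)
```

In this sketch the refusal vector is added only for the prompt whose hidden state aligns with the condition vector, and the other response is left as it was. That selectivity is the point of making the steering conditional.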