- Concept detection
- classification performance
- Model steering
- LLM judge to rate steered output
- Concept score
- Instruct score
- Fluency score
Limitation
Concept detection did not show significant difference while Model steering discrete is mad e with instruction-following dataset (Alpaca-Eval) which provides much benefit to Prompt-based steering.