Knowledge Distillation Attack
Detecting and preventing distillation attacks
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
https://www.anthropic.com/news/detecting-and-preventing-distillation-attacks

Seonglae Cho