Unlike ShieldGemma, it accepts policy at inference time and makes judgments based on reasoning. This means when policy content changes, it can be immediately reflected without model retraining. Flexible and explainable, but slower and higher compute cost using CoT
gpt-oss-safeguard
Creator
Creator
Seonglae ChoCreated
Created
2025 Nov 7 11:28Editor
Editor
Seonglae ChoEdited
Edited
2025 Nov 7 11:29Refs
Refs
GPT OSS 
