gpt-oss-safeguard

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2025 Nov 7 11:28
Editor
Edited
Edited
2025 Nov 7 11:29
Refs
Refs
GPT OSS
Unlike
ShieldGemma
, it accepts policy at inference time and makes judgments based on reasoning. This means when policy content changes, it can be immediately reflected without model retraining. Flexible and explainable, but slower and higher compute cost using CoT
 
 
 
 
 
 

Recommendations