gpt-oss-safeguard

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2025 Nov 7 11:28
Editor
Edited
Edited
2025 Nov 7 11:29
Refs
Refs
GPT OSS
Unlike
ShieldGemma
, it accepts policy at inference time and makes judgments based on reasoning. This means when policy content changes, it can be immediately reflected without model retraining. Flexible and explainable, but slower and higher compute cost using CoT
 
 
 
 
Introducing gpt-oss-safeguard
New open safety reasoning models (120b and 20b) that support custom safety policies.
Introducing gpt-oss-safeguard
gpt-oss-safeguard - a openai Collection
gpt-oss-safeguard-120b and gpt-oss-safeguard-20b are safety reasoning models built-upon gpt-oss
gpt-oss-safeguard - a openai Collection
 
 

Recommendations