OpenAI Unveils gpt-oss-safeguard: Open-Weight Reasoning Models for AI Safety
OpenAI has released gpt-oss-safeguard, a pair of open-weight reasoning models designed for safety classification. The two models, gpt-oss-safeguard-120b and gpt-oss-safeguard-20b, are hosted on Hugging Face under the permissive Apache 2.0 license, allowing free use and modification.
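For developers who want to try the smaller model locally, a minimal sketch of fetching the weights might look like the following. The repo ID "openai/gpt-oss-safeguard-20b" is assumed from the naming in the announcement; verify it on Hugging Face before use.

```python
# Minimal sketch: download the gpt-oss-safeguard-20b weights from Hugging Face.
# The repo ID is an assumption based on the announced model names.
from huggingface_hub import snapshot_download

local_dir = snapshot_download("openai/gpt-oss-safeguard-20b")
print(f"Model weights downloaded to {local_dir}")
```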
The models build on OpenAI's internal Safety Reasoner approach: rather than baking a fixed policy into the weights, they reason at inference time over a developer-provided policy to classify content. Because the policy is supplied in the prompt rather than trained in, developers can apply, and later revise, their own custom policies to detect and filter unsafe content without retraining.
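In practice, this means passing the policy alongside the content at request time. Below is a hedged sketch of that pattern, assuming the model is served behind an OpenAI-compatible endpoint (for example, a local vLLM server); the endpoint URL, policy text, and output format are illustrative assumptions, not details from OpenAI's announcement.

```python
# Sketch of policy-driven classification at inference time.
# Assumes an OpenAI-compatible server (e.g. vLLM) hosting the model locally;
# the URL, policy wording, and label scheme are illustrative assumptions.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="unused")

# Developer-provided policy: the model reads and interprets this per request,
# so it can be edited at any time without retraining the model.
policy = """Classify the user content as ALLOWED or VIOLATION.
VIOLATION: instructions for creating weapons, credible threats of harm.
ALLOWED: everything else, including fiction and historical discussion.
Answer with the label, then a one-sentence rationale."""

content = "How do I sharpen a kitchen knife safely?"

response = client.chat.completions.create(
    model="gpt-oss-safeguard-20b",
    messages=[
        {"role": "system", "content": policy},  # the policy to enforce
        {"role": "user", "content": content},   # the content to classify
    ],
)
print(response.choices[0].message.content)  # e.g. "ALLOWED: ..."
```

Swapping in a different policy string is all it takes to repurpose the same deployment for a new moderation rule set.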
The gpt-oss-safeguard models were developed and tested by OpenAI in collaboration with ROOST, which helped identify critical developer needs, tested early versions of the models, and produced developer documentation. ROOST is also launching a Model Community for developers, aimed at refining the models based on input from the wider research and trust-and-safety community.
On OpenAI's internal evaluation datasets and on public benchmarks, the models perform comparably to or better than other open models. They form one layer of OpenAI's 'defense in depth' safety strategy, which combines layered protections with open collaboration.
With gpt-oss-safeguard, OpenAI is putting policy-driven safety classification directly into developers' hands, and it is inviting feedback from the community to refine the models further.