OpenAI, the company behind the viral chatbot ChatGPT, has published a 27-page document detailing its approach to mitigating the "catastrophic risks" posed by advanced AI models. This "Preparedness Framework" outlines how the company will monitor, evaluate, and guard against potential threats, with the board of directors holding the final say on the release of new AI models.
The board of directors, which at the time consisted of three affluent white men, is charged with ensuring that the AI tools OpenAI develops are used for the betterment of humanity. Critics have raised concerns about this lack of diversity and about the reliance on self-regulation, prompting calls for greater legislative involvement in the safe development and application of AI technology.
OpenAI's strategy pairs robust safety measures with strategic decision-making. One technique the company has described is "deliberative alignment," which goes beyond standard post-training methods such as RLHF (reinforcement learning from human feedback) and RLAIF (reinforcement learning from AI feedback) by teaching models to reason explicitly over written safety specifications before responding.
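To make the distinction concrete, here is a minimal sketch of the "deliberate, then answer" pattern in Python. Everything in it (the SAFETY_SPEC text, the generate stub, the two-pass flow) is an illustrative assumption; OpenAI's published method trains the reasoning step into the model itself rather than scripting it at inference time.

```python
# Minimal sketch of the "deliberate, then answer" pattern behind
# deliberative alignment. All names are illustrative assumptions,
# not OpenAI's implementation.

SAFETY_SPEC = """\
1. Refuse requests that facilitate physical harm.
2. Refuse requests for targeted harassment.
3. Otherwise, answer helpfully and acknowledge uncertainty.
"""

def generate(prompt: str) -> str:
    """Stand-in for any chat-model call (hypothetical stub)."""
    raise NotImplementedError("plug in a real model here")

def deliberate_then_answer(user_request: str) -> str:
    # Pass 1: ask the model to reason explicitly against the spec.
    deliberation = generate(
        f"Safety spec:\n{SAFETY_SPEC}\n"
        f"Request: {user_request}\n"
        "Quote the relevant rules and decide: comply or refuse?"
    )
    # Pass 2: produce the final answer conditioned on that reasoning.
    return generate(
        f"Your own analysis:\n{deliberation}\n"
        f"Now respond to the request accordingly: {user_request}"
    )
```

In contrast, RLHF and RLAIF shape behavior only through preference signals on final outputs; the point of the pattern above is that the safety rules themselves are consulted and reasoned about at response time.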
The company also employs operational safeguards, such as moderation models, website blocklists, and real-time monitoring systems, to prevent misuse and uphold ethical standards. By giving the board final sign-off and aligning with global AI policy frameworks, OpenAI aims to develop and deploy AI models responsibly, minimizing the risk of catastrophic events.
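As an illustration of how such layered checks can compose, the Python sketch below gates an output behind a domain blocklist and a moderation-model score. The classify stub, the example domains, and the threshold are hypothetical stand-ins, not OpenAI's actual systems.

```python
# Illustrative guardrail pipeline: a domain blocklist plus a
# moderation-model check applied before any output is released.
# Names and thresholds are assumptions, not OpenAI's code.

from urllib.parse import urlparse

BLOCKED_DOMAINS = {"example-malware.test", "example-phish.test"}

def classify(text: str) -> float:
    """Stand-in for a moderation model returning a risk score in [0, 1]."""
    raise NotImplementedError("plug in a real moderation model here")

def gate_response(text: str, urls: list[str], threshold: float = 0.5) -> str:
    # Layer 1: refuse to surface content referencing blocked sites.
    for url in urls:
        if urlparse(url).hostname in BLOCKED_DOMAINS:
            return "[blocked: disallowed website]"
    # Layer 2: run the text through a moderation model.
    if classify(text) >= threshold:
        return "[blocked: flagged by moderation model]"
    return text
```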
Additional background:
- Safety Strategy: OpenAI's safety strategy for its o3 model centers on responsible deployment through rigorous testing and validation, including measures to prevent misuse and adherence to ethical standards.
- Board Involvement: The board of directors plays a significant role in overseeing the development and deployment of AI models, with co-founder Ilya Sutskever actively participating in discussions about safety and capabilities.
- Safety Measures: OpenAI implements safeguards such as moderation models, website blocklists, and real-time monitoring to mitigate risks from AI agents like Operator. These include user confirmation for sensitive actions, limits on which websites an agent may visit, and monitoring systems to catch unintended actions; a sketch of these checks follows this list.
- Governance Initiatives: OpenAI engages in governance initiatives that align with global AI policy frameworks, including participation in AI safety summits and collaborations with other organizations to establish safety standards.
- Transparency and Accountability: In response to concerns about transparency and accountability in AI development, OpenAI has taken steps to address these issues, though internal conflicts and external scrutiny continue to fuel calls for stronger safety measures.
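The sketch below illustrates the user-confirmation and website-limit checks described in the "Safety Measures" item above. The Action type, the allowlist, and the confirm prompt are illustrative assumptions and do not reflect Operator's actual implementation.

```python
# Sketch of agent-side guardrails: restrict actions to an allowlist
# of sites and require explicit user sign-off for sensitive actions.
# All names here are hypothetical, not Operator's API.

from dataclasses import dataclass

ALLOWED_DOMAINS = {"example-shop.test", "example-mail.test"}
SENSITIVE_KINDS = {"purchase", "send_email", "delete"}

@dataclass
class Action:
    kind: str      # e.g. "click", "purchase", "send_email"
    domain: str    # site the agent wants to act on
    summary: str   # human-readable description shown to the user

def confirm(action: Action) -> bool:
    """Ask the user to approve a sensitive action before it runs."""
    reply = input(f"Agent wants to: {action.summary}. Allow? [y/N] ")
    return reply.strip().lower() == "y"

def execute(action: Action) -> None:
    print(f"executing {action.kind} on {action.domain}")

def run_action(action: Action) -> None:
    # Website limits: refuse anything outside the allowlist.
    if action.domain not in ALLOWED_DOMAINS:
        raise PermissionError(f"{action.domain} is outside the website limits")
    # User confirmation: sensitive actions need explicit approval.
    if action.kind in SENSITIVE_KINDS and not confirm(action):
        print("skipped: user declined")
        return
    execute(action)
```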