49. External reproducibility of risks evaluation (Score: 0)

Are code and prompts to allow for an external reproduction of the evaluation of model risks disclosed?

Disclosure:

References:

GitHub simple-evals README: “We are open sourcing it so we can be transparent about the accuracy numbers we’re publishing alongside our latest models.”

GitHub openai/evals README: “Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.”

Score justification:

These evaluation frameworks do not seem to cover certain risk evaluations (e.g., Cyber Range).

Indicator notes:

The released code and prompts need not be the same as what is used internally, but should allow the developer's results on all risk evaluations to be reproduced. The released code must be open-source, following the OSI definition of open-source.

Example disclosure:

The code and prompts to reproduce our evaluations can be found on this GitHub repository link: [URL]

Which uses are explicitly allowed, conditionally permitted, or strictly disallowed under the acceptable use policy for the top-5 distribution channels?

Disclosure:

References:

Score justification:

Usage policy suffices

Indicator notes:

We will award this point for a rough characterization of two or more of permitted, restricted, and prohibited uses across the top-5 distribution channels. We will award this point if the developer has a more general acceptable use policy that it confirms applies across these distribution channels. We will award this point if there are no restrictions on users.

Example disclosure:

Permitted uses include general conversational queries, brainstorming, and coding assistance. Restricted uses include adult or violent content that requires caution or additional review. Prohibited uses include facilitating illicit activity, disinformation campaigns, or harassment. For example, we permit typical user requests like Q&A, text generation, and educational uses. We restrict content that depicts graphic violence or sexual content by applying additional filters. We prohibit any use aiming to conduct unlawful surveillance, promote extremist violence, or defraud others.

95. AUP enforcement process (Score: 0)

What are the methods used by the developer to enforce the acceptable use policy?

Disclosure:

As mentioned in our system card, "the model can refuse to invoke the image generation tool if it detects a prompt that may violate OpenAI’s policies."

References:

Not disclosed

Score justification:

No substantive response

Indicator notes:

We will award this point if the developer discloses the processes (automated or manual) it uses to detect, review, and respond to potential acceptable use policy violations. We will award this point for a reasonable best-effort attempt to provide the bulk of this information, though one line indicating the developer reserves the right to terminate accounts is insufficient. Alternatively, we will award this point if the developer reports that it does not use such methods to enforce its acceptable use policy.

Example disclosure:

We combine automated checks with human review for severe or repeated violations, issuing warnings or suspensions after repeat violations.

96. AUP enforcement frequency (Score: 0)

Are statistics on the developer's AUP enforcement disclosed?

Disclosure:

Not disclosed

References:

Not disclosed

Score justification:

No response

Indicator notes:

We will award this point if the developer discloses enforcement statistics (e.g., violation counts or actions taken) from its enforcement of its acceptable use policy. Alternatively, we will award this point if the developer reports that it does not enforce its acceptable use policy.

Example disclosure:

We publish a quarterly enforcement report detailing violation counts by prohibited use category and the corresponding actions taken at [LINK]

97. Regional policy variations (Score: 1)

Are differences in the developer's acceptable use or model behavior policy across geographic regions disclosed?

Disclosure:

Our usage policies don't change for specific jurisdictions - our usage policy page doesn't change by locale.

References:

Not disclosed

Score justification:

Usage policy does not vary

Indicator notes:

We will award this point if the developer discloses distinctions in its AUP or MBP and provides examples of differences in multiple specific regions, or states that no differences exist. For example, some jurisdictions impose content restrictions beyond those in the developer’s global policy that may necessitate local deviations.

Example disclosure:

In the EU, our model automatically omits certain categories of political content to comply with local election laws. In all other regions, we follow the general global AUP at [URL].

98. Oversight mechanism (Score: 1)

Does the developer have an internal or external body that reviews core issues regarding the model prior to deployment?

Disclosure:

OpenAI’s Safety Advisory Group (SAG) reviewed the results of these evaluations and determined that OpenAI o3 and o4-mini do not reach the High threshold in any of our three Tracked Categories. The Framework currently has three Tracked Categories: Biological and Chemical, Cybersecurity, and AI Self-Improvement.

References:

System card §1 "Introduction": "OpenAI’s Safety Advisory Group (SAG) reviewed the results of our Preparedness evaluations…" This identifies the internal oversight mechanism overseeing safety decisions.

Score justification:

SAG as oversight mechanism suffices

Indicator notes:

We will award this point if the developer discloses that it has such an internal or external body and provides some description of its scope, or alternatively if the developer discloses that it has no such body. An oversight mechanism covers governance structure beyond mere external risk evaluation, asking whether a formal body regularly reviews design and deployment decisions. Core issues may include model objectives, data usage, or risk mitigation.

Example disclosure:

We convene a monthly advisory board of ethicists, civil society representatives, and academics to review training processes and identify new risks. The board's recommendations regarding deployment are not binding.

99. Whistleblower protection (Score: 1)

Does the developer disclose a whistleblower protection policy?

Disclosure:

OpenAI publicly discloses a dedicated Raising Concerns Policy (published Oct 4 2024) that functions as the company’s whistle-blower protection framework. The policy: affirms employees’ right to report safety, legal, or ethical concerns without fear of retaliation and expressly forbids harassment or reprisals; provides several reporting avenues—including managers, HR, Legal, and a 24/7 anonymous Integrity Line—and explains that the Legal & Employee Relations teams investigate all allegations, with certain matters escalated to the Board’s Audit Committee; guarantees employees may also report directly to external regulators such as the NLRB, EEOC, or SEC; and thereby meets all four disclosure criteria (existence, protections, handling/investigation, and external oversight). The same non-retaliation and confidential-reporting commitments are echoed for third-party partners in OpenAI’s Supplier Code of Conduct.

References:

Raising Concerns Policy – no retaliation & external agencies: “Our policy… expressly prohibits harassment and retaliation… and makes clear that employees have the right to make reports or disclosures to government agencies, including the… SEC.”

Raising Concerns Policy – 24/7 Integrity Line & investigation process: “In April 2024, we introduced a new 24/7 Integrity Line… Potential legal or policy violations are handled by our Legal and Employee Relations teams. Certain issues may also be shared with the Audit Committee of the Board of Directors.”

Policy PDF – explicit non-retaliation clause: “OpenAI does not tolerate harassment or retaliation against anyone who raises a concern in good faith… personnel who engage in such conduct will be subject to discipline.”

Supplier Code of Conduct – whistle-blower section: “OpenAI encourages reporting of any concerns… and is committed to ensuring protection for whistleblowers… [including] a strict non-retaliation policy and prompt, confidential investigations.”

Score justification:

OpenAI describes its Raising Concerns Policy

Indicator notes:

We will award this point if the developer discloses (i) the existence of a whistleblower protection policy, (ii) what protections are afforded to whistleblowers, (iii) how reports are handled and investigated, and (iv) any external oversight of the whistleblower protection process. This might include protections for whistleblowers who report safety, ethical, or legal concerns related to the model. We will also award this point if the developer discloses that it has no such policy.

Example disclosure:

We maintain a whistleblower protection policy that prohibits retaliation against employees who report safety or ethical concerns about our models. Reports can be submitted anonymously through our ethics hotline, are reviewed by an independent board committee, and whistleblowers are entitled to legal representation provided by the company. Our policy is audited annually by an independent ethics consultancy.

100. Government commitments (Score: 1)

What commitments has the developer made to government bodies?

Disclosure:

OpenAI has publicly committed to the following government-led initiatives: White House Voluntary Commitments (Jul 21 2023); Bletchley voluntary commitments (Nov 2 2023); Christchurch Call expansion (Nov 10 2023); AI Elections Tech Accord, Munich (Feb 16 2024); Frontier AI Safety Commitments, Seoul (May 21 2024); Seoul AI Business Pledge (May 22 2024); White House IBSA Commitments (Sep 12 2024); EU AI Pact core pledges (Sep 25 2024).

References:

White House Voluntary Commitments list OpenAI among the seven companies meeting at the White House to announce the pledge.

Bletchley & Seoul Summit voluntary commitments – OpenAI’s February 2025 update confirms it “remains committed to fulfilling the voluntary commitments made at previous summits, specifically those set forth at the AI Summits in Bletchley and Seoul”.

Christchurch Call news release welcomes OpenAI as one of four new tech-firm supporters on 10 Nov 2023.

AI Elections Tech Accord webpage lists OpenAI in the roster of 27 signatories to the Munich accord combating deceptive AI election content.

Frontier AI Safety Commitments (Seoul Summit) – UK/Korea government page names OpenAI among 16 organisations agreeing to the commitments.

Seoul AI Business Pledge annex includes “OpenAI” in the list of companies joining the pledge on 22 May 2024.

White House Image-Based Sexual Abuse (IBSA) Commitments list OpenAI among the companies pledging new safeguards on 12 Sep 2024.

OpenAI blog – EU AI Pact notes “On September 25 2024, we signed up to the three core commitments in the EU AI Pact.”

Score justification:

OpenAI summarizes its commitments to eight voluntary commitments (VCs)

Indicator notes:

We will award this point if the company provides an exhaustive list of commitments it has made to government bodies in the jurisdictions where it offers its model.

Example disclosure:

We have committed to the White House Voluntary Commitments and the Seoul Commitments.