Does the developer clearly define a process by which external parties can disclose model vulnerabilities or flaws?

Disclosure:

References:

Not disclosed

Score justification:

Amazon has a public bug bounty where submission can be made. No disclosures can be made to third parties, per the bug bounty's RDP: "Thank you for joining us in supporting ethical and responsible disclosure. By participating in this program, you agree not to share publicly or privately any details or descriptions of your findings with any third party."

Indicator notes:

We will award this point for a description of the process external parties can use for responsbly disclosing model vulnerabilities and flaws, which should include (i) what mechanism external parties can use to disclose vulnerabilities or flaws (e.g., a form, an email) and (ii) what process follows a disclosure (e.g., how much time must parties wait until public release). This is often included with a bug bounty, but can also be standalone. We will award a point if the developer discloses it has no responsible disclosure policy.

Example disclosure:

We maintain a responsible disclosure policy at [URL] that describes how external parties can disclose vulnerabilities and flaws in Model A, including a 45-day disclosure window and an official contact for urgent security vulnerabilities.

79. Safe harbor (Score: 0)

Does the developer disclose its policy for legal action against external evaluators conducting good-faith research?

Disclosure:

Amazon's policy regarding potential legal action against external evaluators conducting good-faith research remains undisclosed.

References:

Not disclosed

Score justification:

Company acknowledges no disclosure

Indicator notes:

We will award this point if the developer discloses whether it has a policy committing it to not pursue legal action against external evaluators conducting good-faith research. This should not be only for software security vulnerabilities, but also AI flaws, and it should be based on researcher conduct standards, not at the sole discretion of the company. We will award this point if the developer provides a clear description of its policy regarding such protections for external researchers, or lack thereof.

Example disclosure:

We do not have a policy for researcher protections for good-faith safety research. OR Our policy ensures no legal action against good‐faith researchers who follow our disclosure guidelines, see: [link]

80. Security incident reporting protocol (Score: 1)

Are major security incidents involving the model disclosed?

Disclosure:

We investigate all reported security vulnerabilities affecting Amazon and AWS services, software, and products including GenAI models. The security bulletin can be found here: https://aws.amazon.com/security/security-bulletins/ We would also issue a CVE and GHSA if it the issue meets the criteria of the Amazon CVE Numbering Authority (CNA) to issue those advisories

References:

Not disclosed

Score justification:

The bottom of the security bulletin page has a link for reporting a vulnerability, and the description and page state how they would report what was found

Indicator notes:

A security incident reporting protocol provides post-deployment transparency about serious incidents or breaches. Security incidents refer to incidents where external security threats affect the model (e.g., data breaches or DDoS attacks on the service). We will award this point if the developer states (i) how to submit a security incident report, (ii) how quickly it will respond, and (iii) when and whether results are disclosed. Every incident need not be reported publicly, but the developer must disclose a policy determining how incidents are reported and disclosed.

Example disclosure:

We publish a public ‘Security Incident Report’ on our website for any confirmed security incident affecting the model within 7 days of a patch being implemented. Users and researchers can report incidents via security@ourcompany.com, and we commit to an initial acknowledgment within 48 hours.

81. Misuse incident reporting protocol (Score: 0)

Are misuse incidents involving the model disclosed?

Disclosure:

Amazon does not disclose a misuse incident reporting protocol, however, within its AUP provides a form for users to report suspected abusive activity.

References:

Not disclosed

Score justification:

Company acknowledges no disclosure

Indicator notes:

A misuse incident reporting protocol provides post-deployment transparency about incidents of misuse involving the model. As opposed to the previous indicator, this indicator is about actors misusing the model to cause real-world harm, such as misinformation operations or cybersecurity attacks. We will award this point if the developer states (i) how to submit a misuse incident report, (ii) how quickly it will respond, and (iii) when and whether results are disclosed. Every incident need not be reported publicly, but there needs to be a policy governing how incidents are reported.

Example disclosure:

We publish a public ‘Misuse Incident Report’ on our website for any confirmed misuse incident within 7 days of a patch being implemented. Users and researchers can report incidents regarding our flagship foundation model via security@ourcompany.com, and we commit to an initial acknowledgment within 48 hours.

82. Post-deployment coordination with government (Score: 1)

Does the developer coordinate evaluation with government bodies?

Disclosure:

We do not coordinate evaluations with any government entities or AI Safety Institutes.

References:

Not disclosed

Score justification:

Company discloses no such coordination

Indicator notes:

We will award this point if the developer specifies which government bodies it is coordinating with and for what types of post-deployment evaluations. Government bodies include AI Safety Institutes, national security agencies, national labs, and international governmental enties such as UN agencies or the G7. Evaluation here may also include sharing of the developer's proprietary evaluation results for help with interpretation.

Example disclosure:

We do not coordinate with any government entities or AI Safety Institutes. OR We coordinate with the UK AISI for post-deployment evaluation of cyber, CB, and autonomy-related capabilities.

83. Feedback mechanisms (Score: 0)

Does the developer disclose a way to submit user feedback? If so, is a summary of major categories of feedback disclosed?

Disclosure:

(1) For Amazon Nova models access through Amazon Bedrock, users can submit feedback through through the feedback form in the AWS AI Service Card for Nova Premier or via the feedback feature in Amazon Bedrock console. (2) For Amazon Nova models accessed through nova.amazon.com, users can submit feedback through the thumbs up/down buttons on responses in interactions on nova.amazon.com

References:

Not disclosed

Score justification:

Amazon discloses feedback mechanisms across distribution channels, no summary disclosed

Indicator notes:

We will award this point if the developer (i) discloses how users can submit feedback (e.g., via a form or a thumbs up/thumbs down for model responses) and (ii) discloses aggregated or categorized feedback data (e.g. a categorization of thumbs up and thumbs down data).

Example disclosure:

Users can submit feedback at this url: [URL] We find that users mainly report issues with API call response times, over-refusals from models, and outdated information in model outputs. A detailed categorization of user reports is available at [URL]

84. Permitted, restricted, and prohibited model behaviors (Score: 1)

Are model behaviors that are permitted, restricted, and prohibited disclosed?

Disclosure:

Prohibited behaviors:: Dangerous activities, self-harm, or use of dangerous substances Use, misuse, or trade of controlled substances, tobacco, or alcohol Physical violence or gore Child abuse or child sexual exploitation Animal abuse or trafficking Misinformation that could undermine public institutions or endanger health Malware or content facilitating cybercrime Discrimination or stereotyping Insults, profanity, obscenity, pornography, hate symbols Full nudity outside of scientific/educational contexts Bias based on demographic characteristics Restricted behaviors: While not explicitly labeled as "restricted," the text implies caution in certain areas: Generating content that could be construed as requesting private information Producing outputs that will be directly surfaced to end users without review Use in workflows producing consequential decisions without human oversight From AWS RAI Policy: Prohibitions. You may not use, or facilitate or allow others to use, the AI/ML Services: for intentional disinformation or deception; to violate the privacy rights of others, including unlawful tracking, monitoring, and identification; to depict a person’s voice or likeness without their consent or other appropriate rights, including unauthorized impersonation and non-consensual sexual imagery; for harm or abuse of a minor, including grooming and child sexual exploitation; to harass, harm, or encourage the harm of individuals or specific groups; to intentionally circumvent safety filters and functionality or prompt models to act in a manner that violates our Policies; to perform a lethal function in a weapon without human authorization or control.

References:

Not disclosed

Score justification:

Amazon provides a helpful breakdown of prohibited and restricted behaviors.

Indicator notes:

We refer to a policy that includes this information as a model behavior policy, or a developer's policy on what the foundation model can and cannot do (e.g. such a policy may prohibit a model from responding to NSFW content). We recognize that different developers may adopt different business models and that some business models may make enforcement of a model behavior policy more or less feasible. We will award this point if at least two of the three categories (i.e. permitted, restricted, and prohibited model behaviors) are disclosed. Alternatively, we will award this point if the developer reports that it does not impose any restrictions on its model's behavior in this way.

Example disclosure:

We allow responses from Model A that include broad Q&A, restrict sexual or harassing content, and prohibit facilitating illegal or violent acts. More details can be found in our guidelines for model behavior here: [link]

85. Model response characteristics (Score: 1)

Are desired model response characteristics disclosed?

Disclosure:

We configure Nova Premier's responses to: - Provide concise answers to simple questions when information is directly available - Include more details for yes/no questions - Use logical reasoning for multi-hop reasoning questions - Be transparent about information gaps by stating when exact answers cannot be found - Include citations to support responses using markers like %[1]%, %[2]%, %[3]% - Avoid completing prompts that could request private information - Adhere to responsible AI objectives through runtime moderation - Maintain user privacy by not storing or sharing customer prompts and completions - Generate responses that align with Amazon's responsible AI dimensions including safety, fairness, controllability, veracity and robustness

References:

Not disclosed

Score justification:

Amazon provides a list of how Nova Premier's responses are configured

Indicator notes:

Model response characteristics include default behaviors or behaviors that the developer steers the model to take. These may include being helpful, taking an objective point of view, or using tools only when necessary. We will award points for a clear description of desired model response characteristics or a statement that there are no such characteristics.

Example disclosure:

We configure responses from Model A to be factual, neutral, and contextually helpful, avoiding personal or biased opinions. More details can be found in our guidelines for model behavior here: [link]

86. System prompt (Score: 0)

Is the default system prompt for at least one distribution channel disclosed?

Disclosure:

No, the system prompt for Amazon Nova Premier is not disclosed for the model accessed through Amazon Bedrock or through nova.amazon.com

References:

Not disclosed

Score justification:

Company acknowledges no disclosure

Indicator notes:

A system prompt is defined as the prompt provided to the system by default that guides the system's behavior. We will award this point for the disclosure of the verbatim text of the full system prompt as well as an explanation for the context in which the system prompt is used.

Example disclosure:

We disclose our default prompt for Model A via our chat interface: ‘You are a helpful AI assistant providing clear, accurate, and policy‐compliant responses.’

87. Intermediate tokens (Score: 1)

Are intermediate tokens used to generate model outputs available to end users?

Disclosure:

Nova Premier makes its intermediate tokens (chain-of-thought reasoning) available to users when instructions are given, and users are advised to use specific instructions to keep the thinking brief and contain it within tags. This is summarized from Amazon Nova's User Guide

References:

Not disclosed

Score justification:

Amazon discloses COT available with prompting

Indicator notes:

Intermediate tokens are defined as any tokens generated by the model before the final output is shown to the user, such as model chains of thought. We will also award this point if a summary of intermediate tokens is made available to end users. If intermediate tokens or summaries are not made available, the developer should provide a justification.

Example disclosure:

Model A is trained to generate intermediate chain-of-thought reasoning, but we withhold most chain-of-thought tokens from final user-facing responses to prevent model distillation. We do disclose chains-of-thought for a small set of research collaborators under NDA.

88. Internal product and service mitigations (Score: 1)

For internal products or services using the model, are downstream mitigations against adversarial attacks disclosed?

Disclosure:

To help prevent potential misuse, Amazon Bedrock implements automated abuse detection mechanisms. These mechanisms are fully automated, so there is no human review of, or access to, user inputs or model completions. To learn more, see Amazon Bedrock Abuse Detection in the Amazon Bedrock User Guide.

References:

Not disclosed

Score justification:

Amazon automatically scans user inputs

Indicator notes:

An internal product or service is a product or service built by the developer. Adversarial attacks include prompt injection, jailbreaking, or malicious queries. Mitigations against adversarial attacks might include specialized prompt filtering, content scanning, or real-time monitoring of queries or accounts. We will award this point if the developer discloses a clear statement of methods used (e.g., a specialized prompt sanitizer or adversarial pattern detector), or if the developer states it does not implement such product-level mitigations against adversarial attacks.

Example disclosure:

In our chatbot, we implement a second-stage content filter that checks user inputs for disallowed topics and attempts to sanitize adversarial prompts. We also log suspicious prompts for manual review.

89. External developer mitigations (Score: 1)

Does the developer provide built-in or recommended mitigations against adversarial attacks for downstream developers?

Disclosure:

Amazon Nova's User Guide provides insights into downstream security controls and also provides recommendations to implement model content guardrails via System Prompt field. The developer also has an option to use AWS Bedrock Guardrails at the application layer to mitigate prompt injection attacks, and block harmful content or leakage of sensitive information.

References:

Not disclosed

Score justification:

AWS Bedrock Guardrails suffices

Indicator notes:

Downstream developers are developers who access the model through a distribution channel. Adversarial attacks include prompt injection, jailbreaking, or malicious queries. Mitigations against adversarial attacks that developers might build in or recommend include content filtering endpoints and recommended prompt templates. We will award this point if the developer discloses (i) technical mitigations (e.g., a developer provided moderation API or classifier) it offers or implements, (ii) recommended best practices or libraries for downstream developers, or (iii) an explicit statement that it does not build or recommend any particular downstream mitigations in this way..

Example disclosure:

Our API includes an optional parameter that will automatically filter user prompts and model outputs for hateful or disallowed content. We also publish guidelines for building robust chat interfaces that resist common prompt injections.

90. Enterprise mitigations (Score: 1)

Does the developer disclose additional or specialized mitigations for enterprise users?

Disclosure:

For enterprise users using Nova via Amazon Bedrock, Bedrock's Abuse Detection mitigates potential misuse to uphold Responsible AI.

References:

Not disclosed

Score justification:

Description of mitigations is sufficient

Indicator notes:

Enterprise users are, for example, large organizations with dedicated service agreements or users of enterprise-specific API deployments or products and services. Additional or specialized mitigations may address enterprise needs such as data privacy controls, advanced prompt/response monitoring, or compliance checks with regulations such as GDPR or HIPAA. Additional or specialized mitigations may include single-tenant deployments, custom filters for specific regulated industries, or advanced logging for compliance. We will award a point if the developer at least describes these mitigations or states that it does not provide such additional or specialized enterprise mitigations.

Example disclosure:

Our enterprise offering for Model A includes a dedicated environment with stricter filtering, a HIPAA-compliant data retention policy, and the ability for enterprise admins to define custom blacklisted topics that the model must refuse.

91. Detection of machine-generated content (Score: 1)

Are mechanisms that are used for detecting content generated by this model disclosed?

Disclosure:

Amazon Nova Premier exclusively generates text outputs, which do not carry any watermkaring. Amazon Nova Reel (video generation) and Amazon Nova Canvas (image generation) models support watermark injection (Canvas, Reel) and C2PA (Canvas).

References:

Not disclosed

Score justification:

Amazon discloses no watermark for text outputs and different watermarks for video and image models

Indicator notes:

A mechanism for detecting machine-generated content might include storing a copy of all outputs generated by the model to compare against, implementing a watermark on model outputs, adding cryptographic metadata (such as C2PA), or training a detector post-hoc to identify such content. We will award this point if any such mechanism is disclosed or if the developer reports that it does not have or use any such mechanism.

Example disclosure:

We train a classifier using model generations and human-written text to identify machine-generated content from Model A and our other models.

92. Documentation for responsible use (Score: 1)

Does the developer provide documentation for responsible use by downstream developers?

Disclosure:

For use of Amazon Nova Premier on Amazon Bedrock, Amazon provides documentation for responsible use by developers including AI Service cards, a suite of tools for building tools responsibly such as Amazon Guardrails, SageMaker Clarify, and ML Governance, and the AWS Responsible AI Policy. For use of Amazon Nova Premier on nova.amazon.com, Amazon provides an Acceptable Use Policy and Terms of Use.

References:

Not disclosed

Score justification:

AWS Responsible AI Policy suffices

Indicator notes:

To receive a point, the developer should provide documentation for responsible use. This might include details on how to adjust API settings to promote responsible use, descriptions of how to implement mitigations, or guidelines for responsible use. We will also award this point if the developer states that it does not provide any such documentation. For example, the developer might state that the model is offered as is and downstream developers are accountable for using the model responsibly.

Example disclosure:

Our Developer Documentation Hub consolidates integration guides, responsible‐use guidelines, and best practices: [link]

93. Permitted and prohibited users (Score: 1)

Is a description of who can and cannot use the model on the top-5 distribution channels disclosed?

Disclosure:

For usage of Amazon Nova models on AWS, no explicit permitted/prohibited users are described. For usage of Amazon Nova models on nova.amazon.com, usage requirements are stated in Section 1.3 of the Terms of Use. Permitted users must be 18 years old.

References:

Not disclosed

Score justification:

Clear description of permitted/prohibited users

Indicator notes:

We will award this point for a description of the company's policies for permitted and prohibitted users on its top-5 distribution channels. We will award this point if the developer has a more general acceptable use policy that it confirms applies across these distribution channels. We will award this point if there are no restrictions on users.

Example disclosure:

We allow usage by individuals 13 years of age or older who accept our Terms of Service. We prohibit use by export controlled entities or persons on denied-parties lists or in countries under U.S. embargo. We also reserve the right to restrict use if users engage in targeted harassment. For example, we only permit users over 13 with valid credentials, and prohibit usage from OFAC-sanctioned regions. We do not allow state-sponsored disinformation agencies to access our services.

94. Permitted, restricted, and prohibited uses (Score: 1)

Which uses are explicitly allowed, conditionally permitted, or strictly disallowed under the acceptable use policy for the top-5 distribution channels?

Disclosure:

For Amazon Nova models on Bedrock, prohibited uses are documented in the AWS Acceptable Use Policy and AWS Responsible AI Policy. For Amazon Nova models on nova.amazon.com, there is a separate Acceptable Use Policy

References:

Not disclosed

Score justification:

Clear disclosure of the AUPs

Indicator notes:

We will award this point for a rough characterization of two or more of permitted, restricted, and prohibited uses across the top-5 distribution channels. We will award this point if the developer has a more general acceptable use policy that it confirms applies across these distribution channels. We will award this point if there are no restrictions on users.

Example disclosure:

Permitted uses include general conversational queries, brainstorming, and coding assistance. Restricted uses include adult or violent content that requires caution or additional review. Prohibited uses include facilitating illicit activity, disinformation campaigns, or harassment. For example, we permit typical user requests like Q&A, text generation, and educational uses. We restrict content that depicts graphic violence or sexual content by applying additional filters. We prohibit any use aiming to conduct unlawful surveillance, promote extremist violence, or defraud others.

95. AUP enforcement process (Score: 1)

What are the methods used by the developer to enforce the acceptable policy?

Disclosure:

The Amazon Nova family of models leverages Bedrock Abuse Detection service (including CSAM detection) to prevent potential misuse by users on Amazon Bedrock and amazon.nova.com

References:

Not disclosed

Score justification:

Some description of Bedrock Abuse Detection

Indicator notes:

We will award this point if the developer discloses the processes (automated or manual) it uses to detect, review, and respond to potential acceptable use policy violations. We will award this point for a reasonable best-effort attempt to provide the bulk of this information, though one line indicating the developer reserves the right to terminate accounts is insufficient. Alternatively, we will award this point if the developer reports that it does not use such methods to enforce its acceptable use policy.

Example disclosure:

We combine automated checks with human review for severe or repeated violations, issuing warnings or suspensions after repeat violations.

96. AUP enforcement frequency (Score: 0)

Are statistics on the developer's AUP enforcement disclosed?

Disclosure:

Amazon does not publish statistics on Acceptable Use Policies for Amazon Bedrock or nova.amazon.com.

References:

Not disclosed

Score justification:

Company acknowledges no disclosure

Indicator notes:

We will award this point if the developer discloses enforcement statistics (e.g., violation counts or actions taken) from its enforcement of its acceptable use policy. Alternatively, we will award this point if the developer reports that it does not enforce its acceptable use policy.

Example disclosure:

We publish a quarterly enforcement report detailing violation counts by prohibited use category and the corresponding actions taken at [LINK]

97. Regional policy variations (Score: 1)

Are differences in the developer's acceptable use or model behavior policy across geographic regions disclosed?

Disclosure:

For use of Amazon Nova models on Bedrock, usage is governed by AWS Customer Agreement and the AWS service Terms. In the AWS Customer Agreement, differences in jurisdictions are mentioned (Section 3, Section 11). For use of Amazon Nova models on nova.amazon.com, the Terms of Use mention restriction of usage to the US (Section 2.4)

References:

Not disclosed

Score justification:

Variations in policy by region are pointed to

Indicator notes:

We will award this point if the developer discloses distinctions in its AUP or MBP and provides examples of differences in multiple specific regions, or states that no differences exist. For example, some jurisdictions impose content restrictions beyond those in the developer’s global policy that may necessesitate local deviations.

Example disclosure:

In the EU, our model automatically omits certain categories of political content to comply with local election laws. In all other regions, we follow the general global AUP at [URL].

98. Oversight mechanism (Score: 1)

Does the developer have an internal or external body that reviews core issues regarding the model prior to deployment?

Disclosure:

In Amazon's Frontier Model Safety Framework, it commits to incorporate of the Safety Framework into the Amazon-wide Responsible AI Governance Program.

References:

Not disclosed

Score justification:

The developer provides a description of its oversight mechanism in its Frontier Model Safety Framework

Indicator notes:

We will award this point if the developer discloses that is has such an internal or external body and provides some description of its scope, or alternatively if the developer discloses that it has no such body. An oversight mechanism covers governance structure beyond mere external risk evaluation, asking whether a formal body regularly reviews design and deployment decisions. Core issues may include model objectives, data usage, or risk mitigation.

Example disclosure:

We convene a monthly advisory board of ethicists, civil society representatives, and academics to review training processes and identify new risks. The board's recommendations regarding deployment are not binding.

99. Whistleblower protection (Score: 0)

Does the developer disclose a whistleblower protection policy?

Disclosure:

Amazon does not have public disclosures about its whistlwblower protection policy.

References:

Not disclosed

Score justification:

Company acknowledges no disclosure

Indicator notes:

We will award this point if the developer discloses (i) the existence of a whistleblower protection policy, (ii) what protections are afforded to whistleblowers, (iii) how reports are handled and investigated, and (iv) any external oversight of the whistleblower protection process. This might include protections for whistleblowers who report safety, ethical, or legal concerns related to the model. We will also award this point if the developer discloses that it has no such policy.

Example disclosure:

We maintain a whistleblower protection policy that prohibits retaliation against employees who report safety or ethical concerns about our models. Reports can be submitted anonymously through our ethics hotline, are reviewed by an independent board committee, and whistleblowers are entitled to legal representation provided by the company. Our policy is audited annually by an independent ethics consultancy.

100. Government commitments (Score: 0)

What commitments has the developer made to government bodies?

Disclosure:

Amazon has publicly committed to collaboration as part of the G7 AI Hiroshima Process Code of Conduct, and the AI Safety Summits in the U.S. and Seoul

References:

Not disclosed

Score justification:

The developer discloses a number of government commitments it has made, though the list is incomplete as it has also signed onto the White House Voluntary Commitments.

Indicator notes:

We will award this point if the company provides an exhaustive list of commitments it has made to government bodies in the jurisdictions where it offers its model.

Example disclosure:

We have committed to the White House Voluntary Committments and the Seoul Committments.