What methods does the developer use to acquire data used to build the model?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
What are the top-5 sources (by volume) of publicly available datasets acquired for building the model?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
If data collection involves web-crawling, what is the crawler name and opt-out protocol?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
What are the top-5 sources (by volume) of usage data from the developer's products and services that are used for building the model?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
For the top-5 sources of usage data, how are users of these products and services made aware that this data is used for building the model?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
What are the top-5 sources (by volume) of licensed data acquired for building the model?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
For each of the top-5 sources of licensed data, are details related to compensation disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
What are the top-5 sources (by volume) of new human-generated data for building the model?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
For each of the top-5 sources of human-generated data, what instructions does the developer provide for data generation?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
For the top-5 sources of human-generated data, how are laborers compensated, where are they located, and what labor protections are in place?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
What are the top-5 sources (by volume) of synthetic data acquired for building the model?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
For the top-5 sources of synthetically generated data, what is the primary purpose for data generation?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
What are the methods the developer uses to process acquired data to determine the data directly used in building the model?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
For each data processing method, what is its primary purpose?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
For each data processing method, how does the developer implement the method?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Is the size of the data used in building the model disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
For all text data used in building the model, what is the composition of languages?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
For all the data used in building the model, what is the composition of domains covered in the data?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Does a third-party have direct access to the data used to build the model?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Is the data used to build the model described in enough detail to be externally replicable?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Is the amount of compute used in the model's final training run disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Is the amount of compute used to build the model, including experiments, disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Is the amount of time required to build the model disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
For the primary hardware used to build the model, is the amount and type of hardware disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Is the compute provider disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Is the amount of energy expended in building the model disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Is the amount of carbon emitted in building the model disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Is the amount of clean water used in building the model disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
How is compute allocated across the teams building and working to release the model?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Are all stages in the model development process disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
For all stages that are described, is there a clear description of the associated learning objectives or a clear characterization of the nature of this update to the model?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Does the developer release code that allows third-parties to train and run the model?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
How are employees developing and deploying the model organized internally?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
What is the cost of building the model?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Are all basic model properties disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Is a detailed description of the model architecture disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Is the model(s) the model is derived from disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Is the compute and time required for model inference disclosed for a clearly-specified task on clearly-specified hardware?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Is a protocol for granting external entities API credits for the model disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Does the developer disclose if it provides specialized access to the model?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Are the model's weights openly released?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Are the agent protocols supported for the model disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Are the specific capabilities or tasks that were optimized for during post-training disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Does the developer evaluate the model's capabilities prior to its release and disclose them concurrent with release?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Are code and prompts that allow for an external reproduction of the evaluation of model capabilities disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Does the developer measure and disclose the overlap between the training set and the dataset used to evaluate model capabilities?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Are the risks considered when developing the model disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Does the developer evaluate the model's risks prior to its release and disclose them concurrent with release?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Are code and prompts to allow for an external reproduction of the evaluation of model risks disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Are the external entities have evaluated the model pre-deployment disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Are the parties contracted to evaluated model risks disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Are the post-training mitigations implemented when developing the model disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Does the developer disclose how the post-training mitigations map onto the taxonomy of risks?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Does the developer evaluate and disclose the impact of post-training mitigations?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Are code and prompts to allow for an external reproduction of the evaluation of post-training mitigations disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Does the developer disclose the security measures used to prevent unauthorized copying (“theft”) or unauthorized public release of the model weights?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Are the stages of the model's release disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Are risk thresholds disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Is there a disclosed protocol for versioning and deprecation of the model?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Is there a disclosed change log for the model?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Is a forward-looking roadmap for upcoming models, features, or products disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Are the top-5 distribution channels for the model disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Is the quantization of the model served to customers in the top-5 distribution channels disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Are the terms of use of the model disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
What are the top-5 distribution channels for which the developer has usage data?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
For each of the top-5 distribution channels, how much usage is there?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Is a representative, anonymized dataset classifying queries into usage categories disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Is a policy for data retention and deletion disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Across all forms of downstream use, are statistics of model usage across geographies disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
What are the top-5 internal products or services using the model?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
What are the top-5 external products or services using the model?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
How many monthly active users are there for each of the top-5 internal products or services using the model?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Across all distribution channels for which the developer has usage data, what portion of usage is consumer versus enterprise?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Across all distribution channels for which the developer has usage data, what are the top-5 enterprises that use the model?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
What are the 5 largest government contracts for use of the model?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Is an assessment of the benefits of deploying the model disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Does the developer operate a public bug bounty or vulnerability reward program under which the model is in scope?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Does the developer clearly define a process by which external parties can disclose model vulnerabilities or flaws?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Does the developer disclose its policy for legal action against external evaluators conducting good-faith research?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Are major security incidents involving the model disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Are misuse incidents involving the model disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Does the developer coordinate evaluation with government bodies?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Does the developer disclose a way to submit user feedback? If so, is a summary of major categories of feedback disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Are model behaviors that are permitted, restricted, and prohibited disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Are desired model response characteristics disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Is the default system prompt for at least one distribution channel disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Are intermediate tokens used to generate model outputs available to end users?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
For internal products or services using the model, are downstream mitigations against adversarial attacks disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Does the developer provide built-in or recommended mitigations against adversarial attacks for downstream developers?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Does the developer disclose additional or specialized mitigations for enterprise users?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Are mechanisms that are used for detecting content generated by this model disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Does the developer provide documentation for responsible use by downstream developers?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Is a description of who can and cannot use the model on the top-5 distribution channels disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Which uses are explicitly allowed, conditionally permitted, or strictly disallowed under the acceptable use policy for the top-5 distribution channels?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
What are the methods used by the developer to enforce the acceptable policy?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Are statistics on the developer's AUP enforcement disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Are differences in the developer's acceptable use or model behavior policy across geographic regions disclosed?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Does the developer have an internal or external body that reviews core issues regarding the model prior to deployment?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
Does the developer disclose a whistleblower protection policy?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure:
What commitments has the developer made to government bodies?
Disclosure:
References:
Score justification:
Indicator notes:
Example disclosure: