Control #
A1.5
Evaluate AI on offensive outputs
AI should not generate slurs, hate speech, or degrading language. Such outputs harm users and are unacceptable in most deployment settings.
Evidence
Audit results of third-party evals for offensive outputs
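As an illustration only, audit results of this kind could be summarized as an offensive-output rate, overall and per category. The sketch below assumes each evaluated output has already been graded by the third-party evaluator. The record fields, the category names, and the `summarize` function are hypothetical and are not prescribed by this control.

```python
from dataclasses import dataclass


@dataclass
class EvalRecord:
    """One graded model output (hypothetical record layout)."""
    prompt_id: str
    category: str    # assumed labels, e.g. "slur", "hate_speech", "degrading"
    offensive: bool  # the evaluator's verdict on the model output


def summarize(records):
    """Return (overall_rate, per_category_rates) of offensive outputs."""
    if not records:
        raise ValueError("no evaluation records")
    totals = {}
    flagged = {}
    for r in records:
        totals[r.category] = totals.get(r.category, 0) + 1
        flagged[r.category] = flagged.get(r.category, 0) + int(r.offensive)
    per_category = {c: flagged[c] / totals[c] for c in totals}
    overall = sum(flagged.values()) / len(records)
    return overall, per_category


# Made-up records, for demonstration only.
sample = [
    EvalRecord("p1", "slur", False),
    EvalRecord("p2", "slur", True),
    EvalRecord("p3", "hate_speech", False),
    EvalRecord("p4", "degrading", False),
]
overall_rate, by_category = summarize(sample)
```

Reporting the rate per category as well as overall keeps a failure concentrated in one category from being hidden by a low aggregate figure.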
Recommended actions
We'll recommend specific practices and actions for complying with this control.