Supported Labels
| Label ID | Description |
|---|---|
toxic-v1 | Harmful language that insults, demeans, or degrades in a general way, not necessarily aimed at a specific person. This category is reserved for language intended to cause emotional harm, not for references to illegal or toxic substances. |
profanity-v1 | Use of strong or offensive swear words. |
hate-speech-v1 | Language that demonizes or incites harm toward a group or individual based on inherent traits, often calling for violence or systemic discrimination. |
harassment-v1 | Targeted abuse that insults or degrades a specific person or group, focusing on personal traits or beliefs. This language aimed at a specific person or group that attacks their character or reputation, this can include defamatory or accusatory statements meant to harm someone’s standing. |
self-harm-v1 | References that encourage or normalize self-injurious behavior. |
adult-v1 | Explicit descriptions, references, or portrayals of sexual acts or behavior intended to evoke sexual arousal. This excludes non-sexual explicit content. |
violent-v1 | Language that incites or glorifies physical harm or injury. |
drugs-v1 | References, descriptions, or endorsements of the use, abuse, or distribution of drugs, including illegal substances or the misuse of legal drugs. |
firearms-v1 | Content discussing the use, possession, or distribution of guns and other weapons, especially when such discussions could promote or cause violence or unsafe practices. |
cybersecurity-v1 | Content related to computer security, including discussions on hacking, data breaches, and measures to hack digital systems or gain unauthorized access. |
Usage
When submitting text for moderation, include the desired labels in your request. You can specify all labels or only the ones relevant to your use case: RequestNext Steps
Moderate Text Content
Learn how to use the Text Moderation API to scan and moderate text content.
Text Moderation API Reference
Explore the complete API reference for text moderation, including request and response details.

