Mistral Moderation
API to detect harmful text and PII in chat inputs and outputs
About
Mistral Moderation is a classifier API that detects harmful content categories such as hate, violence, and PII in raw text or in chat conversations. It also offers a safe_prompt option that prepends a guardrail system prompt to steer model behavior and reduce unsafe generations.
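A minimal sketch of calling the raw-text moderation endpoint over plain HTTP. The endpoint path, payload shape, and category names here are assumptions based on Mistral's public docs; `build_moderation_request` is a hypothetical helper for illustration, and the network call only runs when a `MISTRAL_API_KEY` environment variable is set.

```python
import json
import os
import urllib.request

# Raw-text moderation endpoint (assumed path; check current Mistral docs).
API_URL = "https://api.mistral.ai/v1/moderations"


def build_moderation_request(texts, model="mistral-moderation-latest"):
    """Build the JSON payload for a raw-text moderation call (hypothetical helper)."""
    return {"model": model, "input": texts}


payload = build_moderation_request(["Example text to screen for harmful content."])
print(json.dumps(payload))

api_key = os.environ.get("MISTRAL_API_KEY")
if api_key:
    req = urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )
    with urllib.request.urlopen(req) as resp:
        result = json.load(resp)
        # The response is expected to carry per-category flags/scores,
        # e.g. result["results"][0]["categories"] (shape assumed).
        print(result)
```

For chat inputs and outputs, the same idea applies with a conversational endpoint that accepts a list of role/content messages instead of raw strings.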