Nvidia

NVIDIA: Nemotron 3.5 Content Safety (free)

nvidia/nemotron-3.5-content-safety

NVIDIA Nemotron 3.5 Content Safety is a compact 4B-parameter multimodal guardrail model from NVIDIA, fine-tuned from Google Gemma-3-4B. It moderates both inputs to and responses from LLMs and VLMs, accepting text and image input and returning text output: a safe/unsafe classification for the user prompt and the response, safety category labels, and an optional reasoning trace. It covers 12 languages with a context window of up to 128K tokens. It is suited for prompt and response moderation, content classification, safety pipelines, and enterprise AI guardrails with policy enforcement, and includes a togglable reasoning mode. It is part of the NVIDIA Nemotron family of open models for agentic AI.

Context window: 128,000 tokens
Input: text, image
Output: text
Pricing: $0/M input tokens, $0/M output tokens

View on OpenRouter. Model data sourced from OpenRouter.