Mistral AI

Mistral: Voxtral Small 24B 2507

mistralai/voxtral-small-24b-2507

Voxtral Small is an enhancement of Mistral Small 3, incorporating state-of-the-art audio input capabilities while retaining best-in-class text performance. It excels at speech transcription, translation and audio understanding. Input audio is priced at $100 per million seconds.

  • Context window: 32,000 tokens
  • Input: text, audio, file
  • Output: text
  • Pricing: $0.1/M input tokens, $0.3/M output tokens

View on OpenRouter. Model data sourced from OpenRouter.