Mistral Releases Voxtral: Its First Open Source AI Audio Model

Mistral Releases Voxtral: Its First Open Source AI Audio Model
French AI startup Mistral has entered the competitive audio AI market with the launch of Voxtral, its first family of open-weight audio models. This move aims to challenge the dominance of proprietary systems by offering businesses a more affordable and controllable alternative for speech intelligence.
Key Features and Benefits of Voxtral:
- Open-Source Alternative: Voxtral provides an open-weight model, contrasting with closed, expensive corporate solutions. This allows for greater control, customization, and potentially lower costs for developers and businesses.
- Production-Ready Speech Intelligence: Mistral claims Voxtral is the first open model capable of deploying "truly usable speech intelligence in production," addressing the limitations of cheaper open systems that often struggle with transcription accuracy and comprehension.
- Cost-Effective: The company states Voxtral is "less than half the price" of comparable commercial solutions, making advanced speech AI more accessible.
- Advanced Capabilities:
- Transcription: Can transcribe up to 30 minutes of audio.
- Comprehension: Leverages its LLM backbone (Mistral Small 3.1) to understand audio content, enabling users to ask questions, generate summaries, and trigger real-time actions (like API calls).
- Multilingual Support: Capable of transcribing and understanding multiple languages, including English, Spanish, French, Portuguese, Hindi, German, Dutch, and Italian.
Voxtral Variants:
Mistral is offering two main variants of its speech understanding models:
-
Voxtral Small:
- Parameters: 24 billion
- Target Use: Production-scale deployments.
- Competitors: Comparable to ElevenLabs Scribe, GPT-4o-mini, and Gemini 2.5 Flash.
-
Voxtral Mini:
- Parameters: 3 billion
- Target Use: Local and edge deployments.
- Voxtral Mini Transcribe: A stripped-down, fast API version optimized for transcription-only use cases. Mistral claims it outperforms OpenAI Whisper at less than half the price.
Availability and Pricing:
- Access: Users can try Voxtral for free by downloading the API on Hugging Face or testing the models via Mistral's chatbot, Le Chat.
- API Integration: Starts at $0.001 per minute.
Company Context:
This release follows Mistral's recent announcement of Magistral, its family of reasoning models. Mistral AI, a prominent European AI firm, is known for its commitment to open-source AI. The company is reportedly in talks to raise up to $1 billion in equity.
Image Credits:
- The main article image features the logo of the French company Mistral AI.
- An additional image displays Mistral's branding.
Related Topics:
- AI
- Mistral AI
- Voxtral
- Open Source AI
- Speech Recognition
- AI Models
- AI for Business
- AI Transcription
- AI Voice
- AI Chatbots
Author Information:
Rebecca Bellan is a senior reporter at TechCrunch, covering autonomy, AI, electrification, and more. She co-hosts the Equity podcast and writes the TechCrunch Daily newsletter.
Original article available at: https://techcrunch.com/2025/07/15/mistral-releases-voxtral-its-first-open-source-ai-audio-model/