Speech Recognition

5 articles in this category

AI News, Speech Recognition, Natural Language Processing

Revolutionizing Voice AI: Deepgram's Quest for Universal Reliability

Deepgram improves speech-to-text accuracy by 30%.

AI News, Speech Recognition, Low-Resource Languages

Elevating Voices in AI: Microsoft Research Launches Paza & PazaBench

Microsoft Research unveils Paza, a human-centered speech pipeline, and PazaBench, the first leaderboard for low-resource languages, covering 39 African languages and 52 models.

AI News, Speech Recognition, AI Agents

Microsoft Releases VibeVoice-ASR: A Unified Speech-to-Text Model for Long-Form Audio

Microsoft’s VibeVoice-ASR tackles long-form audio transcription, achieving 60-minute single-pass processing with structured output.

AI News, Speech Recognition, NVIDIA

NVIDIA Releases Nemotron Speech ASR: Low-Latency Speech Recognition

NVIDIA released Nemotron Speech ASR, an open-source transcription model that achieves a word error rate (WER) of approximately 7.84% at a 0.16 s chunk size for low-latency applications.

AI News, Gemini Models, Speech Recognition

Improved Gemini audio models for powerful voice interactions

Google’s upgraded Gemini 2.5 Native Audio model achieves a 71.5% score on ComplexFuncBench Audio, improving voice agent capabilities.
