Tag:

speech

AI News
NVIDIA AI releases Nemotron Speech ASR: a new open source transcription model designed from the ground up for low-latency use cases like voice agents

by January 7, 2026

January 7, 2026

NVIDIA recently released its new streaming English transcription model (Nemotron Speech ASR) built specifically for low-latency voice agents and live captioning. outpost nvidia/nemotron-speech-streaming-en-0.6b On Hugging Face combines a cache-aware FastConformer …

0 Facebook Twitter Pinterest Email
Generative AI
Google Health AI releases MedASR: a conformer-based medical speech to text model for clinical dictation

by December 24, 2025

December 24, 2025

The Google Health AI team has released MedASR, an open source medical speech to text model that targets clinical dictation and doctor patient conversations and is designed to plug directly …

0 Facebook Twitter Pinterest Email
Generative AI
Google Translate brings real-time speech translation to any headphones

by December 14, 2025

December 14, 2025

The latest update to Google Translate brings live speech translation with support for over 70 languages, which is basically only available on the Pixel Buds, any headphones you want. Its …

0 Facebook Twitter Pinterest Email
AI News
Microsoft AI releases VibeVoice-RealTime: a lightweight real-time text-to-speech model that supports streaming text input and robust long-form speech generation.

by December 7, 2025

December 7, 2025

Microsoft has released vibevoice-realtime-0.5bA real-time text to speech model that works with streaming text input and long form speech output, aimed at agent style applications and live data narration. The …

0 Facebook Twitter Pinterest Email

Newer Posts

Older Posts