Text to Speech Synthesis

22h

The enterprise voice AI split: Why architecture — not model quality — defines your compliance posture

Enterprise voice AI has fractured into three architectural paths. The choice you make now will determine whether your agents ...

Microsoft

VALL-E R: Robust and Efficient Zero-Shot Text-to-Speech Synthesis via Monotonic Alignment

With the help of discrete neural audio codecs, large language models (LLM) have increasingly been recognized as a promising methodology for zero-shot Text-to-Speech (TTS) synthesis. However, sampling ...

Gemini Voice Brings Fast Multi-Speaker Audio, Rich Styles and 32k Context Window

Built on Gemini 2.5 Flash and Pro with a 32,000-token context window, you get faster results and precise delivery for ...

Year ender 2025: Tracing rise of AI assistants from reactive to proactive

In 2025, AI assistants crossed a tipping point, transforming from reactive tools into proactive partners, shaping how people ...

WinBuzzer

Microsoft Launches ‘Mini’ GPT Voice Models in Azure Foundry to Cut Latency and Cost

Microsoft's new gpt-realtime-mini and gpt-4o-mini models in Azure AI Foundry offer 70% lower costs and 50% better accuracy, targeting enterprise voice agents.

The Manila Times

Voximplant and Deepgram Bring Production Voice AI to Real-World Calls

New York, NY, Dec. 18, 2025 (GLOBE NEWSWIRE) -- Voximplant, a leading cloud communications platform, announced native support ...

Unite.AI

Adding Dialogue to Real Video With AI

A new AI framework can rewrite, remove or add a person’s words in video without reshooting, in a single end-to-end system. Three years ago, the internet would have been stunned by any one of the 20-30 ...

eLife

High-Fidelity Neural Speech Reconstruction through an Efficient Acoustic-Linguistic Dual-Pathway Framework

This study presents a valuable advance in reconstructing naturalistic speech from intracranial ECoG data using a dual-pathway model. The evidence supporting the claims of the authors is solid, ...

Voice&Data

Speech-to-speech translation enters the real-time AI era

Speech-to-speech translation is driving industry innovation as AI, edge, and cloud platforms enable real-time, privacy-aware ...

Slator

AppTek Pioneers Next-Generation Expressive Text-to-Speech for AI Dubbing

While AI has made significant progress in generating intelligible synthetic speech, a critical challenge remains: prosody. Text-to-speech systems struggle to replicate the rhythmic and melodic ...

The New York Times

In Their Own Words: Trump and Top Officials Change Tone on Free Speech

In the wake of Charlie Kirk’s assassination, the president’s pledges to guarantee free speech have been replaced by efforts to suppress — and even criminalize — what their critics have to say. By ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results