On Tuesday, Meta announced SeamlessM4T, a multimodal AI model for speech and text translations. As a neural network that can process both text and audio, it can perform text-to-speech, speech-to-text, ...
Researchers at Amazon have trained the largest ever text-to-speech model yet, which they claim exhibits “emergent” qualities improving its ability to speak even complex sentences naturally. The ...
If you're online in any capacity, chances are good a big chunk of your time is spent reading through mountains of content. Whether you find yourself scanning through articles, tutorials, emails, or ...
Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More New York City startup Hume AI emerged from stealth two years ago and has ...
Want smarter insights in your inbox? Sign up for our weekly newsletters to get only what matters to enterprise AI, data, and security leaders. Subscribe Now ElevenLabs, the highly-valued AI voice ...
New research shows models can be directly edited to hide selected voices, even when users specifically ask for them. A technique known as “machine unlearning” could teach AI models to forget specific ...
Microsoft’s AI Manager Mustafa Suleyman recently unveiled in a social media post a new feature called “Scripted Mode” in ...
Google is enhancing Gemini's text-to-speech (TTS). On Tuesday at Google I/O 2025, the company previewed a new TTS feature, built on native audio output, that can "converse in more expressive ways." ...
Podcast recording and editing platform Podcastle is now joining other companies in the AI-powered, text-to-speech race by releasing its own AI model called Asyncflow v1.0. An API for developers will ...