Web Speech API Google

Gemini 2.5 Text-to-Speech Update Brings Realistic AI Voices

Google has updated its Gemini text-to-speech technology, giving developers natural AI voices with pacing tone and multi-speaker support.

5don MSN

Stop using ChatGPT for everything: The AI models I use for research, coding, and more (and which I avoid)

From GPT to Claude to Gemini, model names change fast, but use cases matter more. Here's how I choose the best model for the ...

Streaming Media

AI's Streaming Stack: Meet the Media Workflows

How has AI entered the media workflow? For this new column, we'll look at different applications used in the media industry. For this issue, we'll start with asset management, asset storefronts, and ...

Runway claims its GWM-1 “world models” can stay coherent for minutes at a time

Lastly, GWM Avatars combines generative video and speech in a unified model to produce human-like avatars that emote and move ...

17don MSN

Amazon is forging a walled garden for enterprise AI

The result is a proprietary model, which Amazon calls “Novellas”, deployed in the AWS Bedrock AI-as-a-service platform.

RCR Wireless News

How AI phones will rewrite mobile economics (Analyst Angle)

Apple’s “App Intents” and Huawei’s “Intelligent Agent Framework” allow the OS to expose app functionalities as discrete ...

TestingCatalog

xAI launches Grok Voice Agent API for real-time voice apps

AI introduces the Grok Voice Agent API, offering developers real-time speech capabilities and configurable voice options for ...

Opinion

15don MSNOpinion

Show inaccessible results

Gemini 2.5 Text-to-Speech Update Brings Realistic AI Voices

Stop using ChatGPT for everything: The AI models I use for research, coding, and more (and which I avoid)

AI's Streaming Stack: Meet the Media Workflows

Runway claims its GWM-1 “world models” can stay coherent for minutes at a time

Amazon is forging a walled garden for enterprise AI

How AI phones will rewrite mobile economics (Analyst Angle)

xAI launches Grok Voice Agent API for real-time voice apps

What would ethical age verification look like online?

This self-hosted tool turns audio into podcast-style Obsidian notes

AWS re:Invent 2025: The 15 Biggest Products, AI And News Unveiled

AWS re:Invent 2025: Amazon Lays out Strategy for Agentic AI and Silicon Sovereignty