How has AI entered the media workflow? For this new column, we'll look at different applications used in the media industry. For this issue, we'll start with asset management, asset storefronts, and ...
xAI introduces the Grok Voice Agent API, offering developers real-time speech capabilities and configurable voice options for ...
Speakr is a self-hosted Docker-based tool that converts spoken audio to text. It provides automatic speech recognition (ASR) ...
Google Translate now boasts live speech-to-speech translation, thanks to Gemini. This means any pair of headphones—including ...
While OpenAI began this shift back in March 2025 with its Responses API, Google’s entry signals its own efforts to advance ...
On December 12, 2025, Google announced that it had rolled out a major upgrade to Google Translate, powered by its latest ...
Google updates Gemini 2.5 Flash Native Audio for smoother voice chats, stronger instruction following, and live speech translation in Translate and Gemini Live.
Today marks an exciting moment for the developer community as xAI officially introduces the Grok Voice ...
Dr. Chris Hillman, Global AI Lead at Teradata, joins eSpeaks to explore why open data ecosystems are becoming essential for ...
Google has unlocked Live Translate for all Android headphones using Gemini 2.5 and has added daily streaks to challenge ...