Tags: developer tools

OpenAI Unveils Three Real‑Time Voice Models, Expanding AI to Live Conversation, Translation and Streaming Transcription

OpenAI Unveils Three Real‑Time Voice Models, Expanding AI to Live Conversation, Translation and Streaming Transcription Digital Trends
OpenAI announced three new audio models for its Realtime API—GPT‑Realtime‑2, GPT‑Realtime‑Translate and GPT‑Realtime‑Whisper. The suite pushes voice AI beyond simple back‑and‑forth exchanges, offering live reasoning, on‑the‑fly translation across 70+ languages and streaming transcription. Developers can now build assistants that schedule home tours, manage travel bookings or provide real‑time captions, while pricing starts at $0.017 per minute for Whisper and $0.034 per minute for Translate, with GPT‑Realtime‑2 billed at $32 per million audio tokens. Read more

OpenAI adds real‑time voice, translation and transcription to its API

OpenAI adds real‑time voice, translation and transcription to its API TechCrunch
OpenAI announced Thursday that its API now supports three new voice‑focused models—GPT‑Realtime‑2, GPT‑Realtime‑Translate and GPT‑Realtime‑Whisper. The suite lets developers build applications that can converse, translate and transcribe speech on the fly, with support for more than 70 input languages and 13 output languages. Billing is split between per‑minute rates for translation and transcription and token‑based pricing for the conversational model. OpenAI says the tools target customer‑service, education, media and creator platforms, and includes guardrails to curb misuse. Read more

OpenAI Launches Chrome Extension for Codex, Expanding AI Coding Tools to Browsers

OpenAI Launches Chrome Extension for Codex, Expanding AI Coding Tools to Browsers Engadget
OpenAI unveiled a Chrome extension for its Codex platform, letting developers test web apps, pull context from multiple tabs, and run DevTools alongside other tasks. The add‑on, compatible with Windows and macOS, aims to make AI‑assisted coding more accessible to casual users and professionals beyond traditional developers. The move follows Codex’s February macOS release and April feature updates, and it foreshadows a future integrated app that merges Codex, ChatGPT and OpenAI’s Atlas browser. Read more

Anthropic Unveils ‘Dreaming’ Feature for Claude Managed Agents

Anthropic Unveils ‘Dreaming’ Feature for Claude Managed Agents Ars Technica2
San Francisco – At the Code with Claude developers’ conference, Anthropic announced a new “dreaming” capability for its Claude Managed Agents. The feature, now in research preview, scans recent interactions, extracts salient details and stores them in memory to improve future tasks. Anthropic says dreaming helps mitigate the limited context windows of large‑language models by preserving critical information across long‑running projects. The rollout is currently restricted to Managed Agents on the Claude Platform, a higher‑level alternative to the Messages API that lets multiple agents collaborate over extended periods. Read more