The Next Web OpenAI announced three new voice‑AI models—GPT‑Realtime‑2, GPT‑Realtime‑Translate, and GPT‑Realtime‑Whisper—bringing GPT‑5‑class reasoning to live audio, real‑time translation in over 70 languages, and low‑latency streaming transcription. The rollout promises faster turn‑taking, parallel tool calls, tone control and a 128K context window, while pricing undercuts most enterprise solutions. Early adopters such as Zillow and BolnaAI report significant gains in call success and word‑error rates, signaling a shift toward integrated, end‑to‑end voice agents.
Read more