Google Gemini Supera ChatGPT na Transcrição de Áudio com Rótulos de Falante

A user struggled with speaker‑less transcriptions generated by the iPhone Notes app. By exporting the audio file and feeding it to Google Gemini 3 Pro, the AI produced a full transcript that correctly identified each speaker. An attempt to achieve the same result with ChatGPT 5.1, even using a Plus account, failed because the model could not access the audio file. The experience highlights Gemini’s strength in handling raw audio and speaker identification, while exposing limitations in ChatGPT’s current audio‑processing capabilities. Ler mais

Sep 11, 2025

Google Gemini Adiciona Capacidade de Upload de Arquivos de Áudio

Google has expanded its Gemini AI assistant to accept audio file uploads, allowing users to obtain transcriptions, summaries and key information from recordings up to ten minutes long. The feature, described as the most‑requested addition by Gemini’s VP Josh Woodward, works through the web and mobile apps and complements existing Gemini Live voice interactions. While free‑tier users face daily limits and pricing details remain undisclosed, the update positions Gemini alongside competitors like Anthropic’s Claude and Perplexity, which also offer audio processing tools. Ler mais