
Tags: audio transcription

Google Gemini Outperforms ChatGPT in Audio Transcription with Speaker Labels

A user struggled with speaker‑less transcriptions generated by the iPhone Notes app. By exporting the audio file and feeding it to Google Gemini 3 Pro, the AI produced a full transcript that correctly identified each speaker. An attempt to achieve the same result with ChatGPT 5.1, even using a Plus account, failed because the model could not access the audio file. The experience highlights Gemini’s strength in handling raw audio and speaker identification, while exposing limitations in ChatGPT’s current audio‑processing capabilities.

Google Gemini Adds Audio File Upload Capability

Google has expanded its Gemini AI assistant to accept audio file uploads, allowing users to obtain transcriptions, summaries and key information from recordings up to ten minutes long. The feature, which Gemini’s VP Josh Woodward described as the most‑requested addition, works through the web and mobile apps and complements existing Gemini Live voice interactions. While free‑tier users face daily limits and pricing details remain undisclosed, the update positions Gemini alongside competitors like Anthropic’s Claude and Perplexity, which also offer audio processing tools.