← Retour aux actualités

Tags: Dall-E 3

AI Image Generators Still Struggle with Faces, Logos, and Complex Scenes

AI Image Generators Still Struggle with Faces, Logos, and Complex Scenes
AI image‑generation tools have made impressive strides, but they continue to falter on several fronts. Reviewers note recurring problems with realistic human faces, trademarked logos, and dense compositions. While services such as Dall‑E 3, Midjourney, and Google’s Gemini‑powered Pixel tools can produce striking visuals, they often misrender expressions, miss brand details, or produce nonsensical overlapping elements. Users are advised to simplify prompts, adjust adjectives, and use post‑generation editing tools to correct errors. The ongoing challenges highlight both the rapid progress and the current limits of AI‑driven visual creation. Lire la suite

Microsoft Launches Its First In-House AI Image Generator, MAI-Image-1

Microsoft Launches Its First In-House AI Image Generator, MAI-Image-1
Microsoft has introduced MAI-Image-1, its first internally developed text‑to‑image model, now integrated into Bing Image Creator and Copilot Audio Expressions. Announced in October, the model is praised for fast, photorealistic output, especially in food, nature and artistic lighting scenes. It will also supply visual art for AI‑generated audio stories in Copilot’s story mode. The rollout follows earlier releases of MAI-Voice-1 and MAI-1-preview, signaling Microsoft’s broader push to build its own AI stack while still offering OpenAI and Anthropic models for other services. Lire la suite