Tag: image generation

OpenAI rolls out ChatGPT Images 2.0, adding reasoning to AI picture generation

OpenAI announced a major upgrade to its ChatGPT image generator, unveiling ChatGPT Images 2.0 in a livestream briefing. The new model introduces a reasoning phase that lets the system parse complex prompts before creating visuals, resulting in more accurate text rendering, consistent styles and better layout control. By treating prompts as instructions rather than suggestions, the update narrows the gap with rival Google Gemini and promises fewer retries for users seeking polished graphics. CEO Sam Altman hailed the leap as a shift comparable to moving from GPT‑3 to GPT‑5 in a single step.

OpenAI launches ChatGPT Images 2.0 with improved non‑Latin text rendering and higher resolution

OpenAI has rolled out ChatGPT Images 2.0, a new image‑generation model that promises sharper detail, wider aspect ratios and a marked leap in handling non‑Latin scripts. Available today to all ChatGPT users, the upgrade offers up to 2K resolution, flexible output formats and a reasoning layer that can verify its own results. The company says the model now produces more accurate depictions of Japanese, Korean, Chinese, Hindi and Bengali text, making it a stronger tool for developers, designers and creators who need reliable visual content.

OpenAI rolls out major Codex update, previewing super‑app features for developers

OpenAI unveiled a substantial update to its Codex AI coding platform, adding multi‑app agents, a built‑in browser, image generation, and early memory functions. The enhancements let developers command specific desktop programs, integrate 111 new plugins, and receive proactive suggestions. The rollout begins with macOS users logged into ChatGPT, with EU and UK releases slated for later. While the full super‑app that merges ChatGPT, Codex and a web browser remains in development, the latest release offers a tangible glimpse of OpenAI’s broader vision for a unified desktop AI experience.

Google equips Gemini Personal Intelligence with Nano Banana image generation

Google announced Thursday that its Gemini Personal Intelligence feature will soon generate images using a new Nano Banana‑powered engine. The upgrade lets the AI create pictures that reflect a user’s preferences and photo‑library labels without explicit prompts. Subscribers to Google’s Plus, Pro and Ultra plans in the United States will receive the capability within days, and the company says it will roll out to Chrome desktop and other markets soon. The move expands Gemini’s contextual understanding, but Google warns the system can still misinterpret data and invites user feedback.

German AI Image Startup Black Forest Labs Secures $140 Million Meta Deal and Eyes Physical AI

Black Forest Labs, a 70‑person AI image‑generation firm based in Germany’s Black Forest, raised funding in December at a $3.25 billion valuation and recently inked a $140 million multiyear agreement with Meta. The company’s latent‑diffusion models, praised for efficiency and quality, now power features in Adobe, Canva and other major platforms. After a brief partnership with Elon Musk’s xAI, the startup has turned its attention to physical AI, planning to launch a robot later this year and courting hardware makers for smart‑glass and robotics applications.

Microsoft Unveils New Voice, Transcription and Image AI Models

Microsoft announced three new artificial‑intelligence models: a voice model that can generate up to 60‑second audio clips, a transcription model that converts recordings into text in 25 languages, and a second‑generation image model that delivers faster, more realistic results. The models are now available in Microsoft’s Foundry and MAI playground, with plans to integrate the image model into Bing and PowerPoint. The rollout reflects Microsoft’s push to broaden its AI portfolio beyond text‑focused tools, complementing its Copilot suite and underscoring the company’s deep resources for enterprise‑grade generative media.

Baltimore Sues xAI Over Grok Deepfake Harms

The city of Baltimore has filed a municipal lawsuit against Elon Musk's xAI, alleging that its AI chatbot Grok and the X social network were marketed without warning about the risk of harmful deepfake images. The complaint cites the platform’s image‑generation tool, which was used to create millions of sexualized images, including thousands involving minors, and argues that this violates Baltimore’s Consumer Protection Ordinance. City officials say the action is intended to protect residents from emerging AI‑related harms and hold technology companies accountable.

xAI Faces Class Action Lawsuit Over Grok-Generated Child Exploitation Images

Three teenagers from Tennessee have filed a class action lawsuit in California against xAI, alleging that the company’s AI model Grok used their photos to create sexualized images and videos of minors. The filing claims the generated content was shared on platforms such as Discord and Telegram, causing severe emotional distress and violating laws that prohibit child abuse material. xAI has not commented on the suit, while it continues to grapple with multiple investigations in the United States and Europe over similar allegations involving Grok’s image‑generation capabilities.

Tech Giants Unveil Major Product Updates: Google’s AI Image Upgrade, Lenovo’s Foldable Handheld, and Apple’s Upcoming Launch Week

Google announced a new version of its Nano Banana AI image service that promises better text rendering, real‑time web knowledge, and higher visual fidelity. At the same time, the mechanical‑keyboard community is shifting from loud clicky switches to quieter “thock” sounds achieved with damping foams and lubricated linear switches. Lenovo is reportedly planning a Legion Go handheld that can transform into a Windows tablet with a foldable screen, while Apple has sent invitations for a multi‑day launch event that may showcase new MacBooks and an iPhone 17e. These developments highlight a wave of innovation across hardware and AI software.

Google Unveils Nano Banana 2, a Faster Image Generation Model

Google has introduced Nano Banana 2, an image‑generation model powered by Gemini 3.1 Flash Image. The new system matches the world knowledge and reasoning of Nano Banana Pro while delivering "lightning‑fast" performance. It brings Pro‑level features (real‑time web‑search integration, infographic creation, and text overlay for marketing and greeting‑card designs) to a broader audience. Nano Banana 2 can preserve the likeness of up to five characters in a single workflow, follow precise instructions, and produce images at up to 4K resolution with richer textures and sharper details. The model will replace Pro in the Gemini app and become the default for AI Mode in Search, Lens, and the Flow AI creative studio, though AI Pro and Ultra subscribers will retain access to the original Pro model for specialized tasks.

Google Labs Introduces Pomelli Photoshoot AI Feature for Easy Product Images

Google Labs has added a new Photoshoot feature to its AI marketing platform Pomelli. The tool lets users upload a single product photo and automatically creates polished, studio‑quality images with adjusted lighting, backgrounds, and textures. Designed for small businesses and e‑commerce sellers, the feature is offered at no cost in the United States, Canada, Australia, and New Zealand. It includes presets for ads, social media, and marketplace listings, and can match the visual style of an existing website, making professional‑grade product photography accessible without a dedicated studio.

AI Chatbots Turn Users into Personalized Caricatures

A new online trend has users asking AI chatbots to create caricature illustrations that reflect both their appearance and personal details. By combining a selfie with a prompt, the model draws on prior conversation history and supplied information to add elements such as job cues, hobbies, pets and other quirks. The result is a whimsical, hand‑drawn‑style portrait that showcases how AI blends visual and textual data to produce personalized artwork.

Locai Labs Bans Under‑18 Access and Image Generation, Calls for Industry Honesty Amid UK Probe of Elon Musk’s Grok Images

Locai Labs CEO James Drayson announced that the company will block users under 18 and suspend image‑generation features until safety can be assured. He warned that no AI model can guarantee protection against harmful or sexualized content, urging the industry to be transparent about the risks. In the United Kingdom, regulator Ofcom has opened an investigation into Elon Musk’s Grok platform, which allows image editing that can produce non‑consensual and sexualized depictions, including of children. The controversy has already led to bans in several countries and heightened calls for stricter AI regulation.

X’s Grok Image Tools Remain Free Despite Paywall Claims

Elon Musk’s platform X announced that Grok’s image generation and editing features are limited to paying subscribers, but testing shows that free users can still access the tools through the website, app, and image‑edit button. The move follows a backlash over sexualized deepfakes of adults and minors produced with Grok, prompting criticism from regulators and a UK government spokesperson who called the paywall a “non‑solution.” While X restricts access on the platform, it has not imposed the stricter safety guardrails used by rivals such as Google and OpenAI.

X limits Grok’s image‑generation tool to paying subscribers after global backlash

Elon Musk’s AI venture xAI has restricted access to Grok’s controversial image‑generation feature on X, making it available only to paying subscribers. The move follows a wave of criticism after the tool was used to create sexualized and non‑consensual images of women, children, and public figures. While the restriction applies to the X platform, the standalone Grok app remains free. Governments in the United Kingdom, the European Union, and India have publicly denounced the misuse and urged tighter controls, prompting X to tighten its policies.

Google Gemini’s New Ad Shows AI Crafting Adventures for a Lost Stuffed Toy

Google’s latest advertisement for its Gemini AI model imagines parents using the technology to locate a child’s missing favorite stuffed animal and to create whimsical images and videos of the toy traveling the world. A hands‑on test of Gemini’s image‑search and generation features shows the system can produce plausible results, though it requires careful prompting and has built‑in safeguards that prevent certain uses. The piece also explores the ethical questions around using AI to fabricate comforting narratives for children.

OpenAI Launches GPT Image 1.5 Amid ‘Code Red’ Competition with Google

OpenAI has introduced GPT Image 1.5, a new version of its ChatGPT image generation tool that promises faster performance, better instruction following, and more precise editing controls. The model is now available to all ChatGPT users and via API. Its rollout follows an internal “code red” memo that highlighted a competitive push against Google’s Gemini series and Nano Banana Pro, which have recently outperformed OpenAI on benchmark leaderboards. GPT Image 1.5 builds on the earlier GPT Image 1 release, adding granular post‑production features and a redesigned creative‑studio interface within the ChatGPT sidebar.

How to Get the Most Out of ChatGPT's Image Generation Features

ChatGPT now lets users create and edit images directly within the chat interface. By selecting styles, using the built‑in editor, and accessing mobile‑specific tools, both free and Plus users can produce customized visuals for a variety of needs. The platform supports style changes, text integration, and uploading existing photos for transformation, making AI‑driven image creation more accessible than ever.

10 Essential Tips to Maximize Your ChatGPT Experience

This guide outlines ten practical ways to get more out of ChatGPT, from creating a free account for added features to customizing the AI’s personality. It explains how to enable memory, use temporary chats, organize work with Projects, and retrieve past image generations. The article also highlights advanced options such as connecting Gmail and Google Calendar, accessing the Sora video tool, and upgrading to a Plus plan for higher limits. Together, these tips transform ChatGPT from a casual convenience into a powerful personal assistant.