Notizie — Pagina77

Anthropic’s Claude Agents Build a Rust‑Based C Compiler

Anthropic’s Claude Agents Build a Rust‑Based C Compiler
Anthropic researcher Nicholas Carlini used sixteen instances of the Claude Opus 4.6 model, organized as “agent teams,” to develop a Rust‑based C compiler from scratch. Over two weeks and nearly 2,000 Claude Code sessions, the agents produced a 100,000‑line compiler capable of building a bootable Linux 6.9 kernel for x86, ARM and RISC‑V. The open‑source project, released on GitHub, compiles major software such as PostgreSQL, SQLite, Redis, FFmpeg and QEMU, passes 99 percent of the GCC torture test suite, and even runs Doom. The experiment highlights the potential of semi‑autonomous AI coding on well‑defined tasks.Leggi di più

Maybe AI agents can be lawyers after all

Maybe AI agents can be lawyers after all
Recent benchmark testing of AI agents on professional tasks shows a notable jump in performance, especially after Anthropic released Opus 4.6. The new model pushed scores from the low‑20s to just under 30 percent on one‑shot trials and reached an average of 45 percent with multiple attempts. While still far from full competence, the improvement signals rapid progress in foundation models and suggests that legal professionals may need to reconsider the timeline for AI displacement.Leggi di più

AI Accelerates Biotech Innovation to Overcome Labor Gaps

AI Accelerates Biotech Innovation to Overcome Labor Gaps
Biotech firms are turning to artificial intelligence to boost productivity and address talent shortages. Insilico Medicine is building a multi‑task AI platform that can generate disease hypotheses, design candidate molecules and even repurpose existing drugs, aiming to speed drug discovery and cut costs. GenEditBio is using AI to design engineered protein delivery vehicles that target specific tissues for in‑vivo CRISPR therapy, recently receiving FDA clearance for a corneal‑dystrophy trial. Both companies stress the need for richer, more diverse data to improve model accuracy and envision future tools such as digital twins for virtual clinical testing.Leggi di più

ChatGPT Helps User Refine 2026 Goals, Highlights Priorities and Risks

ChatGPT Helps User Refine 2026 Goals, Highlights Priorities and Risks
A writer recounts how they used ChatGPT as a goal‑setting coach for the year 2026. By feeding the AI a list of personal and professional objectives, the model identified blind spots, questioned assumptions about work capacity, pregnancy timing, and social commitments, and suggested ways to reduce cognitive load. The interaction led the author to prioritize a handful of non‑negotiables, restructure the yearly plan, and adopt new operating rules aimed at preserving stability over growth.Leggi di più

Backlash Over OpenAI's Retirement of GPT-4o Highlights Risks of AI Companions

Backlash Over OpenAI's Retirement of GPT-4o Highlights Risks of AI Companions
OpenAI announced the retirement of its GPT-4o chatbot model, sparking a wave of user protest and raising concerns about the emotional bonds people form with AI. The move has triggered eight lawsuits alleging that the model provided harmful advice to vulnerable users. Experts warn that while AI companions can fill gaps in mental‑health access, they also risk fostering dependence and isolation. The controversy underscores the challenge of balancing supportive AI interactions with safety safeguards as the industry races to develop more emotionally intelligent assistants.Leggi di più

AI Chatbots Turn Users into Personalized Caricatures

AI Chatbots Turn Users into Personalized Caricatures
A new online trend lets users request AI chatbots to create caricature illustrations that reflect both their appearance and personal details. By combining a selfie with a prompt, the model draws on prior conversation history and supplied information to add elements such as job cues, hobbies, pets and other quirks. The result is a whimsical, hand‑drawn style portrait that showcases how AI blends visual and textual data to produce personalized artwork.Leggi di più

OpenAI CEO Sam Altman Criticizes Anthropic’s Super Bowl Ads Targeting ChatGPT’s Ad‑Supported Tier

OpenAI CEO Sam Altman Criticizes Anthropic’s Super Bowl Ads Targeting ChatGPT’s Ad‑Supported Tier
OpenAI chief Sam Altman publicly rebuked Anthropic after the rival released Super Bowl commercials that satirized OpenAI’s new ad‑supported version of ChatGPT. The ads portrayed AI assistants interrupting personal conversations with fictional product pitches, implying that ChatGPT would embed ads within its answers. Altman called the messaging “clearly dishonest” and warned that such portrayals could damage user trust. The clash highlights a growing debate over how AI companies can generate revenue without compromising the user experience, with OpenAI emphasizing ads that appear only at the bottom of responses and Anthropic positioning its Claude model as an ad‑free alternative.Leggi di più

Big Tech’s AI Capital‑Spending Race: Amazon Leads, Investors Wary

Big Tech’s AI Capital‑Spending Race: Amazon Leads, Investors Wary
Amazon, Google, Microsoft, Meta and Oracle are pouring record capital into artificial‑intelligence infrastructure, data‑center expansion and related technologies. Amazon’s projected spend tops the list, followed closely by Google, while Microsoft, Meta and Oracle trail behind. Investors are uneasy about the size of the commitments, noting sharp stock declines for firms with the highest projected outlays. The clash between massive AI‑related capex and market comfort highlights a tension that could shape the industry’s future as companies race to secure compute resources.Leggi di più

Sapiom Secures $15 Million Seed Funding to Power AI Agent Payments

Sapiom Secures $15 Million Seed Funding to Power AI Agent Payments
San Francisco startup Sapiom has closed a $15 million seed round led by Accel, with participation from Okta Ventures, Gradient Ventures, Array Ventures, Menlo Ventures, Anthropic, and Coinbase Ventures. The company is building a financial layer that enables AI agents to automatically purchase and access software, APIs, data, and compute services. By handling authentication and micro‑payments behind the scenes, Sapiom aims to remove infrastructure hurdles for non‑technical creators building AI‑driven applications, focusing initially on B2B use cases.Leggi di più

AI Agents Evolve from Chat Bots to Management Tools

AI Agents Evolve from Chat Bots to Management Tools
Recent AI developments are shifting the focus from conversational bots to agents that act as amplifiers for human expertise. OpenAI's new Codex desktop app lets developers run multiple agent threads, each working on separate code copies, and the underlying GPT‑5.3‑Codex model achieved benchmark scores that surpass competing offerings. This change redefines the user’s role from prompt writer to supervisor, requiring constant human direction while delegating tasks to AI. The emerging model of AI as a tool rather than an autonomous coworker is sparking debate about its practicality and impact on productivity.Leggi di più

OpenAI Unveils GPT-5.3-Codex, Expanding Coding Model Capabilities

OpenAI Unveils GPT-5.3-Codex, Expanding Coding Model Capabilities
OpenAI introduced GPT-5.3-Codex, a new version of its coding model that will be accessible through a command‑line tool, IDE extension, web interface, and a macOS desktop app. While API access is not yet available, the company reports that the model outperforms its predecessors on benchmarks such as SWE‑Bench Pro and Terminal‑Bench 2.0. OpenAI also emphasizes that GPT-5.3-Codex was instrumental in creating itself, positioning the model as a broader software‑lifecycle assistant capable of debugging, deployment, documentation, and more, with mid‑task steering and frequent status updates.Leggi di più

Anthropic Unveils Claude Opus 4.6 Upgrade Boosting Coding Capabilities

Anthropic Unveils Claude Opus 4.6 Upgrade Boosting Coding Capabilities
Anthropic announced the release of Claude Opus 4.6, an enhanced version of its most powerful Claude model. The upgrade focuses on faster, more accurate coding and better handling of complex app tasks through a step‑by‑step reasoning approach. Opus 4.6 can self‑check its work and make multiple attempts without user prompts. The new model is available to paying Claude users on Pro, Max, Team and Enterprise plans, with the Pro tier priced at $20 per month (or $17 with annual billing). Smaller models such as Sonnet 4.5 and Haiku 4.5 remain in the lineup.Leggi di più

OpenAI Unveils Frontier Platform for Enterprise AI Agent Management

OpenAI Unveils Frontier Platform for Enterprise AI Agent Management
OpenAI announced Frontier, an end-to-end platform that lets enterprises build, deploy and control AI agents. The open system supports agents created inside or outside OpenAI, allowing them to access external data and applications while giving companies granular oversight of permissions and actions. Early adopters such as HP, Oracle, State Farm and Uber are testing the service, which is currently limited to a small group of users with broader rollout planned. Pricing details were not disclosed. Industry analysts, including Gartner, view agent‑management platforms as critical infrastructure for AI adoption, positioning Frontier as a strategic move for OpenAI in the enterprise market.Leggi di più

OpenAI Unveils GPT-5.3 Codex Agentic Coding Model Ahead of Anthropic

OpenAI Unveils GPT-5.3 Codex Agentic Coding Model Ahead of Anthropic
OpenAI announced the launch of its Codex agentic coding tool and a new model called GPT-5.3 Codex. The company says the model expands Codex's abilities from simple code writing to handling nearly any developer task, can create complex games and apps from scratch, runs 25 percent faster than its predecessor, and was partially built using earlier versions of itself. The release follows a near‑simultaneous launch by Anthropic, which moved its release 15 minutes earlier, sparking a brief race to market.Leggi di più

Anthropic Launches Claude Opus 4.6 with Enhanced Capabilities and Safety

Anthropic Launches Claude Opus 4.6 with Enhanced Capabilities and Safety
Anthropic announced Claude Opus 4.6, branding it as a direct upgrade that handles complex, multi‑step tasks with higher quality on the first try. The model expands beyond coding to improve work in documents, spreadsheets, and presentations, and adds a one‑million token context window in beta. New features include agent‑team collaboration for developers and expanded cybersecurity safeguards. Pricing remains the same as the predecessor, and the model is positioned as a more production‑ready solution for a broad range of knowledge‑work applications.Leggi di più

Anthropic Rolls Out Claude’s Next‑Gen Model Amid Growing Competition

Anthropic Rolls Out Claude’s Next‑Gen Model Amid Growing Competition
Anthropic’s Claude AI platform has experienced a surge in popularity, especially during the holiday season, as developers and enterprises adopted its coding agent capabilities. The company announced the release of Opus 4.6, described as a direct upgrade with faster performance and improved precision for complex tasks. Industry leaders praised the model’s ability to handle long‑running, multistep projects without constant supervision. While Claude enjoys strong user loyalty, competitors such as OpenAI and Google are intensifying their own AI offerings, prompting Anthropic to emphasize security enhancements and a continued focus on reliable, text‑based productivity tools.Leggi di più

OpenAI executives criticize Anthropic's Super Bowl ads over AI advertising debate

OpenAI executives criticize Anthropic's Super Bowl ads over AI advertising debate
OpenAI’s CEO Sam Altman and chief marketing officer Kate Rouch publicly rebuked rival AI lab Anthropic after the company released a series of Super Bowl commercials that mock the idea of ads appearing in AI chatbot conversations. Anthropic’s ads, part of a campaign titled “A Time and a Place,” depict scenarios where users receive product pitches instead of advice, ending with the tagline “Ads are coming to AI. But not to Claude.” OpenAI officials called the spots dishonest and authoritarian, arguing that any future ChatGPT ads would be clearly labeled and would not alter the chatbot’s responses. The clash highlights competing approaches to monetizing AI, with OpenAI testing conversation‑specific banner ads while Anthropic relies on enterprise contracts and subscriptions.Leggi di più

ElevenLabs CEO Declares Voice the Next Major AI Interface

ElevenLabs CEO Declares Voice the Next Major AI Interface
ElevenLabs co‑founder and CEO Mati Staniszewski told attendees at the Web Summit that voice is poised to become the primary way people interact with artificial‑intelligence systems. He highlighted recent advances that let voice models convey emotion and work alongside large language models, and outlined the company’s push toward hybrid cloud‑and‑device processing for wearables and other hardware. Staniszewski also noted partnerships with Meta and warned that deeper voice integration raises privacy and surveillance concerns.Leggi di più

Moltbook: The AI-Only Social Network Sparking Hype and Security Concerns

Moltbook: The AI-Only Social Network Sparking Hype and Security Concerns
Moltbook is a Reddit‑like platform built exclusively for AI agents, created on top of the OpenClaw open‑source bot framework. Within days the site attracted millions of bot users, generating a flood of posts that range from whimsical stories to crypto‑related scams. While some AI researchers hail the network as an unprecedented glimpse of large‑scale agent interaction, security experts warn that the underlying OpenClaw software requires extensive system access and that Moltbook itself has exposed API tokens and email addresses. The platform thus sits at the intersection of hype, role‑playing, and real security risk.Leggi di più