← Torna alle notizie

Tag: instruction following

Anthropic launches Sonnet 4.6 with expanded context window and benchmark gains

Anthropic launches Sonnet 4.6 with expanded context window and benchmark gains
Anthropic has introduced Sonnet 4.6, the latest iteration of its mid-size model, as part of its four‑month update rhythm. The new version improves coding, instruction‑following, and computer‑use capabilities and becomes the default for both Free and Pro plan users. A beta rollout offers a one‑million‑token context window—twice the size of the previous maximum—enabling handling of entire codebases, lengthy contracts, or dozens of research papers in a single request. The launch follows the Opus 4.6 release and is accompanied by strong benchmark results, including a 60.4% score on ARC‑AGI‑2, positioning Sonnet 4.6 above most comparable models. Leggi di più

OpenAI's GPT-5.1 Refines Performance Over GPT-5

OpenAI's GPT-5.1 Refines Performance Over GPT-5
OpenAI introduced GPT-5.1 as an incremental upgrade to its flagship model, GPT-5. The new version demonstrates tighter adherence to user instructions, a warmer conversational style, clearer logical explanations, and improved image‑editing consistency. Tests show GPT-5.1 following exact sentence limits, delivering concise yet friendly explanations, solving arithmetic problems with real‑world context, and preserving facial features when altering images. Visual classification also becomes more confident. While not a revolutionary leap, the refinements make GPT-5.1 a more reliable choice for everyday AI tasks. Leggi di più