Tags: AI benchmarks

May 1, 2026

OmniCalculator Report Finds Grok Leads in Math While Claude Tops Writing Quality

TechRadar

A new OmniCalculator benchmark shows xAI's Grok 4.2 outperforms free AI chatbots in logical and math tasks, while Anthropic's Claude 4.6 delivers the best writing consistency. Despite a surge in Claude's popularity amid concerns over ChatGPT's ties to military projects, OpenAI's ChatGPT remains the most widely used model. The study highlights distinct strengths and instability rates across the leading bots, suggesting users may need to match tools to specific tasks rather than seeking a single "smartest" AI. Read more