← Back to News

Tags: cost efficiency

OpenAI Introduces Faster, Lower-Cost GPT-5.4 Mini and Nano Models

OpenAI Introduces Faster, Lower-Cost GPT-5.4 Mini and Nano Models
OpenAI has launched two smaller versions of its latest GPT-5.4 model—Mini and Nano—designed for developers who prioritize speed and cost over maximum reasoning power. The Mini model runs more than twice as fast as the full model while staying close on key benchmarks, and the Nano model focuses on simple classification and data‑extraction tasks. Both models support text and image inputs, tool use, function calling, and a 400,000‑token context window, and they are available today via the API, Codex, and ChatGPT. This tiered approach lets developers allocate cheaper models for routine work and reserve the full model for complex reasoning, reshaping how real‑time AI applications are built. Read more

Google Cloud VP Highlights Three Key Frontiers for AI Model Deployment

Google Cloud VP Highlights Three Key Frontiers for AI Model Deployment
Michael Gerstenhaber, product vice president for Google Cloud's Vertex AI platform, explains that AI models are being evaluated on three fronts: raw intelligence, response time, and cost‑effective scalability. He notes that while the technology shows promise, broader adoption of agentic AI is slowed by missing infrastructure for auditing, data authorization, and production‑ready patterns. Gerstenhaber also points to Google’s unique vertical integration—from data centers and custom chips to APIs and compliance tools—as a strategic advantage in addressing these challenges. Read more