Tags: inference

Gimlet Labs Secures $80 Million Series A to Boost AI Inference Efficiency

Gimlet Labs Secures $80 Million Series A to Boost AI Inference Efficiency TechCrunch
Gimlet Labs, founded by former Pixie co‑founders including Stanford adjunct professor Zain Asgar, announced an $80 million Series A led by Menlo Ventures. The startup’s “multi‑silicon inference cloud” software lets AI workloads run simultaneously across CPUs, GPUs, and high‑memory systems, promising 3‑to‑10× faster inference at the same cost and power. Partnerships with major chip makers such as NVIDIA, AMD, Intel, ARM, Cerebras and d‑Matrix support the platform, which targets large model labs and data‑center operators. The round brings total funding to $92 million and backs a team of 30 employees. Read more

Inside Amazon’s Austin Chip Lab: The Trainium Story and Its Impact on AI Partnerships

Inside Amazon’s Austin Chip Lab: The Trainium Story and Its Impact on AI Partnerships TechCrunch
Amazon invited a journalist on a private tour of its Austin chip lab, showcasing the development of the Trainium AI processor family. Lab leaders Kristopher King and Mark Carroll explained how Trainium, originally built for training, now powers inference for services like Bedrock and supports major partners such as Anthropic, OpenAI, and Apple. The lab’s work includes custom servers, liquid‑cooled chips, and a mesh network that reduces latency. Engineers described the intense silicon bring‑up process, welding stations, and a private testing data center. CEO Andy Jassy highlighted Trainium as a multibillion‑dollar business driving AWS’s AI strategy. Read more