General Compute, a fledgling AI‑inference cloud provider, closed a $15 million seed round at a $60 million post‑money valuation. The round was led by FUSE VC and included Carya Venture Partners and Village Global Ventures. The funding will accelerate the launch of the company’s “neocloud” service, which rents out processing power for the phase of AI where models generate responses rather than learn from data.

The startup’s founders, CEO Finn Puklowski and CTO Jason Goodison, cite two persistent bottlenecks in the AI ecosystem: finding the right chip for inference workloads and locating data‑center space that can host that hardware without massive infrastructure upgrades. While GPUs have dominated AI training, they are less efficient for inference, prompting the industry to explore purpose‑built silicon.

SambaNova chips power the new neocloud

General Compute has partnered with SambaNova, an Intel‑backed chipmaker that focuses on inference. SambaNova’s upcoming SN50 chips promise 600‑700 tokens per second, roughly three times the throughput of conventional GPUs, which manage about 250 tokens per second. The chips also feature a flexible architecture and larger memory buffers, allowing them to store more context during calculations.

Unlike many high‑performance AI accelerators that require water cooling and substantial power, the SN50 is air‑cooled and consumes less electricity. That design enables the hardware to be installed in existing data‑center facilities without new cooling infrastructure. Puklowski says the company has already ordered $300 million worth of SN50 units and will be the first neocloud to deploy them at scale.

To overcome the second hurdle—finding suitable rack space—General Compute is pursuing colocation agreements. The company plans to place its hardware not only in traditional data‑center sites but also in repurposed crypto‑mining operations, where excess capacity can be redirected to AI workloads. This approach mirrors recent trends where miners rent out idle hardware for alternative compute tasks.

General Compute launched its cloud offering last week, touting the fastest performance on MiniMax 2.7, an open‑source large language model. Venture investor Joe Hasselmann, who backed the startup through his new AI‑focused fund Evercrest Capital Partners, sees the SambaNova partnership as a strategic parallel to earlier collaborations such as CoreWeave’s reliance on Nvidia GPUs and Groq’s integration of its own chips with cloud services.

Industry observers note that the shift toward specialized inference hardware reflects a broader market realignment. Companies like Cerebras and Groq have recently secured massive financings or IPO proceeds, underscoring the demand for faster, cheaper inference. As AI applications move from single‑model deployments to multi‑agent ecosystems, speed and cost become decisive competitive factors.

Puklowski envisions turning hour‑long inference jobs—like code‑generation agents—into tasks that finish in five to ten minutes. Faster token generation also enables more responsive audio agents for customer‑service use cases. "If you use ChatGPT and it gives you 50 tokens per second, that’s still a heck of a lot faster than we can read," he said, highlighting the need for rapid inference as AI agents become more autonomous.

The seed round not only provides the capital to purchase and deploy the SN50 fleet but also validates General Compute’s business model, which hinges on delivering high‑throughput, low‑cost inference to a growing roster of customers. With the backing of seasoned investors and a clear hardware advantage, the company appears poised to capture a slice of the burgeoning inference market.

This article was written with the assistance of AI.
News Factory SEO helps you automate news content for your site.