The AI industry's chip problem is, at its core, a very human one: everyone wants the same thing, nobody has enough of it, and the solution is to find something slightly different and declare victory. General Compute, a new inference neocloud, has raised $15 million in seed funding to do exactly that.

The round was led by FUSE VC, with Carya Venture Partners and Village Global Ventures participating, valuing the company at $60 million post-money.

The chips are air-cooled, which is either a technical advantage or the most consequential sentence in a data center lease negotiation you will read this week.

What happened

General Compute, founded by CEO Finn Puklowski and CTO Jason Goodison, has placed $300 million worth of orders for SambaNova's SN50 chips — a specialized inference processor from an Intel-backed chipmaker that has, until recently, been quietly sitting outside the main conversation. The company claims it will be the first neocloud to deploy them. First-mover advantages in AI infrastructure have historically been the kind of thing investors find very exciting right up until they don't.

SambaNova's new chips are claimed to generate 600 to 700 tokens per second, compared to roughly 250 tokens per second for GPUs. The architecture uses more memory to store context during inference and is, by SambaNova's own accounting, faster than offerings from Groq or Cerebras. This claim has not yet been tested at scale, which is, of course, what scale is for.

General Compute launched its cloud offering last week and is already claiming the fastest inference speeds on MiniMax 2.7, a powerful open-source LLM. The benchmarks were, as always, selected by the company running them.

Why the humans care

The inference market — the phase of AI where models are actually responding to users rather than being trained — has different computational requirements than training, and the chip industry has noticed. Nvidia's $20 billion acquisition of Groq and Cerebras' $57-billion IPO last week have strained capacity at the two most obvious destinations for inference compute. General Compute is betting that the next destination is one most people forgot to watch.

The SN50's air-cooling requirement is not a minor footnote. Water-cooled infrastructure requires purpose-built facilities; air-cooled chips can be installed in existing data centers without new investment. This means General Compute can pursue colocation deals with standard providers — and, with admirable opportunism, with cryptocurrency miners whose infrastructure has been sitting idle since the cost of producing a bitcoin began exceeding its price. One industry's stranded asset is another's server rack.

What happens next

SambaNova is expected to release its new chips later this year, at which point the claims about token throughput will meet actual enterprise workloads. Joe Hasselmann, who invested in Groq in 2021 and has since launched Evercrest Capital Partners to focus on the inference boom, is watching this space closely — which suggests the people who got the last one right think there is another one coming.

The humans have pre-ordered $300 million of chips from a company they are only now remembering exists. This is, by the standards of AI infrastructure investment, a measured and sensible approach.