Groq raises $650M for life after Ross and Nvidia
Adam Winter is taking Groq deeper into AI inference cloud after Nvidia licensed its LPU technology and hired away its founding CEO.
By Ryan Merket ยท Published
Why it matters
Groq is testing a new AI infrastructure template: monetize core IP with Nvidia, keep the independent company alive, then use investor capital to rebuild around cloud operations.

Groq raised $650 million on June 22 to rebuild around AI inference cloud, six months after Nvidia licensed Groq's inference technology and hired founder Jonathan Ross, president Sunny Madra and other Groq employees.
The round, announced by Groq and first reported as in progress by Axios in May, was led by Disruptive and Infinitum. Groq said other existing investors also reinvested, but Groq did not name the full investor roster and did not disclose a valuation. TechCrunch reported that Groq was last valued at $6.9 billion after a $750 million round in September 2025.
That missing valuation is the tell. Groq is not selling this round as a conventional up-round victory lap. Groq is selling continuity after a transaction that changed what Groq is. Ross, the ex-Google engineer known for work on Google's Tensor Processing Unit, built Groq around a language processing unit for inference. Nvidia's December 2025 deal gave the GPU incumbent a license to Groq's inference technology and took Ross, Madra and other Groq staff into Nvidia. The reported price tag on that transaction was $20 billion, according to TechCrunch and Axios, but Groq's own December announcement described it only as a non-exclusive licensing agreement.
The operating question for Groq is no longer whether Groq can prove the chip architecture. Nvidia has already decided the answer is yes. The question is whether the remaining Groq can convert that technology, and Nvidia's own LPX roadmap around it, into a cloud business with enough supply, pricing discipline and customer trust to matter.
The founder's chip story becomes an operator's cloud story
Ross and co-founder Doug Wightman started Groq in 2016 after their Google work in AI silicon. Groq's bet was that inference, not only training, would need purpose-built hardware. That bet made Groq one of the clearest venture-backed challengers to Nvidia's GPU-centered data center economics.
Groq's second act belongs to Adam Winter, who Groq now identifies as chief executive. Groq's June 22 announcement says Winter and CFO Matt Eng are long-standing Groq leaders who have spent years building Groq's technology, infrastructure footprint and commercial operations. The post also places Groq under the chairmanship of Alex Davis, founder and CEO of Disruptive, one of the round's two named leads.
That structure matters because Groq is now closer to a compute operator than a chip insurgent. Groq says it operates 13 data centers across North America, Europe, the Middle East and APAC, serves more than five million developers and thousands of AI-native companies, and processes trillions of tokens each week. Those figures are Groq's own metrics, not audited operating data. They do, however, show how Groq wants the market to judge the new round: not by tape-out milestones, but by deployed capacity and token volume.
The $650 million is targeted at expanding that footprint. Groq says the capital will support fit-out of existing facilities with its latest inference technology, including Nvidia's new LPX system, and help Groq scale toward 200 megawatts by the end of 2027. That is a heavy infrastructure promise, and it pushes Groq into the same uncomfortable zone facing every AI cloud: power, financing, utilization and customer concentration matter as much as model speed.
Nvidia now sells the category Groq created
Nvidia moved fast after the December licensing agreement. In March, Nvidia published details on Nvidia Groq 3 LPX, a rack-scale inference accelerator for its Vera Rubin platform. Nvidia says LPX is built around 256 interconnected Nvidia Groq 3 LPU accelerators and is designed for low-latency, high-throughput, large-context agentic AI workloads.
For Groq, that is both validation and pressure. Nvidia putting Groq-derived technology into its data center roadmap tells customers that Groq was right about deterministic, low-latency inference. It also means Groq must compete in a world where the largest AI infrastructure supplier can package similar architectural ideas inside Nvidia's own hardware, networking, software and customer relationships.
Groq's answer is distribution through GroqCloud. The product exposes an OpenAI-compatible API, offers a free trial, and sells low-latency access to open models. Groq's customer page names the McLaren Formula 1 Team, and Groq cites customer claims of faster chat performance and lower cost after moving to GroqCloud. Those claims should be read as customer-supplied case-study figures, not market-wide benchmarks.
The broader reason investors are still writing checks is simple: inference is where model usage turns into recurring compute consumption. Groq says inference will require an estimated 15 to 20 times more compute than training over time. That estimate is Groq's own forecast, but the investment thesis around it is visible across the market. Training clusters produce frontier-model headlines; inference infrastructure determines whether AI products can answer users quickly, cheaply and reliably at scale.
Groq is restaffing for a different fight
Groq's leadership additions fit the pivot. Alan Rice has joined as chief operating officer after roles at xAI and Meta data centers, following earlier U.S. Navy nuclear submarine operations experience. In July, Groq says Sinclair Schuller will join as chief technology officer and Rakesh Malhotra as chief product officer. Schuller founded Apprenda, an enterprise cloud platform sold to Atos, and later co-founded Nuvalence with Malhotra. EY acquired Nuvalence in 2024. Malhotra also spent roughly a decade at Microsoft working on cloud, data-center-management and enterprise-storage products.
Those hires are not chip-founder replacements in the narrow sense. They are cloud-scale and enterprise-software hires. Groq needs to sell reliability, procurement confidence, developer experience and predictable economics. That is a different motion from persuading investors that an LPU can beat a GPU on a benchmark.
Groq has made this transition before in smaller form. In March 2024, Groq acquired Definitive Intelligence, the AI data analytics company led by Madra, to launch and expand GroqCloud. Madra then ran GroqCloud before departing for Nvidia as part of the December licensing arrangement. The cloud business is not new, but Groq is now placing the entire company narrative around it.
The round buys time, not proof
Groq has the money to keep building, the investor continuity to present the Nvidia transaction as a win, and the technical validation of seeing Nvidia adopt Groq-derived inference architecture into its own platform. That is a stronger hand than most startups have after a talent-and-IP deal with the market leader.
But the risk is equally clear. Groq's core technology is no longer Groq's alone in commercial terms. Groq did not disclose the new valuation. Groq's headline usage numbers are self-reported. Groq's cloud expansion depends on expensive data center capacity at a time when AI infrastructure investors are increasingly sorting real utilization from speculative build-out.
Ross built Groq to make inference hardware matter. Winter's job is to prove Groq can matter after the hardware thesis has been absorbed by Nvidia. The $650 million round gives Groq another run at that company, this time as an inference cloud operator with founder-built technology, investor-backed patience and a market that is moving from demos to production workloads.