cyberivy
AIGroqInferenceNeocloudNvidiaAI InfrastructureFunding2026

Groq Reportedly Raises 650 Million Dollars for an Inference Cloud

May 31, 2026

According to a TechCrunch report from 29 May 2026, Groq is raising up to 650 million dollars to build a pure inference-cloud business after selling its chip technology to Nvidia.

What this is about

According to a TechCrunch report from 29 May 2026, the AI company Groq is raising up to 650 million US dollars from existing investors. The background is notable: in December 2025, Groq licensed its chip technology to Nvidia in an arrangement valued at around 20 billion dollars, with part of its leadership moving to Nvidia. The remaining team is now building a company with no chip business of its own that offers AI inference as a cloud service.

What the financing actually means

Inference is the running of an already-trained AI model, meaning the actual answering of requests. Groq plans to sell this inference as a cloud service, a category the industry often calls a "neocloud." Unlike before, the company no longer sells its own hardware but provides compute as a service. According to TechCrunch, the round is effectively backstopped: existing investors Disruptive and Infinitum have reportedly agreed to cover the full 650 million dollars should other investors decline their pro-rata shares. It is important to note that this is a report about an ongoing financing, not an officially closed deal.

Why it matters

The case shows a split between two bets: building AI chips and running AI inference as a service. The chip technology has been sold; the inference business continues as a standalone company. For the market, this signals that inference-cloud services are becoming their own highly valued category alongside the pure training of models. For companies running AI applications, more competition among inference providers could, over time, affect prices and latency.

In plain language

Imagine a bakery that spent years building and selling its own ovens. Now it sells the oven blueprints to a large manufacturer and instead opens a bakery that only bakes and sells bread. Baking bread is the inference here: the daily delivery of results, not the building of the machine.

A practical example

A software vendor in Munich runs an AI chatbot for 50,000 end customers and pays monthly for inference at a cloud provider. If an additional specialized inference provider with fast response times entered the market, the company could compare offers, redirect part of the load as a test, and weigh response times against costs. Before switching, a load test, a comparison of prices per million tokens, and a review of the data-protection and location model would be sensible.

Scope and limits

First, the financing is at this point a report and not an officially confirmed deal; the amount and terms may change. Second, building an inference-cloud business without its own chips means the company depends on purchased or licensed hardware, which can affect margins. Third, named personnel details such as a new management team are taken from reporting and should be checked directly against primary sources before any business decisions.

SEO & GEO keywords

Groq, AI inference, neocloud, inference cloud, Nvidia, 650 million dollar funding, LPU, AI infrastructure, Disruptive, Infinitum, AI compute, 2026

πŸ’‘ In plain English

The company Groq sold its chip technology to Nvidia. Now, according to a report, it is raising 650 million dollars to instead offer AI as an online service that runs finished AI models quickly.

Key Takeaways

  • β†’According to TechCrunch on 29 May 2026, Groq is raising up to 650 million dollars from existing investors.
  • β†’In December 2025, Groq licensed its chip technology to Nvidia in an arrangement worth around 20 billion dollars.
  • β†’The new business runs AI inference as a cloud service (neocloud) with no chip business of its own.
  • β†’The round is considered backstopped, as Disruptive and Infinitum would cover the full 650 million.
  • β†’This is a report about an ongoing financing, not a confirmed closing.
  • β†’Inference-cloud services are emerging as their own market category alongside model training.

FAQ

How much is Groq raising?

According to a TechCrunch report from 29 May 2026, up to 650 million US dollars from existing investors.

What is a neocloud?

A cloud service that sells AI inference, the running of already-trained models, as compute.

What happened to Groq's chip business?

Groq licensed its chip technology to Nvidia in December 2025 in an arrangement worth around 20 billion dollars.

Sources & Context