Groq Secures $650 Million to Expand AI Inference Cloud
The round was led by venture firms Disruptive and Infinitum, with additional commitments from existing investors who chose to reinvest. The influx of capital will help Groq scale its power consumption to 200 megawatts (MW) by the end of 2027 and accelerate the rollout of its latest inference technology across the company’s 13 data centers in North America, Europe, the Middle East, and Asia‑Pacific.
Groq’s customers include more than five million developers and thousands of AI‑native enterprises, together generating trillions of AI tokens each week. The funding follows a December 2025 non‑exclusive licensing agreement with Nvidia, under which Nvidia secured rights to Groq’s inference technology for its upcoming LPX platform. At Nvidia’s GTC conference, the LPX rack—built around Groq’s Language Processing Unit (LPU) accelerators—was unveiled, cementing Groq’s role as a key hardware supplier for Nvidia’s data‑center clientele.
"Groq has spent years building the technology, infrastructure, and operational expertise required for the next phase of AI," said Alex Davis, Groq Chairman and CEO of Disruptive. "Today, the company has a proven global platform, a world‑class leadership team, and a clear strategy focused on one of the most important opportunities in technology: AI inference at scale. We believe that combination positions Groq to become a foundational layer of the AI economy." The board and lead investors have worked with management to sharpen the company’s focus on building an inference cloud at scale.
The new capital will accelerate the fit‑out of Groq’s existing footprint with its latest inference technology, including Nvidia’s LPX system. In addition to the funding, Groq has bolstered its leadership. CEO Adam Winter and CFO Matt Eng remain in their roles, while Alex Davis serves as Chairman. The company added Chief Operating Officer Alan Rice, formerly of xAI and Meta Data Centers and a former U.S. Navy nuclear submarine officer, and will bring in Sinclair Schuller as Chief Technology Officer and Rakesh Malhotra as Chief Product Officer in July.
John Yetimoglu, Groq Board Member and Founder and Chief Investment Officer of Infinitum, added: "We believe inference will become the largest infrastructure market in technology. As AI moves from experimentation to production, demand for reliable, cost‑efficient inference will continue to grow exponentially. Groq has the rare combination of differentiated technology, operating expertise, and global scale required to capitalize on that opportunity."
Groq’s leadership stresses that inference represents a larger long‑term opportunity than training, as running AI models in production is expected to require significantly more compute. The company’s strategy is to provide a cloud‑based inference service that can scale to the power and capacity needs of enterprise customers.
The $650 million round is Groq’s largest growth‑capital injection to date. It follows a $20 billion licensing deal with Nvidia reported in December 2025, allowing Groq to remain independent while leveraging Nvidia’s manufacturing and distribution capabilities.
Looking ahead, Groq will focus on deploying its LPX‑enabled data centers, expanding its developer ecosystem, and delivering cost‑efficient inference services. The funding will also support the integration of new hardware and software stacks that enable higher throughput and lower latency for large language models and other AI workloads.
In short, Groq’s recent funding round positions it to scale its AI inference cloud to 200 MW by 2027, deepen its partnership with Nvidia, and strengthen its leadership team—aligning the company with industry expectations that production‑grade AI will drive the next wave of compute demand.