Palantir-Nvidia Launch Sovereign AI OS, Prompting Critique of Token-Based Pricing
The architecture marries Nvidia’s open‑weight Nemotron foundation models with its newest Blackwell Ultra GPUs, and layers Palantir’s AIP, Ontology, Foundry, and Apollo platforms on top. The result is a fully integrated, air‑gapped environment that promises end‑to‑end control over data, model ownership, and compliance.
Nvidia’s press release highlighted that the system can run Nemotron models inside sealed hardware, with built‑in data‑authorization checks, rights‑to‑erasure controls, and an audit trail that satisfies the most stringent federal security mandates. Palantir, meanwhile, stressed how its enterprise data‑integration tools dovetail with the GPU‑accelerated inference engine, positioning the offering as a plug‑and‑play solution for sovereign AI deployments.
At the heart of the stack are Blackwell Ultra GPUs, each delivering 15 PetaFLOPS of dense NVFP4 compute and housing 288 GB of HBM3e memory. Nemotron models, being open‑weight, can be fine‑tuned on customer data, and when run on air‑gapped cards they meet the strict data‑privacy requirements of U.S. federal agencies and other regulated sectors.
Two days after the announcement, Palantir CEO Alex Karp appeared on CNBC’s Squawk Box and blasted the pricing models of OpenAI and Anthropic. He said the industry had "gone completely wrong" in how frontier models are sold and accused the labs of "tokenmaxxing"—maximizing token consumption regardless of value. Karp warned that hosted APIs could expose customers’ intellectual property, a claim that has not been corroborated by the labs and remains an allegation. He pointed to Palantir’s new model‑agnostic product, which lets users switch between underlying models, arguing that government and defense customers have heightened trust concerns about relying on a single frontier lab.
Karp’s remarks arrive amid a broader shift in enterprise sentiment toward cost‑efficient AI deployment. CNBC coverage noted that some organizations are moving away from indiscriminate token usage toward more efficient, on‑premises solutions. OpenAI and Anthropic have already announced exploratory pricing changes as competition in the LLM market intensifies, making the Palantir‑Nvidia partnership an attractive alternative for agencies that need to keep data and model weights in‑house.
Key developments to watch include any public response from OpenAI or Anthropic to the IP‑risk claim, the pace at which U.S. agencies adopt the Palantir‑Nvidia Sovereign AI OS, and whether other enterprise vendors begin to market open‑weight, self‑hosted alternatives. These dynamics will shape the future of AI procurement in regulated sectors and the balance between cloud‑based and on‑premises solutions.
In short, the Palantir‑Nvidia collaboration delivers a full‑stack, air‑gapped AI platform that restores data and model ownership to customers. Alex Karp’s criticism of token‑based pricing underscores a growing demand for sovereign AI solutions, and the industry will be watching how the market responds to these competing approaches.