NVIDIA Unveils Nemotron 3 Ultra, Targets the Next Wave of AI Agents

Jensen Huang used his Taipei keynote to position NVIDIA as a full-stack AI platform, installing Nemotron 3 Ultra atop its open model family with a latent MoE trained in NVFP4 and a claimed 5x throughput uplift.

By Ryan Merket · Published Jun 1, 2026, 12:35am CT

Why it matters

NVIDIA hinting at a new Nemotron release will draw developer attention, but real adoption hinges on concrete details like license, cost, and performance on NVIDIA GPUs.

An intricate, abstract geometric form representing the Nemotron 3 Ultra AI model (1970s offset-print magazine illustration, characterized by coarse halftone dots, subtle off-register color shifts, and a warm, slightly yellowed paper texture

NVIDIA has unveiled Nemotron 3 Ultra, a new flagship open model aimed squarely at the emerging market for AI agents, autonomous workflows, and enterprise reasoning systems.

The announcement came during a keynote from NVIDIA CEO Jensen Huang, who presented benchmark results positioning Nemotron 3 Ultra among the strongest open models currently available.

https://x.com/NVIDIAAI/status/2061305524700758050

Rather than focusing solely on traditional AI benchmarks like math competitions or coding exams, NVIDIA highlighted metrics tied directly to real-world agent performance, signaling where the company believes the industry is headed next.

NVIDIA's bet: AI is moving beyond chat

The presentation framed Nemotron 3 Ultra as a model built for what NVIDIA calls "Frontier Smart" applications.

According to benchmark data shown on stage, the 550-billion-parameter model posted competitive results across a range of agent-focused evaluations.

Benchmark	Nemotron 3 Ultra	GLM 5.1	Kimi K2.6	Qwen 3.5
Agent Productivity	91%	84%	91%	89%
Long-Horizon Planning	33%	40%	29%	30%
Coding	54%	64%	67%	53%
Instruction Following	82%	77%	74%	78%
Knowledge Work	1,448	1,594	1,508	1,192
Professional Work Tasks	56%	46%	56%	53%
Long Context	95%	N/A	N/A	90%

While Nemotron did not lead every category, NVIDIA emphasized its balanced performance across agent productivity, instruction following, enterprise tasks, and long-context reasoning.

Those are increasingly the capabilities developers care about as AI systems evolve from question-answering tools into software that can complete complex workflows on behalf of users.

A benchmark suite designed for agents

One of the most notable aspects of the launch wasn't necessarily the scores themselves.

It was the benchmarks NVIDIA chose to showcase.

For much of the past three years, AI companies have competed primarily on coding tests, mathematical reasoning, and standardized evaluation suites.

Nemotron's launch presentation looked different.

Metrics such as:

Agent Productivity
Long-Horizon Planning
Professional Work Tasks
Long Context
Instruction Following

suggest NVIDIA is optimizing for systems that can act, execute, and collaborate rather than simply respond.

That aligns with a broader shift occurring across the AI industry.

Founders, investors, and enterprise buyers are increasingly focused on agentic systems capable of handling customer support, software development, research, operations, and other multi-step workflows that previously required human intervention.

NVIDIA is climbing the AI stack

For years, NVIDIA occupied a relatively simple position in the AI ecosystem.

The company sold the GPUs powering everybody else's models.

That role has become one of the most profitable businesses in technology history, but Nemotron shows NVIDIA's ambitions extend far beyond hardware.

The company is increasingly building:

Foundation models
Agent frameworks
AI infrastructure
Enterprise AI tooling
Inference platforms
Developer ecosystems

The logos displayed alongside the Nemotron launch reinforce that strategy.

Partners and ecosystem participants shown during the presentation included Cursor, LangChain, Nous Research, Perplexity, Black Forest Labs, Reflection, NAVER, Mistral, and others.

Many of these companies sit at the center of the emerging agent economy.

Rather than serving as a supplier behind the scenes, NVIDIA appears increasingly interested in becoming a platform company that powers both the hardware and software layers of AI.

Entering a crowded battlefield

Nemotron arrives at a moment when open-weight AI competition is becoming increasingly intense.

Over the last year, companies including DeepSeek, Moonshot AI, Alibaba, and Zhipu have released increasingly capable reasoning models that have challenged assumptions about where frontier AI innovation originates.

Chinese labs in particular have compressed the performance gap while dramatically lowering costs, forcing U.S. companies to rethink their competitive positioning.

The benchmark comparisons shown by NVIDIA directly acknowledge that reality.

Rather than comparing Nemotron exclusively against Western models, the company placed its system alongside GLM 5.1, Kimi K2.6, and Qwen 3.5, all of which have emerged as major players in the global open-model ecosystem.

That comparison reflects a growing consensus inside the industry: the competition for frontier AI is now truly global.

Why it matters

The significance of Nemotron 3 Ultra extends beyond another model launch.

The benchmark categories highlighted during the announcement provide a glimpse into how NVIDIA sees the future of AI.

The next generation of AI systems may not be judged primarily by how well they answer questions.

Instead, they may be evaluated by how effectively they can:

Plan over long time horizons
Maintain context across large workflows
Coordinate multiple actions
Execute professional tasks
Operate as autonomous agents

If that vision proves correct, Nemotron 3 Ultra is less a chatbot competitor and more a foundation for digital workers.

For NVIDIA, the launch marks another step in a broader transformation from GPU manufacturer to full-stack AI platform.

And if the company's performance claims hold up under independent evaluation, Nemotron 3 Ultra could become one of the most important open models available to developers building the next generation of AI agents.

Why it matters

NVIDIA's bet: AI is moving beyond chat

A benchmark suite designed for agents

NVIDIA is climbing the AI stack

Entering a crowded battlefield

Why it matters

Reader comments