Awaiting next sync…
Back to explorer
Nvidia

NVIDIA: Nemotron 3 Ultra

nvidia/nemotron-3-ultra-550b-a55b

Score70
Compare
Input / M
Output / M
Context
Max Output
16K
Usage
Unranked
Quality
60.0 / 100
Providers
3
Best Uptime 30m
100.00%
Released
Jun 4, 2026
Modalities
text

About

NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...

Capabilities

Vision
Tools
Reasoning
Streaming
JSON Mode

Providers · 3

Per-provider pricing, uptime
ProviderInput/MOutput/MContextMax OutQuantUptime 30m
NebiusRecommended
Nebius | nvidia/nemotron-3-ultra-550b-a55b-20260604
$1.00$3.008Kfp4100.00%
Together
Together | nvidia/nemotron-3-ultra-550b-a55b-20260604
$0.600$3.60512Kunknown98.98%
DeepInfra
DeepInfra | nvidia/nemotron-3-ultra-550b-a55b-20260604
$0.500$2.20262K16Kfp4

Benchmarks

Quality index 60.0 / 100
Artificial Analysis
intelligence index
37.8 / 100
coding index
49.3 / 100
agentic index
27.4 / 100
Design Arena
elo models 3d
1212.0
elo models gamedev
1195.0
elo models uicomponent
1179.0
elo models codecategories
1174.0
elo models dataviz
1164.0
elo models website
1140.0
elo models svg
1135.0
elo models asciiart
1105.0

Performance History

53 snapshots

Price History

0 recorded changes
No price changes recorded yet.

Usage (OpenRouter)

Historical usage

Activity Timeline

  • No events yet.

Versus

Nearest peers by price, context, and capabilities. One click to compare.
No comparable models found.
First seen 3d ago · Last verified 3m ago