Nvidia
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5
nvidia/llama-3.3-nemotron-super-49b-v1.5
Input / M
Output / M
Context
Max Output
16K
Usage
Unranked
Quality
Unscored
Providers
1
Best Uptime 30m
Unavailable
Released
Oct 10, 2025
Modalities
text
About
Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context. It’s post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and...
Capabilities
Vision
Tools
Reasoning
Streaming
JSON Mode
Providers · 1
Per-provider pricing, uptime
| Provider | Input/M | Output/M | Context | Max Out | Quant | Uptime 30m |
|---|---|---|---|---|---|---|
| DeepInfraRecommended DeepInfra | nvidia/llama-3.3-nemotron-super-49b-v1.5 | $0.400 | $0.400 | 131K | 16K | fp8 | — |
Performance History
33 snapshots
Price History
0 recorded changes
No price changes recorded yet.
Activity Timeline
- No events yet.
Versus
Nearest peers by price, context, and capabilities. One click to compare.
No comparable models found.
First seen 3d ago · Last verified 5m ago
