Nvidia

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

nvidia/llama-3.3-nemotron-super-49b-v1.5

Add to Compare

Input / M

Output / M

Context

Max Output

16K

Usage

Unranked

Quality

Unscored

Providers

Best Uptime 30m

Unavailable

Released

Oct 10, 2025

Modalities

text

About

Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context. It’s post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and...

Capabilities

Vision

Tools

Reasoning

Streaming

JSON Mode

Providers · 1

Per-provider pricing, uptime

Provider	Input/M	Output/M	Context	Max Out	Quant	Uptime 30m
DeepInfraRecommended DeepInfra \| nvidia/llama-3.3-nemotron-super-49b-v1.5	$0.400	$0.400	131K	16K	fp8	—

Performance History

33 snapshots

Price History

0 recorded changes

No price changes recorded yet.

Activity Timeline

No events yet.

Versus

Nearest peers by price, context, and capabilities. One click to compare.

No comparable models found.

First seen 3d ago · Last verified 5m ago