Z.ai: GLM 4.7 Flash

z-ai/glm-4.7-flash

Score 77

Input / M

Output / M

Context

Max Output

16K

Usage

Unranked

Quality

70.5 / 100

Providers

Best Uptime 30m

99.80%

Released

Jan 19, 2026

Modalities

text

About

As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning,...

Capabilities

Vision

Tools

Reasoning

Streaming

JSON Mode

Providers · 4

Per-provider pricing, uptime

Provider	Model ID	Input/M	Output/M	Context	Max Out	Quant	Uptime 30m
DeepInfra (Recommended)	z-ai/glm-4.7-flash-20260119	$0.060	$0.400	203K	16K	bf16	99.80%
Venice	z-ai/glm-4.7-flash-20260119	$0.125	$0.500	128K	16K	fp8	95.58%
Cloudflare	z-ai/glm-4.7-flash-20260119	$0.061	$0.400	131K	131K	unknown	57.17%
Novita	z-ai/glm-4.7-flash-20260119	$0.070	$0.400	200K	128K	bf16	56.32%
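
The per-million-token prices in the provider table above can be turned into a per-request cost estimate. The following is a minimal sketch, not an official calculator: the prices are copied from the table, while the function names `request_cost` and `cheapest_provider` are hypothetical helpers introduced for illustration. It assumes billing is strictly linear in token count and ignores uptime, context limits, and any caching discounts.

```python
# Prices in USD per one million tokens, copied from the provider table.
# Tuple layout: (input price, output price).
PRICES_PER_MILLION = {
    "DeepInfra": (0.060, 0.400),
    "Venice": (0.125, 0.500),
    "Cloudflare": (0.061, 0.400),
    "Novita": (0.070, 0.400),
}


def request_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes cost scales linearly with the number of tokens.
    """
    input_price, output_price = PRICES_PER_MILLION[provider]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def cheapest_provider(input_tokens: int, output_tokens: int) -> str:
    """Return the provider with the lowest estimated cost for this request size."""
    return min(
        PRICES_PER_MILLION,
        key=lambda name: request_cost(name, input_tokens, output_tokens),
    )


if __name__ == "__main__":
    # Example: a 100K-token prompt with a 16K-token completion.
    for name in PRICES_PER_MILLION:
        print(f"{name}: ${request_cost(name, 100_000, 16_000):.4f}")
    print("Cheapest:", cheapest_provider(100_000, 16_000))
```

Price is only one axis of the comparison: the table also shows large differences in 30-minute uptime and maximum output length, so the cheapest provider is not necessarily the best choice for a given workload.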

Benchmarks

Quality index 70.5 / 100

Design Arena

Category	Elo
UI component	1261.0
Website	1236.0
Code categories	1227.0
Game dev	1200.0
3D	1198.0
Data viz	1167.0
SVG	1095.0

Performance History

53 snapshots

Price History

0 recorded changes

No price changes recorded yet.

Usage (OpenRouter)

Historical usage

Activity Timeline

No events yet.

Versus

Nearest peers by price, context, and capabilities.

No comparable models found.

First seen 3d ago · Last verified 4m ago