Z.ai: GLM 4.7 Flash
z-ai/glm-4.7-flash
Score 77
Input / M
Output / M
Context
Max Output
16K
Usage
Unranked
Quality
70.5 / 100
Providers
4
Best Uptime 30m
99.80%
Released
Jan 19, 2026
Modalities
text
About
As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning,...
Capabilities
Vision
Tools
Reasoning
Streaming
JSON Mode
Providers · 4
Per-provider pricing, uptime
| Provider | Model ID | Input/M | Output/M | Context | Max Out | Quant | Uptime 30m |
|---|---|---|---|---|---|---|---|
| DeepInfra (Recommended) | z-ai/glm-4.7-flash-20260119 | $0.060 | $0.400 | 203K | 16K | bf16 | 99.80% |
| Venice | z-ai/glm-4.7-flash-20260119 | $0.125 | $0.500 | 128K | 16K | fp8 | 95.58% |
| Cloudflare | z-ai/glm-4.7-flash-20260119 | $0.061 | $0.400 | 131K | 131K | unknown | 57.17% |
| Novita | z-ai/glm-4.7-flash-20260119 | $0.070 | $0.400 | 200K | 128K | bf16 | 56.32% |
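The per-provider prices above can be turned into a per-request cost estimate. The following is a minimal sketch, assuming that "Input/M" and "Output/M" mean US dollars per one million tokens; the names `ProviderPrice`, `request_cost`, and `cheapest` are hypothetical helpers introduced for illustration and are not part of any provider's API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ProviderPrice:
    """One row of the provider table (prices assumed to be USD per 1M tokens)."""
    name: str
    input_per_m: float
    output_per_m: float


# Values copied from the provider table above.
PROVIDERS = [
    ProviderPrice("DeepInfra", 0.060, 0.400),
    ProviderPrice("Venice", 0.125, 0.500),
    ProviderPrice("Cloudflare", 0.061, 0.400),
    ProviderPrice("Novita", 0.070, 0.400),
]


def request_cost(p: ProviderPrice, input_tokens: int, output_tokens: int) -> float:
    """Estimated cost in USD of one request at the listed per-million rates."""
    return (input_tokens * p.input_per_m + output_tokens * p.output_per_m) / 1_000_000


def cheapest(providers, input_tokens: int, output_tokens: int) -> ProviderPrice:
    """Provider with the lowest estimated cost for the given token counts."""
    return min(providers, key=lambda p: request_cost(p, input_tokens, output_tokens))


if __name__ == "__main__":
    # Hypothetical workload: 100K input tokens and 10K output tokens.
    for p in PROVIDERS:
        print(f"{p.name}: ${request_cost(p, 100_000, 10_000):.4f}")
    print("Cheapest:", cheapest(PROVIDERS, 100_000, 10_000).name)
```

Price is only one factor: the table also lists different context limits, maximum output lengths, quantization, and 30-minute uptime per provider, which this sketch deliberately ignores.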
Benchmarks
Quality index 70.5 / 100
Design Arena
| Category | Elo |
|---|---|
| UI component | 1261.0 |
| Website | 1236.0 |
| Code categories | 1227.0 |
| Game dev | 1200.0 |
| 3D | 1198.0 |
| Data viz | 1167.0 |
| SVG | 1095.0 |
Performance History
53 snapshots
Price History
0 recorded changes
No price changes recorded yet.
Usage (OpenRouter)
Historical usage
Activity Timeline
- No events yet.
Versus
Nearest peers by price, context, and capabilities.
No comparable models found.
First seen 3d ago · Last verified 4m ago
