#47
GLM 4.7 Flash
glm-4.7-flash
0.0
Helpfulness: 0.0
Instruction Following: 0.0
Comprehension: 0.0
Empathy: 0.0
Creative Writing: 0.0
Speed: Avg 90.1 tok/s
Release Date: January 19th, 2026
Lab: Zhipu AI
Type: Open Source
Context Size: 128K
Max Output Tokens: 131.1K
Cost per 1 million tokens: $0.07 / $0.40
Model Inputs*: Text, Images
Model Outputs*: Text
Tool Calling*: Enabled
Overall Assistant Score
An average score combining the 5 main categories.
67.68 pts
Rank #47
4th Percentile
Score scale (tick marks and band labels, in order): 0, Novice, 33, Capable, 66, Proficient, 100, Expert
GLM-4.7-Flash is a 30B-A3B MoE model optimized for speed and cost-efficiency. It scores 59.2% on SWE-bench, 79.5% on TAU-Bench, and 91.6% on AIME, which makes it well suited to coding and reasoning tasks. It has a 128K context window and a 131K output limit, and it is priced at $0.07/$0.40 per million tokens, 10x cheaper than standard GLM-4.7. Best for developers who prioritize speed and cost over maximum capability.
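
To make the pricing and speed figures concrete, here is a minimal sketch that estimates the cost and generation time of a single request. The numbers are copied from this listing. Treating $0.07 as the input price and $0.40 as the output price is an assumption, since the page does not label the two values. The function names are hypothetical, and the time estimate uses the average output speed only, ignoring network latency and prompt processing.

```python
# Figures copied from the listing; the input/output split is an assumption.
INPUT_USD_PER_MTOK = 0.07      # assumed: first listed price applies to input tokens
OUTPUT_USD_PER_MTOK = 0.40     # assumed: second listed price applies to output tokens
AVG_OUTPUT_TOK_PER_SEC = 90.1  # "Avg 90.1 tok/s" from the Speed field


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the price of one request from per-million-token rates."""
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / 1_000_000


def estimate_generation_seconds(output_tokens: int) -> float:
    """Rough time to stream the output at the listed average speed."""
    return output_tokens / AVG_OUTPUT_TOK_PER_SEC


if __name__ == "__main__":
    # Example request: 20,000 prompt tokens and 2,000 generated tokens.
    cost = estimate_cost_usd(20_000, 2_000)
    seconds = estimate_generation_seconds(2_000)
    print(f"cost ~ ${cost:.4f}, generation ~ {seconds:.1f} s")
```

Under these assumptions, a request with 20,000 input tokens and 2,000 output tokens costs about $0.0022 and takes roughly 22 seconds to generate.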

Intelligence

Overall Score; higher is better. Models shown: GPT 4o, GLM 4.6, Llama 4 Sco…, Gemini 2.0 …, DeepSeek V3, Gemini 2.5 …, GLM 4.5 Air, Gemini 2.0 …, GLM 4.7 Fla…, Llama 3.3 8…

Speed

Output Tokens per Second; higher is better. Models shown: Claude Haik…, Grok 4 Fast, o3-mini, Gemini 2.5 …, DeepSeek V3…, GLM 4.7 Fla…, Gemini 2.0 …, Grok 4 Fast…, Llama 3.3 7…, Gemini 3.1 …

Price

USD per 1M Tokens; lower is better. Models shown: GPT-5 Nano, Llama 3.3 8…, DeepSeek V3…, DeepSeek V3…, DeepSeek V3…, GLM 4.7 Fla…, Gemini 2.0 …, Gemini 2.0 …, Gemini 2.5 …, DeepSeek V3…

Zhipu AI Models

Overall Score; same provider comparison. Models shown: GLM 5, GLM 4.7, GLM 4.6, GLM 4.5 Air, GLM 4.7 Fla…

Zhipu AI Family

Overall Score; same model family comparison. Models shown: GLM 5, GLM 4.7, GLM 4.6, GLM 4.5 Air, GLM 4.7 Fla…

Closest Rivals

Overall Score; nearest by overall rank. Models shown: GPT 4o, GLM 4.6, Llama 4 Sco…, Gemini 2.0 …, DeepSeek V3, Gemini 2.5 …, GLM 4.5 Air, Gemini 2.0 …, GLM 4.7 Fla…, Llama 3.3 8…