Name: GLM 4.7 Flash
Author: Zhipu AI

glm-4.7-flash

Helpfulness: 0.0
Instruction Following: 0.0
Comprehension: 0.0
Empathy: 0.0
Creative Writing: 0.0

Speed: 90.1 tok/s (average)
Release Date: January 19th, 2026
Lab: Zhipu AI
Type: Open Source
Context Size: 128K
Max Output Tokens: 131.1K
Cost per 1 million tokens: $0.07 / $0.40
Model Inputs*: Text, Images
Model Outputs*: Text
Tool Calling*: Enabled

Overall Assistant Score: 67.68 pts (an average score combining the 5 main categories)
Rank: #50
Percentile: 4th

[Score gauge; labels: Novice, Capable, Proficient, 100, Expert]

GLM-4.7-Flash is a 30B-A3B Mixture-of-Experts (MoE) model optimized for speed and cost-efficiency. It scores 59.2% on SWE-bench, 79.5% on TAU-Bench, and 91.6% on AIME, which makes it well suited to coding and reasoning tasks. It has a 128K context window and a 131K output limit, and it costs $0.07/$0.40 per million tokens, 10x cheaper than standard GLM-4.7. It is best for developers who prioritize speed and cost over maximum capability.
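To make the price and speed figures concrete, here is a minimal Python sketch that estimates the cost and output generation time of a single request. It rests on several assumptions that the page does not confirm: the two prices are read as input and output rates (in that order), the 90.1 tok/s average is treated as a constant output rate, and time to first token is ignored. The names `estimate_request` and `Estimate` are hypothetical helpers for illustration, not part of any Zhipu AI API.

```python
from dataclasses import dataclass

# Figures copied from the listing. Reading the two prices as
# input / output rates is an ASSUMPTION; the page does not label them.
INPUT_USD_PER_MTOK = 0.07
OUTPUT_USD_PER_MTOK = 0.40
# Average output speed from the listing, assumed constant here.
AVG_OUTPUT_TOK_PER_S = 90.1


@dataclass
class Estimate:
    cost_usd: float
    generation_seconds: float


def estimate_request(input_tokens: int, output_tokens: int) -> Estimate:
    """Rough per-request cost and output generation time (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Prices are quoted per 1 million tokens, so scale down by 1e6.
    cost = (input_tokens * INPUT_USD_PER_MTOK
            + output_tokens * OUTPUT_USD_PER_MTOK) / 1_000_000
    # Only output tokens are counted toward generation time.
    seconds = output_tokens / AVG_OUTPUT_TOK_PER_S
    return Estimate(cost_usd=cost, generation_seconds=seconds)


if __name__ == "__main__":
    est = estimate_request(input_tokens=20_000, output_tokens=2_000)
    # prints: cost: $0.0022, time: 22.2 s
    print(f"cost: ${est.cost_usd:.4f}, time: {est.generation_seconds:.1f} s")
```

Under these assumptions, a request with 20,000 input tokens and 2,000 output tokens costs roughly a fifth of a cent and takes about 22 seconds to generate. Real latency and billing depend on the provider serving the model.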

[Chart: Intelligence. Overall Score; higher is better. Models shown: GPT 4o, GLM 4.6, Llama 4 Sco…, Gemini 2.0 …, DeepSeek V3, Gemini 2.5 …, GLM 4.5 Air, Gemini 2.0 …, GLM 4.7 Fla…, Llama 3.3 8…]

[Chart: Speed. Output Tokens per Second; higher is better. Models shown: Claude Haik…, Grok 4 Fast, o3-mini, Gemini 2.5 …, DeepSeek V3…, GLM 4.7 Fla…, Gemini 2.0 …, Grok 4 Fast…, Llama 3.3 7…, Gemini 3.1 …]

[Chart: Price. USD per 1M Tokens; lower is better. Models shown: GPT-5 Nano, Llama 3.3 8…, DeepSeek V3…, GLM 4.7 Fla…, Gemini 2.0 …, Gemini 2.5 …, DeepSeek V3…]

[Chart: Zhipu AI Models. Overall Score; same provider comparison. Models shown: GLM 5, GLM 4.7, GLM 4.6, GLM 4.5 Air, GLM 4.7 Fla…]

[Chart: Zhipu AI Family. Overall Score; same model family comparison. Models shown: GLM 5, GLM 4.7, GLM 4.6, GLM 4.5 Air, GLM 4.7 Fla…]

[Chart: Closest Rivals. Overall Score; nearest by overall rank. Models shown: GPT 4o, GLM 4.6, Llama 4 Sco…, Gemini 2.0 …, DeepSeek V3, Gemini 2.5 …, GLM 4.5 Air, Gemini 2.0 …, GLM 4.7 Fla…, Llama 3.3 8…]