#25
DeepSeek V3.1 Thinking
deepseek-reasoner
0.0
Helpfulness
Instruction Following
Comprehension
Empathy
Creative Writing
Helpfulness
0.0
Instruction Following
0.0
Comprehension
0.0
Empathy
0.0
Creative Writing
0.0
Speed
Avg 40 tok/s
Release Date
August 21, 2025
Lab
DeepSeek
Type
Open Source
Context Size
128K
Max Output Tokens
32.8K
Cost per 1 million tokens
$0.07 / $1.68
Model Inputs*
Text
Model Outputs*
Text
Tool Calling*
Enabled
Overall Assistant Score
An average score combining the 5 main categories.
82.72 pts
Rank #25
50th Percentile
0
Novice
33
Capable
66
Proficient
100
Expert
DeepSeek V3.1 Thinking activates the model's chain-of-thought capabilities, significantly boosting performance in math, coding, and complex reasoning tasks. While slower than the standard mode due to its 'thinking' process, it achieves markedly higher scores on logic-heavy benchmarks like AIME and GPQA. It sacrifices some speed and conversational fluidity for depth and analytical precision.

Intelligence

Overall Score; Higher is better

Gemini 3 Fl…
GPT-5 Mini
o4-mini
Grok 4 Fast
o1
DeepSeek V3…
DeepSeek V3…
o3-mini
Grok 3
Grok 4

Speed

Output Tokens per Second; Higher is better

Gemini 2.5 …
Claude Opus…
Gemini 3 Pro
o3
Claude Opus…
Claude Opus…
DeepSeek V3…
Kimi K2 Thi…
DeepSeek V3…
DeepSeek R1

Price

USD per 1M Tokens; Lower is better

GLM 4.5 Air
GPT-5 Nano
Llama 3.3 8…
DeepSeek V3…
DeepSeek V3…
DeepSeek V3…
GLM 4.7 Fla…
Gemini 2.0 …
Gemini 2.0 …
Gemini 2.5 …

deepseek Models

Overall Score; Same provider comparison

DeepSeek V3…
DeepSeek V3…
DeepSeek V3…
DeepSeek R1
DeepSeek V3…
DeepSeek V3

DeepSeek Family

Overall Score; Same model family comparison

DeepSeek V3…
DeepSeek V3…
DeepSeek V3…
DeepSeek R1
DeepSeek V3…
DeepSeek V3

Closest Rivals

Overall Score; Nearest by overall rank

Gemini 3 Fl…
GPT-5 Mini
o4-mini
Grok 4 Fast
o1
DeepSeek V3…
DeepSeek V3…
o3-mini
Grok 3
Grok 4