Name: DeepSeek V3.1 Thinking
Author: deepseek

#28

DeepSeek V3.1 Thinking

deepseek-reasoner

Helpfulness

Instruction Following

Comprehension

Empathy

Creative Writing

Helpfulness

0.0

Instruction Following

0.0

Comprehension

0.0

Empathy

0.0

Creative Writing

0.0

Speed

Avg 40 tok/s

Release Date

August 21, 2025

Lab

DeepSeek

Type

Open Source

Context Size

128K

Max Output Tokens

32.8K

Cost per 1 million tokens

$0.07 / $1.68

Model Inputs*

Text

Model Outputs*

Text

Tool Calling*

Enabled

Overall Assistant Score

An average score combining the 5 main categories.

82.72 pts

Rank #28

47th Percentile

Novice

Capable

Proficient

100

Expert

DeepSeek V3.1 Thinking activates the model's chain-of-thought capabilities, significantly boosting performance in math, coding, and complex reasoning tasks. While slower than the standard mode due to its 'thinking' process, it achieves markedly higher scores on logic-heavy benchmarks like AIME and GPQA. It sacrifices some speed and conversational fluidity for depth and analytical precision.

Intelligence

Overall Score; Higher is better

Gemini 3 Fl…

GPT-5 Mini

o4-mini

Grok 4 Fast

DeepSeek V3…

o3-mini

Grok 3

Grok 4

Speed

Output Tokens per Second; Higher is better

Gemini 2.5 …

Claude Opus…

Gemini 3 Pro

Claude Opus…

DeepSeek V3…

Kimi K2 Thi…

DeepSeek V3…

DeepSeek R1

Price

USD per 1M Tokens; Lower is better

GLM 4.5 Air

GPT-5 Nano

Llama 3.3 8…

DeepSeek V3…

GLM 4.7 Fla…

Gemini 2.0 …

Gemini 2.5 …

deepseek Models

Overall Score; Same provider comparison

DeepSeek V3…

DeepSeek R1

DeepSeek V3…

DeepSeek V3

DeepSeek Family

Overall Score; Same model family comparison

DeepSeek V3…

DeepSeek R1

DeepSeek V3…

DeepSeek V3

Closest Rivals

Overall Score; Nearest by overall rank

Gemini 3 Fl…

GPT-5 Mini

o4-mini

Grok 4 Fast

DeepSeek V3…

o3-mini

Grok 3

Grok 4