#23
Grok 4 Fast
grok-4-fast-non-reasoning
Helpfulness
0.0
Instruction Following
0.0
Comprehension
0.0
Empathy
0.0
Creative Writing
0.0
Speed
Avg 93 tok/s
Release Date
September 19, 2025
Lab
xAI
Type
Proprietary
Context Size
2M
Max Output Tokens
8.2K
Cost per 1 million tokens
$0.20 / $0.50
Model Inputs*
Text, Images, Code, Web
Model Outputs*
Text
Tool Calling*
Enabled
Overall Assistant Score
An average score combining the 5 main categories.
83.54 pts
Rank #23
54th Percentile
Score scale: 0 Novice, 33 Capable, 66 Proficient, 100 Expert
Grok 4 Fast Non-Reasoning is a lightweight variant of Grok 4 Fast optimized for low-latency everyday workloads where step-by-step reasoning traces are not required. It keeps the same large 2M-token context window and strong general capabilities but trims some of the deepest reasoning and comprehension capacity in exchange for very high responsiveness and low cost. This makes it well suited for classification, summarization, routing, and high-traffic chat experiences that still benefit from modern model quality without paying for full frontier-level thinking.
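To make the cost and speed figures on this card concrete, here is a minimal Python sketch for estimating what one request might cost and how long its output might take to generate. It assumes that "$0.20 / $0.50" means input price / output price per 1 million tokens, which is the usual convention but is not stated on the card. It also treats the 93 tok/s average as a steady output rate. The function names and the sample token counts are hypothetical.

```python
# Figures taken from the card; how they are interpreted is an assumption.
INPUT_USD_PER_M = 0.20      # assumed: input price, USD per 1M tokens
OUTPUT_USD_PER_M = 0.50     # assumed: output price, USD per 1M tokens
AVG_OUTPUT_TOK_PER_S = 93   # "Avg 93 tok/s", treated as a constant rate


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Rough cost of one request in USD under the pricing assumption above."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


def estimate_generation_seconds(output_tokens: int) -> float:
    """Rough time to stream the output, ignoring network and queueing latency."""
    return output_tokens / AVG_OUTPUT_TOK_PER_S


if __name__ == "__main__":
    # Hypothetical summarization request: long prompt, short answer.
    cost = estimate_cost_usd(input_tokens=10_000, output_tokens=500)
    seconds = estimate_generation_seconds(output_tokens=500)
    print(f"estimated cost: ${cost:.5f}, estimated generation time: {seconds:.1f}s")
```

For classification or routing workloads, where output is only a few tokens, the input term dominates the estimate, so prompt length is the main cost lever.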
Intelligence
Overall Score; Higher is better
Speed
Output Tokens per Second; Higher is better
Price
USD per 1M Tokens; Lower is better
xAI Models
Overall Score; Same provider comparison
Grok Family
Overall Score; Same model family comparison
Closest Rivals
Overall Score; Nearest by overall rank