#7
o3
o3-2025-04-16Helpfulness
Instruction
Following
Comprehension
Empathy
Creative
Writing
Helpfulness
0.0
Instruction Following
0.0
Comprehension
0.0
Empathy
0.0
Creative Writing
0.0
Speed
Avg 58 tok/s
Release Date
April 16, 2025
Lab
OpenAI
Type
Proprietary
Context Size
200K
Max Output Tokens
100K
Cost per 1 million tokens
$2.00 / $8.00
Model Inputs*
Text, Images, Code
Model Outputs*
Text
Tool Calling*
Enabled
Overall Assistant Score
An average score combining the 5 main categories.
89.16 pts
Rank #7
88th Percentile
0
Novice
33
Capable
66
Proficient
100
Expert
OpenAI's o3 is a large-scale reasoning model released in April 2025, built to excel at math, science, coding, and other complex analytical tasks. It reaches state-of-the-art results on benchmarks like AIME and GPQA Diamond while maintaining strong general helpfulness and reliable instruction following compared to GPT-4-class models. Although slower and more compute-intensive than fast chat models, o3 offers significantly deeper comprehension and more rigorous problem solving, making it ideal for high-stakes analysis, research workflows, and difficult multi-step reasoning problems.
Intelligence
Overall Score; Higher is better
Speed
Output Tokens per Second; Higher is better
Price
USD per 1M Tokens; Lower is better
openai Models
Overall Score; Same provider comparison
O-Series Family
Overall Score; Same model family comparison
Closest Rivals
Overall Score; Nearest by overall rank