#8
o3
o3-2025-04-16

OpenAI's o3 is a reasoning model released in April 2025, built for math, science, coding, and other complex analytical tasks. It reaches state-of-the-art results on benchmarks like AIME and GPQA Diamond while remaining strong on general helpfulness and instruction following relative to GPT-4-class models. It is slower and more compute-intensive than fast chat models, but in exchange it offers deeper comprehension and more rigorous problem solving, which suits it to high-stakes analysis, research workflows, and difficult multi-step reasoning problems.

Performance Metrics

Overall score: 88.84
Helpfulness: 89.5
Empathy: 90
Instruction Following: 93.2
Creative Writing: 77.5
Comprehension: 94
Speed: 58 tok/s (average)
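
The headline 88.84 is not labeled on the page, but it matches the unweighted mean of the five category scores. The short check below is a sketch under that assumption; the page does not state how the figure is computed, and the variable names are illustrative only.

```python
# Category scores as listed on this page.
scores = {
    "Helpfulness": 89.5,
    "Empathy": 90.0,
    "Instruction Following": 93.2,
    "Creative Writing": 77.5,
    "Comprehension": 94.0,
}

# Assumption: the headline figure is a plain (unweighted) mean of the five scores.
overall = sum(scores.values()) / len(scores)

print(f"{overall:.2f}")  # prints 88.84
```

Creative Writing (77.5) is the only category below the mean, so it accounts for most of the gap between the headline figure and the other four scores.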

Model Specifications

Release Date: April 16, 2025
Lab: OpenAI
Type: Proprietary
Context Size: 200K
Max Output Tokens: 100K
Cost per 1M tokens: $2.00 / $8.00
Model Inputs: Text, Images, Code
Model Outputs: Text
Tool Calling: Enabled
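
As a rough guide to what these figures mean for a single request, the sketch below estimates cost and generation time. It assumes the two listed prices are input and output rates respectively (the page does not label them) and that the 58 tok/s average applies to output tokens. The function names and the example request sizes are hypothetical.

```python
# Assumption: "$2.00 / $8.00" means input price / output price per 1M tokens.
INPUT_PRICE_PER_M = 2.00   # USD per 1M input tokens (assumed)
OUTPUT_PRICE_PER_M = 8.00  # USD per 1M output tokens (assumed)
AVG_TOKENS_PER_SEC = 58    # average speed listed on this page


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimated request cost in USD under the pricing assumption above."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


def estimate_seconds(output_tokens: int) -> float:
    """Estimated generation time, assuming the average speed holds throughout."""
    return output_tokens / AVG_TOKENS_PER_SEC


# Hypothetical request: 20,000 input tokens and 5,000 output tokens.
cost = estimate_cost(20_000, 5_000)
seconds = estimate_seconds(5_000)
print(f"${cost:.2f}, about {seconds:.0f} s")  # prints $0.08, about 86 s
```

Actual latency and billing may differ from this estimate, since the listed speed is an average and the page does not say which tokens count toward each price.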
