#48
Llama 3.3 8B Instruct
llama-3.3-8b-instruct
0.0
Helpfulness
Instruction Following
Comprehension
Empathy
Creative Writing
Helpfulness
0.0
Instruction Following
0.0
Comprehension
0.0
Empathy
0.0
Creative Writing
0.0
Speed
Avg 98 tok/s
Release Date
December 6, 2024
Lab
Meta
Type
Open Source
Context Size
128K
Max Output Tokens
8.2K
Cost per 1 million tokens
$0.05 / $0.10
Model Inputs*
Text
Model Outputs*
Text
Tool Calling*
Enabled
Overall Assistant Score
An average score combining the 5 main categories.
65.67 pts
Rank #48
2th Percentile
0
Novice
33
Capable
66
Proficient
100
Expert
Llama 3.3 8B is the lightweight, hyper-efficient entry in the Llama 3.3 series, designed for edge devices and low-latency applications. While it sacrifices some depth compared to its larger counterparts, it offers surprising reasoning capabilities and strict instruction following for its size class, making it an ideal choice for rapid, cost-effective text processing.

Intelligence

Overall Score; Higher is better

GPT 4o
GLM 4.6
Llama 4 Sco…
Gemini 2.0 …
DeepSeek V3
Gemini 2.5 …
GLM 4.5 Air
Gemini 2.0 …
GLM 4.7 Fla…
Llama 3.3 8…

Speed

Output Tokens per Second; Higher is better

GPT-5 Nano
Llama 3.3 8…
Llama 4 Sco…
Gemini 2.5 …
o4-mini
DeepSeek V3…
Llama 4 Mav…
GPT-5 Mini
Gemini 2.0 …
Claude Haik…

Price

USD per 1M Tokens; Lower is better

GLM 4.5 Air
GPT-5 Nano
Llama 3.3 8…
DeepSeek V3…
DeepSeek V3…
DeepSeek V3…
GLM 4.7 Fla…
Gemini 2.0 …
Gemini 2.0 …
Gemini 2.5 …

meta Models

Overall Score; Same provider comparison

Llama 4 Mav…
Llama 3.3 7…
Llama 4 Sco…
Llama 3.3 8…

Llama Family

Overall Score; Same model family comparison

Llama 4 Mav…
Llama 3.3 7…
Llama 4 Sco…
Llama 3.3 8…

Closest Rivals

Overall Score; Nearest by overall rank

GPT 4o
GLM 4.6
Llama 4 Sco…
Gemini 2.0 …
DeepSeek V3
Gemini 2.5 …
GLM 4.5 Air
Gemini 2.0 …
GLM 4.7 Fla…
Llama 3.3 8…