Llama 3.3 8B Instruct by meta - AI Model Benchmark

#51

Llama 3.3 8B Instruct

llama-3.3-8b-instruct

Helpfulness

Instruction Following

Comprehension

Empathy

Creative Writing

Helpfulness

0.0

Instruction Following

0.0

Comprehension

0.0

Empathy

0.0

Creative Writing

0.0

Speed

Avg 98 tok/s

Release Date

December 6, 2024

Lab

Meta

Type

Open Source

Context Size

128K

Max Output Tokens

8.2K

Cost per 1 million tokens

$0.05 / $0.10

Model Inputs*

Text

Model Outputs*

Text

Tool Calling*

Enabled

Overall Assistant Score

An average score combining the 5 main categories.

65.67 pts

Rank #51

2th Percentile

0

Novice

33

Capable

66

Proficient

100

Expert

Llama 3.3 8B is the lightweight, hyper-efficient entry in the Llama 3.3 series, designed for edge devices and low-latency applications. While it sacrifices some depth compared to its larger counterparts, it offers surprising reasoning capabilities and strict instruction following for its size class, making it an ideal choice for rapid, cost-effective text processing.

Intelligence

Overall Score; Higher is better

GPT 4o

GLM 4.6

Llama 4 Sco…

Gemini 2.0 …

DeepSeek V3

Gemini 2.5 …

GLM 4.5 Air

Gemini 2.0 …

GLM 4.7 Fla…

Llama 3.3 8…

Speed

Output Tokens per Second; Higher is better

GPT-5 Nano

Llama 3.3 8…

Llama 4 Sco…

Gemini 2.5 …

o4-mini

DeepSeek V3…

Llama 4 Mav…

GPT-5 Mini

Gemini 2.0 …

Claude Haik…

Price

USD per 1M Tokens; Lower is better

GLM 4.5 Air

GPT-5 Nano

Llama 3.3 8…

DeepSeek V3…

DeepSeek V3…

DeepSeek V3…

GLM 4.7 Fla…

Gemini 2.0 …

Gemini 2.0 …

Gemini 2.5 …

meta Models

Overall Score; Same provider comparison

Llama 4 Mav…

Llama 3.3 7…

Llama 4 Sco…

Llama 3.3 8…

Llama Family

Overall Score; Same model family comparison

Llama 4 Mav…

Llama 3.3 7…

Llama 4 Sco…

Llama 3.3 8…

Closest Rivals

Overall Score; Nearest by overall rank

GPT 4o

GLM 4.6

Llama 4 Sco…

Gemini 2.0 …

DeepSeek V3

Gemini 2.5 …

GLM 4.5 Air

Gemini 2.0 …

GLM 4.7 Fla…

Llama 3.3 8…