#39
Llama 3.3 8B Instruct
llama-3.3-8b-instruct

Llama 3.3 8B is the lightweight, hyper-efficient entry in the Llama 3.3 series, designed for edge devices and low-latency applications. While it sacrifices some depth compared to its larger counterparts, it offers surprising reasoning capabilities and strict instruction following for its size class, making it an ideal choice for rapid, cost-effective text processing.

Performance Metrics

Helpfulness
Instruction Following
Comprehension
Empathy
Creative Writing
66.80
Helpfulness
68
Empathy
65
Instruction Following
72.5
Creative Writing
62
Comprehension
66.5
Speed
Avg 98 tok/s

Model Specifications

Release Date
December 6, 2024
Lab
Meta
Type
Open Source
Context Size
128K
Max Output Tokens
8.2K
Cost per 1M tokens
$0.05 / $0.10
Model Inputs
Text
Model Outputs
Text
Tool Calling
Enabled

Compare With Similar Models