Creator
OpenAI
Released
2025-01-31
Intelligence
25.9
Artificial Analysis Index
Coding
17.9
Artificial Analysis Index
In $/1M
$1.10
input tokens
Out $/1M
$4.40
output tokens
Blended $/1M
$1.93
3:1 blended
Speed
190
tokens / sec
Benchmark breakdown
Independent evaluation scores, normalized to 0–100.
GPQA Diamond
75
Humanity’s Last Exam
9
MMLU-Pro
79
SciCode
40
LiveCodeBench
72
MATH-500
97
AIME 2025—
τ²-Bench (agentic)
29
Terminal-Bench Hard
7
IFBench—