Creator

Allen Institute for AI

Released

2025-11-20

Intelligence

8.1

Artificial Analysis Index

Coding

3.4

Artificial Analysis Index

In $/1M

$0.10

input tokens

Out $/1M

$0.20

output tokens

Blended $/1M

$0.13

3:1 blended

Speed

0

tokens / sec

Benchmark breakdown

Independent evaluation scores, normalized to 0–100.

GPQA Diamond
40
Humanity’s Last Exam
6
MMLU-Pro
52
SciCode
10
LiveCodeBench
27
MATH-500
AIME 2025
41
τ²-Bench (agentic)
13
Terminal-Bench Hard
0
IFBench
33
Via Artificial Analysis