Creator: Allen Institute for AI
Released: 2025-12-12
Intelligence (Artificial Analysis Index): 13.9
Coding (Artificial Analysis Index): 9.8
Input price: $0.00 per 1M input tokens
Output price: $0.00 per 1M output tokens
Blended price (3:1 blended): $0.00 per 1M tokens
Speed: 0 tokens / sec
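
The "3:1 blended" price can be read as a weighted average of the input and output prices. The sketch below assumes the ratio means three input tokens for every output token. That weighting, and the function name `blended_price`, are assumptions for illustration and are not stated on this page.

```python
def blended_price(input_per_m: float, output_per_m: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average of input and output prices, in USD per 1M tokens.

    Assumption: "3:1 blended" weights the input price 3x and the output price 1x.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_per_m + output_weight * output_per_m) / total_weight


# Prices as listed on this page: $0.00 input, $0.00 output.
print(blended_price(0.00, 0.00))
```

With both listed prices at $0.00, any weighting gives a blended price of $0.00, which matches the figure shown.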

Benchmark breakdown

Independent evaluation scores, normalized to 0–100.

GPQA Diamond: 59
Humanity’s Last Exam: 6
MMLU-Pro: 76
SciCode: 29
LiveCodeBench: 70
MATH-500: no score listed
AIME 2025: 77
τ²-Bench (agentic): 0
Terminal-Bench Hard: 0
IFBench: 66
Via Artificial Analysis