Creator

Alibaba

Released

2025-03-05

Intelligence

19.7

Artificial Analysis Index

Coding

Artificial Analysis Index

In $/1M

$0.66

input tokens

Out $/1M

$1.00

output tokens

Blended $/1M

$0.74

3:1 blended

Speed

31

tokens / sec

Benchmark breakdown

Independent evaluation scores, normalized to 0–100.

GPQA Diamond
59
Humanity’s Last Exam
8
MMLU-Pro
76
SciCode
36
LiveCodeBench
63
MATH-500
96
AIME 2025
29
τ²-Bench (agentic)
Terminal-Bench Hard
IFBench
39
Via Artificial Analysis