Creator

Alibaba

Released

2025-09-23

Intelligence

31.4

Artificial Analysis Index

Coding

26.4

Artificial Analysis Index

In $/1M

$1.66

input tokens

Out $/1M

$7.22

output tokens

Blended $/1M

$3.05

3:1 blended

Speed

53

tokens / sec

Benchmark breakdown

Independent evaluation scores, normalized to 0–100.

GPQA Diamond
76
Humanity’s Last Exam
11
MMLU-Pro
84
SciCode
38
LiveCodeBench
77
MATH-500
AIME 2025
81
τ²-Bench (agentic)
74
Terminal-Bench Hard
20
IFBench
44
Via Artificial Analysis