Creator
Alibaba
Released
2025-07-21
Intelligence
25.0
Artificial Analysis Index
Coding
22.1
Artificial Analysis Index
In $/1M
$0.20
input tokens
Out $/1M
$0.82
output tokens
Blended $/1M
$0.36
3:1 blended
Speed
47
tokens / sec
Benchmark breakdown
Independent evaluation scores, normalized to 0–100.
GPQA Diamond
75
Humanity’s Last Exam
11
MMLU-Pro
83
SciCode
36
LiveCodeBench
52
MATH-500
98
AIME 2025
72
τ²-Bench (agentic)
33
Terminal-Bench Hard
15
IFBench
46