DeepHermes 3 - Llama-3.1 8B Preview

Non-reasoning

Creator

Nous Research

Released

2025-02-13

Intelligence

2.3

Artificial Analysis Index

Coding

—

Artificial Analysis Index

In $/1M

$0.00

input tokens

Out $/1M

$0.00

output tokens

Blended $/1M

$0.00

3:1 blended

Speed

0

tokens / sec

Benchmark breakdown

Independent evaluation scores, normalized to 0–100.

GPQA Diamond

27

Humanity’s Last Exam

4

MMLU-Pro

37

SciCode

9

LiveCodeBench

9

MATH-500

22

AIME 2025

—

τ²-Bench (agentic)

—

Terminal-Bench Hard

—

IFBench

—

Via Artificial Analysis

44B

Welcome.

This panel sticks with you. Pick anything below and it opens right beside it, so you can dig through 60,000-plus records without ever losing your spot here.

Travel 44B

LibraryPapers, policy, standards, statute LabsEvery organization building AI ModelsIntelligence, price, and speed BenchmarksThe evaluation catalog IncidentsAI failures and harms SearchOne field across everything DashboardThe state of AI accountability in NY 44B RegistryThe Art. 44-B compliance portal