research·IndependentNew

Hermes Bench Tool Call

Hermes Agent benchmark v0.1 — evaluate local models on the actual tool-calling patterns Hermes users hit. Reproducible, trace-capturing, training-data ready.

About

Hermes Agent benchmark v0.1 — evaluate local models on the actual tool-calling patterns Hermes users hit. Reproducible, trace-capturing, training-data ready.

Advanced AI for research and writing

$20/mo

research

Elicit

AI research assistant for literature review

$12/mo

research

Consensus

AI search engine for scientific papers

$13/mo

research

Claude Opus 4

Anthropics most intelligent model for complex tasks

Usage-based pricing

Hermes Bench Tool Call

About

Tags

More in research