research·Independent✓ Verified

ClawBench

Open-source benchmark for browser AI agents on 153 everyday online tasks across 144 live websites. 5-layer recording + DOM-match + LLM judge. Top score 33.3%.

About

Open-source benchmark for browser AI agents on 153 everyday online tasks across 144 live websites. 5-layer recording + DOM-match + LLM judge. Top score 33.3%.

Advanced AI for research and writing

$20/mo

research

Elicit

AI research assistant for literature review

$12/mo

research

Consensus

AI search engine for scientific papers

$13/mo

research

Claude Opus 4

Anthropics most intelligent model for complex tasks

Usage-based pricing

ClawBench

About

Tags

More in research