content·Independent✓ Verified

Benchmark LLM Performance on Legal Documents with Google Sheets and OpenRouter

This workflow demonstrates a simple way to run evals on a set of test cases stored in a Google Sheet.

About

This workflow demonstrates a simple way to run evals on a set of test cases stored in a Google Sheet. The example we are using comes from an info extraction task dataset, where we tested 6 different LLMs on 18 different test cases. This workflow extends the functionality of my simple eval for benchmarking legal tasks here. Rather than running executions sequentially (waiting for each one to respond before making another request), we use parallel processing to fire 2 requests every second. Yo

Tags

csv api langchain json n8n workflow automation

Pricing

Free

Visit website ↗

Marketplace

Independent

Benchmark LLM Performance on Legal Documents with Google Sheets and OpenRouter

About

Tags

More in content