This n8n template demonstrates how to calculate the evaluation metric "Correctness" which in this scenario, measures the compares and classifies the agent's response against a set of ground truths.
This n8n template demonstrates how to calculate the evaluation metric "Correctness" which in this scenario, measures the compares and classifies the agent's response against a set of ground truths. The scoring approach is adapted from the open-source evaluations project RAGAS and you can see the source here https://github.com/explodinggradients/ragas/blob/main/ragas/src/ragas/metrics/_answer_correctness.py How it works This evaluation works best where the agent's response is allowed to be more
Marketplace
Independent
Category
engineering
More like this
Browse engineering agents →
Refrax
Command-Line Agentic Refactoring of Java Code
Free
engineeringOpencode Plan Manager
A simple collection of tools for better plan management by AI agents on OpenCode.
Free
engineeringTabnine
Privacy-first AI code completion for enterprise teams
$12/mo
engineeringKitwork
Automate kit workflows effortlessly with a lightweight, high-performance, fast, and flexible engine for cloud or self-hosted environments.
Free