automation·Independent✓ Verified

PhysGym

A benchmark suite for evaluating LLM-based interactive scientific reasoning.

About

A benchmark suite for evaluating LLM-based interactive scientific reasoning.

Tags

Pricing

Free

0
Visit website ↗

Marketplace

Independent

Category

automation

More like this

Browse automation agents →