AI Agent Benchmark for Real-World Professional Workflows
Agents' Last Exam evaluates AI agents on long-horizon professional workflows with verifiable outcomes across industries such as finance, robotics, bioinformatics, media, and more.
https://agents-last-exam.org/

Seonglae Cho