MLE Bench
mle-bench
openai • Updated 2025 Apr 30 18:27

MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering

We introduce MLE-bench, a benchmark for measuring how well AI agents perform at machine learning engineering.

https://openai.com/index/mle-bench/