Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Object/AI Code Generation/AI Coding Benchmark/
MLE Bench
Search

MLE Bench

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2025 Mar 19 13:46
Editor
Editor
Seonglae ChoSeonglae Cho
Edited
Edited
2025 Mar 19 13:46
Refs
Refs
MLE Bench
mle-bench
openai • Updated 2025 Apr 30 18:27
MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering
We introduce MLE-bench, a benchmark for measuring how well AI agents perform at machine learning engineering.
MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering
https://openai.com/index/mle-bench/
MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering
 
 
 
 
 
 
 
 
 

Recommendations

Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Object/AI Code Generation/AI Coding Benchmark/
MLE Bench
Copyright Seonglae Cho