Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Object/AI Code Generation/AI Coding Benchmark/
EditBench
Search

EditBench

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2026 Apr 24 18:50
Editor
Editor
Seonglae ChoSeonglae Cho
Edited
Edited
2026 Apr 24 18:55
Refs
Refs
 
 
 
 
editbench
waynchi • Updated 2026 Apr 23 8:13
arxiv.org
https://arxiv.org/pdf/2511.04486
EDIT-Bench: Evaluating LLM Abilities to Perform Real-World Instructed Code Edits
A benchmark for evaluating LLM code editing capabilities built on real-world edit contexts and instructions collected in-the-wild from 500 developers. EDIT stands for Evaluation of Developer Instructed Tasks.
EDIT-Bench: Evaluating LLM Abilities to Perform Real-World Instructed Code Edits
https://waynechi.com/edit-bench/
EDIT-Bench: Evaluating LLM Abilities to Perform Real-World Instructed Code Edits
 
 
 

Recommendations

Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Object/AI Code Generation/AI Coding Benchmark/
EditBench
Copyright Seonglae Cho