Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Object/AI Planning/
PlanBench
Search

PlanBench

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2025 Apr 16 13:33
Editor
Editor
Seonglae ChoSeonglae Cho
Edited
Edited
2025 Apr 16 13:33
Refs
Refs
 
 
 
 
 
 
 
PlanBench: An Extensible Benchmark for Evaluating Large Language...
Generating plans of action, and reasoning about change have long been considered a core competence of intelligent agents. It is thus no surprise that evaluating the planning and reasoning...
PlanBench: An Extensible Benchmark for Evaluating Large Language...
https://arxiv.org/abs/2206.10498
PlanBench: An Extensible Benchmark for Evaluating Large Language...
 

Recommendations

Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Object/AI Planning/
PlanBench
Copyright Seonglae Cho