MMLU

Created
2023 Jun 2 12:29
Creator
Seonglae Cho
Editor
Edited
2024 Nov 30 10:42
Refs

Massive Multitask Language Understanding

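A multiple-choice benchmark that tests undergraduate-level knowledge across 57 subjects
  • Human expert accuracy is the reference point that model scores are compared against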
 
 
 
MMLU-Redux is a set of 3,000 MMLU questions that were manually re-annotated and labeled with an error taxonomy. It corrects errors in the original benchmark so that scores reflect LLM capabilities more faithfully.
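
A minimal sketch of why re-annotation matters for scoring, assuming each question carries its original answer label plus an error tag. The `Item` class, the tag names (`"ok"`, `"wrong_ground_truth"`), and the toy data are hypothetical illustrations; they are not the actual MMLU-Redux schema or taxonomy.

```python
from dataclasses import dataclass


@dataclass
class Item:
    original_answer: str  # answer label shipped with the benchmark
    error_type: str       # "ok" or a hypothetical error tag from re-annotation
    prediction: str       # option chosen by the model


def accuracy(items):
    """Fraction of items whose prediction matches the original label."""
    if not items:
        return 0.0
    return sum(i.prediction == i.original_answer for i in items) / len(items)


def accuracy_on_clean(items):
    """Accuracy restricted to items the re-annotation marked as error-free."""
    return accuracy([i for i in items if i.error_type == "ok"])


# Toy data: the third question has a mislabeled answer key (hypothetical tag).
items = [
    Item("A", "ok", "A"),
    Item("B", "ok", "C"),
    Item("C", "wrong_ground_truth", "D"),
    Item("D", "ok", "D"),
]

print(f"all questions:   {accuracy(items):.3f}")
print(f"clean questions: {accuracy_on_clean(items):.3f}")
```

In this toy case the model is penalized on the full set for a question whose answer key is itself wrong; filtering by the error tag removes that noise from the score.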
 
 
 
