Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Object/Multimodal AI/Multimodal Benchmark/
EnigmaEval
Search

EnigmaEval

Creator
Creator
Seonglae Cho
Created
Created
2025 Feb 19 11:57
Editor
Editor
Seonglae Cho
Edited
Edited
2025 Feb 19 11:57
Refs
Refs
puzzle
 
 
 
 
 
EnigmaEval: A Benchmark of Long Multimodal Reasoning Challenges
As language models master existing reasoning benchmarks, we need new challenges to evaluate their cognitive frontiers. Puzzle-solving events are rich repositories of challenging multimodal...
EnigmaEval: A Benchmark of Long Multimodal Reasoning Challenges
https://arxiv.org/abs/2502.08859
EnigmaEval: A Benchmark of Long Multimodal Reasoning Challenges
 
 

Recommendations

Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Object/Multimodal AI/Multimodal Benchmark/
EnigmaEval
Copyright Seonglae Cho