Overconfidence is a problem
WorldVQA: Measuring Atomic World Knowledge in MLLMs
WorldVQA is a benchmark designed to evaluate atomic vision-centric world knowledge in Multimodal Large Language Models (MLLMs).
https://www.kimi.com/blog/worldvqa.html
Seonglae Cho
Seonglae Cho