AI Reasoning Datasets
Test only
- AIME (American Invitational Mathematics Examination)
Can AI do maths yet? Thoughts from a mathematician.
So the big news this week is that o3, OpenAI’s new language model, got 25% on FrontierMath. Let’s start by explaining what this means.
https://xenaproject.wordpress.com/2024/12/22/can-ai-do-maths-yet-thoughts-from-a-mathematician/

- IMO 2025 Gemini 2.5 Gold Medal


Seonglae Cho
