AI Math Dataset

Creator

Creator

Seonglae Cho

Created

Created

2023 Oct 16 12:28

Editor

Editor

Seonglae Cho

Edited

Edited

2025 Aug 1 22:9

Refs

Refs

Dataset-based AI Benchmark

AI Coding Benchmark

https://epoch.ai/frontiermath/the-benchmark

AI Reasoning Datasets

Test only

AIME (American Invitational Mathematics Examination)

Can AI do maths yet? Thoughts from a mathematician.

So the big news this week is that o3, OpenAI’s new language model, got 25% on FrontierMath. Let’s start by explaining what this means.

Can AI do maths yet? Thoughts from a mathematician.

https://xenaproject.wordpress.com/2024/12/22/can-ai-do-maths-yet-thoughts-from-a-mathematician/

Can AI do maths yet? Thoughts from a mathematician.

IMO 2025 Gemini 2.5 Gold Medal

notion image

https://arxiv.org/pdf/2507.15855

Recommendations

///////