AI Reasoning DatasetsFrontierMathOmni MathMath VistaOpenWebMathMATH datasetGSM8KMathQAProsQALogiQA Test onlyAIME (American Invitational Mathematics Examination)Can AI do maths yet? Thoughts from a mathematician.So the big news this week is that o3, OpenAI’s new language model, got 25% on FrontierMath. Let’s start by explaining what this means.https://xenaproject.wordpress.com/2024/12/22/can-ai-do-maths-yet-thoughts-from-a-mathematician/IMO 2025 Gemini 2.5 Gold Medalarxiv.orghttps://arxiv.org/pdf/2507.15855