simple-evals (openai), updated 2025 Aug 11 20:57

basicv8vc/SimpleQA · Datasets at Hugging Face
https://huggingface.co/datasets/basicv8vc/SimpleQA

Introducing SimpleQA
A factuality benchmark called SimpleQA that measures the ability for language models to answer short, fact-seeking questions.
https://openai.com/index/introducing-simpleqa/

Leaderboard

AI SimpleQA Leaderboard
Compare AI model factuality with the SimpleQA leaderboard. Includes scores for AIME'25 (Math), Chatbot Arena, and ArenaHard benchmarks.
https://blog.elijahlopez.ca/posts/ai-simpleqa-leaderboard/