
OpenAI SimpleQA

Creator: Seonglae Cho
Created: 2024 Dec 1 1:35
Editor: Seonglae Cho
Edited: 2025 Aug 2 15:22
Refs

simple-evals
openai • Updated 2025 Aug 11 20:57

basicv8vc/SimpleQA · Datasets at Hugging Face
https://huggingface.co/datasets/basicv8vc/SimpleQA

Introducing SimpleQA
SimpleQA is a factuality benchmark that measures the ability of language models to answer short, fact-seeking questions.
https://openai.com/index/introducing-simpleqa/

Leaderboard

AI SimpleQA Leaderboard
Compare AI model factuality with the SimpleQA leaderboard. Includes scores for AIME'25 (Math), Chatbot Arena, and ArenaHard benchmarks.
https://blog.elijahlopez.ca/posts/ai-simpleqa-leaderboard/

Copyright Seonglae Cho