BrowseComp: a benchmark for browsing agentsA simple and challenging benchmark that measures the ability of AI agents to locate hard-to-find information.https://openai.com/index/browsecomp/