Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Object/AI Agent/AI Agent Benchmark/
OSWorld
Search

OSWorld

Creator
Creator
Seonglae Cho
Created
Created
2024 May 8 17:22
Editor
Editor
Seonglae Cho
Edited
Edited
2025 Mar 16 18:8
Refs
Refs
Multimodal AI
OSWorld
xlang-ai • Updated 2024 May 8 15:55
App Agent
 
 
 
 
 
 
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
https://os-world.github.io/
Pocketmon Red
Claude's extended thinking
Discussing Claude's new thought process
Claude's extended thinking
https://www.anthropic.com/research/visible-extended-thinking
Claude's extended thinking
 
 

Recommendations

Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Object/AI Agent/AI Agent Benchmark/
OSWorld
Copyright Seonglae Cho