Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Object/Multimodal AI/Vision Language Model/
Gato
Search

Gato

Creator
Creator
Seonglae Cho
Created
Created
2023 Jul 15 6:37
Editor
Editor
Seonglae Cho
Edited
Edited
2024 Oct 20 23:48
Refs
Refs

A Generalist Agent

 
 
 
 
 
 
A Generalist Agent
Inspired by progress in large-scale language modelling, we apply a similar approach towards building a single generalist agent beyond the realm of text outputs. The agent, which we refer to as Gato, works as a multi-modal, multi-task, multi-embodiment generalist policy. The same network with the same weights can play Atari, caption images, chat, stack blocks with a real robot arm and much more, deciding based on its context whether to output text, joint torques, button presses, or other tokens.
A Generalist Agent
https://www.deepmind.com/blog/a-generalist-agent
 
 

Recommendations

Texonom
Texonom
/
Engineering
Engineering
/Data Engineering/Artificial Intelligence/AI Object/Multimodal AI/Vision Language Model/
Gato
Copyright Seonglae Cho