Gato

Creator

Creator

Seonglae Cho

Created

Created

2023 Nov 4 8:33

Editor

Editor

Seonglae Cho

Edited

Edited

2025 Nov 7 10:44

Refs

Refs

https://arxiv.org/pdf/2205.06175

A Generalist Agent

Inspired by progress in large-scale language modelling, we apply a similar approach towards building a single generalist agent beyond the realm of text outputs. The agent, which we refer to as Gato, works as a multi-modal, multi-task, multi-embodiment generalist policy. The same network with the same weights can play Atari, caption images, chat, stack blocks with a real robot arm and much more, deciding based on its context whether to output text, joint torques, button presses, or other tokens.

A Generalist Agent

https://deepmind.google/discover/blog/a-generalist-agent/

A Generalist Agent

Recommendations

///////