Too Long; Didn't Read
Gato from DeepMind was just published! It is a single transformer that can play Atari games, caption images, chat with people, control a real robotic arm, and more! Indeed, it is trained once and uses the same weights to achieve all those tasks. Gato is a multi-modal agent meaning that it can create captions for images or answer questions as a chatbot. It understands words, images, and even physics... learn more in the video transcript below below.