Gato

tags: Transformers, Reinforcement learning
paper: (Reed et al. 2022)

Architecture

A standard decoder-only transformer is preceded by an embedding layer. That layer embeds text tokens and image patches, adding positional encodings and, where available, spatial information such as a patch's position within its image.
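To make the embedding step concrete, here is a minimal NumPy sketch of how text tokens and image patches might be mapped into one sequence of vectors before reaching the transformer. It is an illustration under stated assumptions, not the paper's implementation: the sizes (`D_MODEL`, `VOCAB`, `MAX_LEN`, `GRID`), the linear patch projection, and the learned row/column tables are all hypothetical stand-ins chosen for brevity.

```python
import numpy as np

rng = np.random.default_rng(0)

# All sizes below are hypothetical and far smaller than the real model.
D_MODEL = 64    # embedding width
VOCAB = 1000    # text vocabulary size
PATCH = 16      # patch side length in pixels
MAX_LEN = 256   # maximum sequence length
GRID = 8        # maximum number of patch rows / columns

# Randomly initialised tables standing in for learned parameters.
text_table = rng.normal(0, 0.02, (VOCAB, D_MODEL))
patch_proj = rng.normal(0, 0.02, (PATCH * PATCH * 3, D_MODEL))  # assumed linear projection
pos_table = rng.normal(0, 0.02, (MAX_LEN, D_MODEL))
row_table = rng.normal(0, 0.02, (GRID, D_MODEL))
col_table = rng.normal(0, 0.02, (GRID, D_MODEL))


def embed_text(token_ids):
    """Look up one embedding vector per text token id."""
    return text_table[np.asarray(token_ids)]


def embed_image(image):
    """Split an (H, W, 3) image into PATCH x PATCH patches and embed each one.

    Spatial information is added as learned row and column embeddings.
    """
    h, w, c = image.shape
    rows, cols = h // PATCH, w // PATCH
    patches = image[: rows * PATCH, : cols * PATCH]
    patches = patches.reshape(rows, PATCH, cols, PATCH, c)
    patches = patches.transpose(0, 2, 1, 3, 4).reshape(rows * cols, -1)
    emb = patches @ patch_proj
    r, col = np.divmod(np.arange(rows * cols), cols)
    return emb + row_table[r] + col_table[col]


def embed_sequence(token_ids, image):
    """Concatenate text and image embeddings, then add sequence positions."""
    seq = np.concatenate([embed_text(token_ids), embed_image(image)], axis=0)
    return seq + pos_table[: len(seq)]


# Three text tokens followed by a 32x48 image (2 x 3 = 6 patches).
x = embed_sequence([1, 2, 3], np.zeros((32, 48, 3)))
print(x.shape)
```

The resulting array has one row per token or patch and would be passed to the decoder-only transformer as its input sequence.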

Parameter count

1.2B
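As a rough sanity check on this figure, the weight matrices of a standard transformer stack can be counted directly. The sketch below assumes 24 layers, a model width of 2048, and a feedforward hidden size of 8192. These values are assumptions not stated in this note and should be checked against the paper's hyperparameter table. Biases, layer norms, and embedding tables are ignored.

```python
def approx_transformer_params(n_layers, d_model, d_ff):
    """Approximate weight count of a transformer stack (matrices only)."""
    attn = 4 * d_model * d_model  # query, key, value and output projections
    ffn = 2 * d_model * d_ff      # the two feedforward matrices
    return n_layers * (attn + ffn)


# Hyperparameters are assumed, not taken from this note.
estimate = approx_transformer_params(n_layers=24, d_model=2048, d_ff=8192)
print(f"{estimate / 1e9:.2f}B")  # prints 1.21B
```

Under these assumptions the estimate lands close to the 1.2B figure, which suggests most of the parameters sit in the transformer blocks rather than in the embedding layer.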

Bibliography

Scott Reed, Konrad Zolna, Emilio Parisotto, Sergio Gomez Colmenarejo, Alexander Novikov, Gabriel Barth-Maron, Mai Gimenez, et al. May 12, 2022. "A Generalist Agent". https://arxiv.org/abs/2205.06175v2.
