- tags
- Transformers, Reinforcement learning
- paper
- (Reed et al. 2022)
Architecture
A standard decoder-only transformer is preceded by an embedding layer that embeds text and images with positional encoding and spatial information if available.
Parameter count
1.2B
Bibliography
- Scott Reed, Konrad Zolna, Emilio Parisotto, Sergio Gomez Colmenarejo, Alexander Novikov, Gabriel Barth-Maron, Mai Gimenez, et al.. . "A Generalist Agent". https://arxiv.org/abs/2205.06175v2.