This model closely follows Gopher's architecture, with changes that make it smaller and more compute-efficient.
- Jordan Hoffmann, Sebastian Borgeaud, Arthur Mensch, Elena Buchatskaya, Trevor Cai, Eliza Rutherford, Diego de Las Casas, et al. "Training Compute-Optimal Large Language Models". arXiv. DOI.