GPTInstruct

tags: Transformers, GPT, NLP
paper: (Ouyang et al. 2022)

Architecture

This model starts off from a pretrained GPT-3. Reward modeling is added with Reinforcement learning.

Parameter count

175B

Bibliography

Long Ouyang, Jeff Wu, Xu Jiang, Diogo Almeida, Carroll L. Wainwright, Pamela Mishkin, Chong Zhang, et al.. March 4, 2022. "Training Language Models to Follow Instructions with Human Feedback". arXiv. DOI.

Links to this note

ChatGPT

Last changed 26/07/2022 | authored by Hugo Cisneros

Comments

← Back to Notes