Reinforcement learning with human feedback tags Reinforcement learning, NLP Links to this note ChatGPT Sparrow Last changed 13/02/2023 | authored by Hugo Cisneros