Inverse reinforcement learning

tags: Reinforcement learning, Reinforcement learning with human feedback, Reward shaping

Recovering an unknown reward function from expert demonstrations such that the demonstrated behavior is optimal under it.

It is a classical setting extended in modern post-training via RLHF, adversarial IRL, and demonstration-conditioned implicit rewards.

Links to this note

Knowledge Base Index
Notes on: Reinforcement Learning via Self-Distillation by Hübotter, J., Lübeck, F., Behric, L., Baumann, A., Bagatella, M., Marta, D., Hakimi, I., Shenfeld, I., Kleine Buening, T., Guestrin, C. & Krause, A. (2026)
Notes on: Self-Distillation Enables Continual Learning by Idan Shenfeld, Mehul Damani, Jonas Hübotter, Pulkit Agrawal (2026)

Last changed 2026.04.26 | authored by Hugo Cisneros

Comments

Loading comments...

Back to Notes