Bilal Piot, Matthieu Geist, Olivier Pietquin. Learning from demonstrations: Is it worth estimating a reward function?.
1st Multidisciplinary Conference on Reinforcement Learning and Decision Making (RLDM 2013), Oct 2013, Princeton, New Jersey, United States.
⟨hal-00916938⟩