P. Abbeel and A. Y. Ng, Apprenticeship learning via inverse reinforcement learning, Twenty-first international conference on Machine learning, ICML '04, 2004.
DOI : 10.1145/1015330.1015430
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.2.92

D. P. Bertsekas and J. N. Tsitsiklis, Neuro-Dynamic Programming (Optimization and Neural Computation Series, 3), Athena Scientific, 1996.

A. Boularias, J. Kober, and J. Peters, Relative entropy inverse reinforcement learning, JMLR Workshop and Conference Proceedings, 2011.

S. J. Bradtke and A. G. Barto, Linear Least-Squares algorithms for temporal difference learning, Machine Learning, pp.33-57, 1996.

K. Dvijotham and E. Todorov, Inverse Optimal Control with Linearly-Solvable MDPs, Proceedings of the 27th International Conference on Machine Learning (ICML), 2010.

Y. Guermeur, VC theory of large margin multi-category classifiers, Journal of Machine Learning Research, vol.8, pp.2551-2594, 2007.

E. Klein, M. Geist, and O. Pietquin, Batch, Off-Policy and Model-Free Apprenticeship Learning, Proceedings of the European Workshop on Reinforcement Learning (EWRL), 2011.
DOI : 10.1007/978-3-642-29946-9_28
URL : https://hal.archives-ouvertes.fr/hal-00660623

F. S. Melo and M. Lopes, Learning from demonstration using MDP induced metrics, Proceedings of the European Conference on Machine Learning (ECML), 2010.

R. Munos, Performance Bounds in $L_p$-norm for Approximate Value Iteration, SIAM Journal on Control and Optimization, vol.46, issue.2, pp.541-561, 2007.
DOI : 10.1137/040614384

G. Neu and C. Szepesvari, Training parsers by inverse reinforcement learning, Machine Learning, pp.303-337, 2009.
DOI : 10.1007/s10994-009-5110-1
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.150.3712

A. Y. Ng and S. Russell, Algorithms for Inverse Reinforcement Learning, Proceedings of the 17th International Conference on Machine Learning (ICML), 2000.

M. L. Puterman, Markov Decision Processes: Discrete Stochastic Dynamic Programming, 1994.

N. Ratliff, J. A. Bagnell, and M. Zinkevich, Maximum margin planning, Proceedings of the 23rd international conference on Machine learning, ICML '06, 2006.
DOI : 10.1145/1143844.1143936

S. Russell, Learning agents for uncertain environments (extended abstract), Proceedings of the eleventh annual conference on Computational learning theory, COLT '98, 1998.
DOI : 10.1145/279943.279964

R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction, 1998.

U. Syed and R. Schapire, A game-theoretic approach to apprenticeship learning, Advances in Neural Information Processing Systems 20 (NIPS), 2008.

C. Szepesvári, Algorithms for Reinforcement Learning, Synthesis Lectures on Artificial Intelligence and Machine Learning, vol.4, issue.1, 2010.
DOI : 10.2200/S00268ED1V01Y201005AIM009

B. Taskar, V. Chatalbashev, D. Koller, and C. Guestrin, Learning structured prediction models, Proceedings of the 22nd international conference on Machine learning, ICML '05, 2005.
DOI : 10.1145/1102351.1102464