A. P. Ng-a, Apprenticeship learning via inverse reinforcement learning, Proceedings of the 21st International Conference on Machine Learning (ICML), 2004.

A. T. Mckinnon-k and . Thomas-l, On the generation of markov decision processes, Journal of the Operational Research Society, 1995.

A. C. Schaal-s, Robot learning from demonstration, Proceedings of the 14th International Conference on Machine Learning (ICML), 1997.

B. A. and K. J. Peters-j, Relative entropy inverse reinforcement learning, JMLR Workshop and Conference Proceedings, 2011.

K. E. Geist-m and P. B. Pietquin-o, Inverse reinforcement learning through structured classification, Advances in Neural Information Processing Systems 25 (NIPS), 2012.

L. J. Zadrozny-b, Relating reinforcement learning performance to classification performance, Proceedings of the 22nd International Conference on Machine Learning (ICML), 2005.

S. N. and K. K. Ruszcaynski-a, Minimization methods for non-differentiable functions, 1985.

S. U. Schapire-r, A game-theoretic approach to apprenticeship learning, Advances in Neural Information Processing Systems 21 (NIPS), 2008.

S. U. Schapire-r, A reduction from apprenticeship learning to classification, Advances in Neural Information Processing Systems 23 (NIPS), 2010.

T. B. , C. V. , and K. D. Guestrin-c, Learning structured prediction models : A large margin approach, Proceedings of the 22nd International Conference on Machine Learning (ICML), 2005.