D. M. Rasmussen-c and . Peters-j, Gaussian Process Dynamic Programming, Neurocomput, vol.72, pp.7-9, 2009.

E. W. and L. E. Pieraccini-r, User modeling for spoken dialogue system evaluation, Proc. ASRU'97, 1997.

E. Y. and M. S. Meir, Reinforcement Learning with Gaussian Processes, Proceedings of the International Conference on Machine Learning (ICML 05), 2005.

G. Ga?i´c, M. Ju?-cí?cí?, C. F. , K. S. Mairesse-f, Y. K. Thomson-b et al., Gaussian processes for fast policy optimisation of POMDP-based dialogue managers, Proceedings of the 11th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pp.201-204, 2010.

G. M. Pietquin-o, Kalman Temporal Differences, Journal of Artificial Intelligence Research, vol.39, pp.483-532, 2010.

G. M. Pietquin-o, Statistically Linearized Least-Squares Temporal Differences, Proceedings of the IEEE International Conference on Ultra Modern Control systems Moscow (Russia) : IEEE. 8 pages, 2010.

G. M. Pietquin-o, Managing Uncertainty within the KTD Framework, Proceedings of the Workshop on Active Learning and Experimental Design Journal of Machine Learning Research Conference and Workshop Proceedings, 2011.

K. J. Ng-a, Near-Bayesian Exploration in Polynomial Time, international conference on Machine learning (ICML 09), 2009.

L. E. and P. R. Eckert-w, A stochastic model of human-machine interaction for learning dialog strategies, IEEE Transactions on Speech and Audio Processing, vol.8, issue.1, pp.11-23, 2000.

P. O. Dutoit-t, A probabilistic framework for dialog simulation and optimal strategy learning, IEEE Transactions on Audio, Speech and Language Processing, vol.14, issue.2, pp.589-599, 2006.

P. O. Geist-m and C. S. Frezza-buet, Sample-Efficient Batch Reinforcement Learning for Dialogue Management Optimization, ACM Transactions on Speech and Language Processing, 2011.

R. C. Williams-c, Gaussian Processes for Machine Learning, 2006.

S. J. Stuttle-m and W. K. Young, Effects of the user model on simulation-based learning of dialogue strategies, Proceedings of ASRU'05, 2005.

S. J. Weilhammer-k and S. M. Young, A survey of statistical user simulation techniques for reinforcement-learning of dialogue management strategies, The Knowledge Engineering Review, vol.21, issue.2, pp.97-126, 2006.

S. S. Kearns-m and L. D. Walker-m, Reinforcement learning for spoken dialogue systems, Proc. NIPS'99, 1999.

S. A. Littman-m, An Analysis of Model-Based Interval Estimation for Markov Decision Processes, Journal of Computer and System Sciences, 2006.

S. R. Barto-a, Reinforcement Learning, 1998.
DOI : 10.1016/B978-012526430-3/50003-9