Gaussian Process Dynamic Programming, Neurocomput, vol.72, pp.7-9, 2009. ,
User modeling for spoken dialogue system evaluation, Proc. ASRU'97, 1997. ,
Reinforcement Learning with Gaussian Processes, Proceedings of the International Conference on Machine Learning (ICML 05), 2005. ,
Gaussian processes for fast policy optimisation of POMDP-based dialogue managers, Proceedings of the 11th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pp.201-204, 2010. ,
Kalman Temporal Differences, Journal of Artificial Intelligence Research, vol.39, pp.483-532, 2010. ,
Statistically Linearized Least-Squares Temporal Differences, Proceedings of the IEEE International Conference on Ultra Modern Control systems Moscow (Russia) : IEEE. 8 pages, 2010. ,
Managing Uncertainty within the KTD Framework, Proceedings of the Workshop on Active Learning and Experimental Design Journal of Machine Learning Research Conference and Workshop Proceedings, 2011. ,
Near-Bayesian Exploration in Polynomial Time, international conference on Machine learning (ICML 09), 2009. ,
A stochastic model of human-machine interaction for learning dialog strategies, IEEE Transactions on Speech and Audio Processing, vol.8, issue.1, pp.11-23, 2000. ,
A probabilistic framework for dialog simulation and optimal strategy learning, IEEE Transactions on Audio, Speech and Language Processing, vol.14, issue.2, pp.589-599, 2006. ,
Sample-Efficient Batch Reinforcement Learning for Dialogue Management Optimization, ACM Transactions on Speech and Language Processing, 2011. ,
Gaussian Processes for Machine Learning, 2006. ,
Effects of the user model on simulation-based learning of dialogue strategies, Proceedings of ASRU'05, 2005. ,
A survey of statistical user simulation techniques for reinforcement-learning of dialogue management strategies, The Knowledge Engineering Review, vol.21, issue.2, pp.97-126, 2006. ,
Reinforcement learning for spoken dialogue systems, Proc. NIPS'99, 1999. ,
An Analysis of Model-Based Interval Estimation for Markov Decision Processes, Journal of Computer and System Sciences, 2006. ,
Reinforcement Learning, 1998. ,
DOI : 10.1016/B978-012526430-3/50003-9