J. Allen, Natural Language Understanding, 1987.

J. Carletta, Assessing Agreement on Classification Tasks: the Kappa Statistic, Computational Linguistics, vol.22, issue.2, pp.249-254, 1996.

H. Cuayáhuitl, S. Renals, O. Lemon, and H. Shimodaira, Hierarchical Dialogue Optimization Using Semi-Markov Decision Processes, Proceedings of International Conference on Speech Communication (Interspeech'07), Antwerp (Belgium), 2007.

T. Dutoit, An Introduction to Text-To-Speech Synthesis, 1997.
DOI : 10.1007/978-94-011-5730-8

M. Frampton and O. Lemon, Learning more effective dialogue strategies using limited dialogue move features, Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the ACL, ACL '06, 2006.
DOI : 10.3115/1220175.1220199

K. Georgila, J. Henderson, and O. Lemon, Learning User Simulations for Information State Update Dialogue Systems, Proceedings of International Conference on Speech Communication (Interspeech'05), 2005.

K. Georgila, J. Henderson, and O. Lemon, User simulation for spoken dialogue systems: Learning and evaluation, Proceedings of International Conference on Speech Communication (Interspeech'06), 2006.

A. Graesser, K. Vanlehn, C. Rosé, P. Jordan, and D. Harter, Intelligent Tutoring Systems with Conversational Dialogue, pp.39-52, 2001.

J. Henderson, O. Lemon, and K. Georgila, Hybrid Reinforcement/Supervised Learning for Dialogue Policies from COMMUNICATOR data, Proceedings of the IJCAI workshop on Knowledge and Reasoning in Practical Dialogue Systems, pp.68-75, 2005.

O. Lemon and O. Pietquin, Machine learning for spoken dialogue systems, Proceedings of the European Conference on Speech Communication and Technologies (Interspeech'07), 2007.
URL : https://hal.archives-ouvertes.fr/hal-00216035

E. Levin, R. Pieraccini, and W. Eckert, Learning dialogue strategies within the Markov decision process framework, 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings, 1997.
DOI : 10.1109/ASRU.1997.658989

E. Levin, R. Pieraccini, and W. Eckert, A stochastic model of human-machine interaction for learning dialog strategies, IEEE Transactions on Speech and Audio Processing, pp.11-23, 2000.
DOI : 10.1109/89.817450

R. Lopez-Cozar, A. De la Torre, J. Segura, and A. Rubio, Assessment of dialogue systems by means of a new simulation technique, Speech Communication, pp.387-407, 2003.

O. Pietquin and S. Renals, Asr system modelling for automatic evaluation and optimization of dialogue systems, Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2002.

O. Pietquin and T. Dutoit, Aided design of finite-state dialogue management systems, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698), 2003.
DOI : 10.1109/ICME.2003.1221369

O. Pietquin, A Framework for Unsupervised Learning of Dialogue Strategies, pp.2-930344, 2004.

O. Pietquin, A Probabilistic Description of Man-Machine Spoken Communication, 2005 IEEE International Conference on Multimedia and Expo, 2005.
DOI : 10.1109/ICME.2005.1521447

O. Pietquin and R. Beaufort, Comparing ASR Modeling Methods for Spoken Dialogue Simulation and Optimal Strategy Learning, Proceedings of Interspeech, 2005.

O. Pietquin, Consistent goal-directed user model for realistic man-machine task-oriented spoken dialogue simulation, Proceedings of the IEEE International Conference on Multimedia and Expo (ICME'06), 2006.
URL : https://hal.archives-ouvertes.fr/hal-00215968

O. Pietquin, Machine learning for spoken dialogue management: an experiment with speech-based database querying, in Artificial Intelligence: Methodology, Systems and Applications, Lecture Notes in Artificial Intelligence, vol.4183, pp.172-180, 2006.

O. Pietquin and T. Dutoit, A probabilistic framework for dialog simulation and optimal strategy learning, IEEE Transactions on Audio, Speech and Language Processing, pp.589-599, 2006.
DOI : 10.1109/TSA.2005.855836
URL : https://hal.archives-ouvertes.fr/hal-00207952

O. Pietquin and T. Dutoit, Dynamic Bayesian Networks for NLU Simulation with Applications to Dialog Optimal Strategy Learning, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, 2006.
DOI : 10.1109/ICASSP.2006.1659954

P. Poupart, J. Williams, and S. Young, Partially observable Markov decision processes with continuous observations for dialogue management, Proceedings of the SigDial Workshop (SigDial'06), 2006.

L. Rabiner and B. H. Juang, Fundamentals of Speech Recognition, Signal Processing Series, 1993.

E. Reiter and R. Dale, Building Natural Language Generation Systems, 2000.
DOI : 10.1017/CBO9780511519857

V. Rieser and O. Lemon, Cluster-based user simulations for learning dialogue strategies and the super evaluation metric, Proceedings of Interspeech/ICSLP, 2006.

J. Schatzmann, K. Georgila, and S. Young, Quantitative evaluation of user simulation techniques for spoken dialogue systems, Proceedings of the SIGdial'05 Workshop, 2005.

J. Schatzmann, K. Weilhammer, M. Stuttle, and S. Young, A survey of statistical user simulation techniques for reinforcement-learning of dialogue management strategies, The Knowledge Engineering Review, vol.21, issue.02, pp.97-126, 2006.
DOI : 10.1017/S0269888906000944

J. Schatzmann, B. Thomson, and S. Young, Statistical User Simulation with a Hidden Agenda, Proceedings of the 8th SigDIAL Workshop, 2007.

K. Scheffler and S. Young, Corpus-based dialogue simulation for automatic strategy learning and evaluation, Proc. NAACL Workshop on Adaptation in Dialogue Systems, 2001.

S. Singh, M. Kearns, D. Litman, and M. Walker, Reinforcement learning for spoken dialogue systems, Proceedings of NIPS'99, 1999.

S. Young, Using POMDPs for Dialog Management, 2006 IEEE Spoken Language Technology Workshop, 2006.
DOI : 10.1109/SLT.2006.326785

S. Young, J. Schatzmann, K. Weilhammer, and H. Ye, The Hidden Information State Approach to Dialog Management, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07, 2007.
DOI : 10.1109/ICASSP.2007.367185

M. Walker, D. Litman, C. Kamm, and A. Abella, PARADISE: A Framework for Evaluating Spoken Dialogue Agents, Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics, pp.271-280, 1997.

C. Watkins, Learning from delayed rewards, 1989.

J. Williams and S. Young, Scaling up POMDPs for dialogue management: the summary POMDP method, Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding (ASRU'05), 2005.

J. Williams, P. Poupart, and S. Young, Partially Observable Markov Decision Processes with Continuous Observations for Dialogue Management, Proceedings of the 6th SigDial Workshop, 2005.
DOI : 10.1007/978-1-4020-6821-8_8