E. Levin, R. Pieraccini, and W. Eckert, Learning dialogue strategies within the Markov decision process framework, 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings, 1997.
DOI : 10.1109/ASRU.1997.658989

S. Singh, M. Kearns, D. Litman, and M. Walker, Reinforcement learning for spoken dialogue systems, Proc. NIPS'99, 1999.

K. Scheffler and S. Young, Corpus-based dialogue simulation for automatic strategy learning and evaluation, Proc. NAACL Workshop on Adaptation in Dialogue Systems, 2001.

O. Pietquin and T. Dutoit, A probabilistic framework for dialog simulation and optimal strategy learning, IEEE Transactions on Audio, Speech and Language Processing, vol.14, issue.2, pp.589-599, 2006.
DOI : 10.1109/TSA.2005.855836

URL : https://hal.archives-ouvertes.fr/hal-00207952

M. Frampton and O. Lemon, Learning more effective dialogue strategies using limited dialogue move features, Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the ACL , ACL '06, 2006.
DOI : 10.3115/1220175.1220199

J. Henderson, O. Lemon, and K. Georgila, Hybrid Reinforcement/Supervised Learning for Dialogue Policies from COMMUNICATOR data, IJCAI workshop on Knowledge and Reasoning in Practical Dialogue Systems, 2005.

J. Williams, P. Poupart, and S. Young, Partially Observable Markov Decision Processes with Continuous Observations for Dialogue Management, Proceedings of the SigDial Workshop (SigDial'06), 2005.
DOI : 10.1007/978-1-4020-6821-8_8

S. Young, USING POMDPS FOR DIALOG MANAGEMENT, 2006 IEEE Spoken Language Technology Workshop, 2006.
DOI : 10.1109/SLT.2006.326785

J. Williams and S. Young, Partially observable Markov decision processes for spoken dialog systems, Computer Speech & Language, vol.21, issue.2, pp.231-422, 2007.
DOI : 10.1016/j.csl.2006.06.008

T. Paek and D. M. Chickering, The Markov Assumption in spoken dialogue management, Proceedings of the 6th SIGdial Workshop on Discourse and Dialogue, 2005.

S. Singh, D. Litman, M. Kearns, and M. Walker, Optimizing dialogue management with reinforcement learning: Experiments with the NJFun system, Journal of Artificial Intelligence Research, 2002.

O. Lemon, X. Liu, D. Shapiro, and C. Tollander, Hierarchical Reinforcement Learning of Dialogue Policies in a development environment for dialogue systems: REALL-DUDE, Proceedings of Brandial, the 10th SemDial Workshop on the Semantics and Pragmatics of Dialogue, 2006.

J. Williams and S. Young, Scaling up POMDPs for Dialog Management: The ``Summary POMDP'' Method, IEEE Workshop on Automatic Speech Recognition and Understanding, 2005., 2005.
DOI : 10.1109/ASRU.2005.1566498

M. A. Walker, R. J. Passonneau, and J. E. Boland, Quantitative and qualitative evaluation of Darpa Communicator spoken dialogue systems, Proceedings of the 39th Annual Meeting on Association for Computational Linguistics , ACL '01, pp.515-522, 2001.
DOI : 10.3115/1073012.1073078

K. Georgila, O. Lemon, and J. Henderson, Automatic annotation of COMMUNICATOR dialogue data for learning dialogue strategies and user simulations, Ninth Workshop on the Semantics and Pragmatics of Dialogue (SEMDIAL: DIALOR), 2005.

K. Georgila, J. Henderson, and O. Lemon, User simulation for spoken dialogue systems: Learning and evaluation, Proceedings of Interspeech/ICSLP, 2006.

G. Andreani, G. Di-fabbrizio, M. Gilbert, D. Gillick, D. Hakkani-tür et al., LET'S DISCOH: COLLECTING AN ANNOTATED OPEN CORPUSWITH DIALOGUE ACTS AND REWARD SIGNALS FOR NATURAL LANGUAGE HELPDESKS, 2006 IEEE Spoken Language Technology Workshop, 2006.
DOI : 10.1109/SLT.2006.326794

E. Levin, R. Pieraccini, and W. Eckert, A stochastic model of human-machine interaction for learning dialog strategies, IEEE Transactions on Speech and Audio Processing, vol.8, issue.1, pp.11-23, 2000.
DOI : 10.1109/89.817450

R. López-cózar, A. De-la-torre, J. Segura, and A. Rubio, Assessment of dialogue systems by means of a new simulation technique, Speech Communication, vol.40, issue.3, pp.387-407, 2003.
DOI : 10.1016/S0167-6393(02)00126-7

O. Pietquin, A Probabilistic Description of Man-Machine Spoken Communication, 2005 IEEE International Conference on Multimedia and Expo, 2005.
DOI : 10.1109/ICME.2005.1521447

K. Georgila, J. Henderson, and O. Lemon, Learning User Simulations for Information State Update Dialogue Systems, Eurospeech, 2005.

O. Lemon, K. Georgila, and J. Henderson, EVALUATING EFFECTIVENESS AND PORTABILITY OF REINFORCEMENT LEARNED DIALOGUE STRATEGIES WITH REAL USERS: THE TALK TOWNINFO EVALUATION, 2006 IEEE Spoken Language Technology Workshop, 2006.
DOI : 10.1109/SLT.2006.326774

V. Rieser and O. Lemon, Cluster-based user simulations for learning dialogue strategies and the super evaluation metric, Proceedings of Interspeech/ICSLP, 2006.

J. Schatzmann, K. Weilhammer, M. Stuttle, and S. Young, A survey of statistical user simulation techniques for reinforcement-learning of dialogue management strategies, The Knowledge Engineering Review, vol.21, issue.02, pp.97-126, 2006.
DOI : 10.1017/S0269888906000944

K. Georgila, J. Henderson, and O. Lemon, User simulation for spoken dialogue systems: Learning and evaluation, Proc. Interspeech'06, 2006.

O. Lemon, K. Georgila, J. Henderson, M. Gabsdil, I. Meza-ruiz et al., D4.1: Integration of Learning and Adaptivity with the ISU approach Paradise: A framework for evaluating spoken dialogue agents, Proc. of the 35th Annual Meeting of the Association for Computational Linguistics, pp.271-280, 1997.

B. Zhang, Q. Cai, J. Mao, and B. Guo, Planning and acting under uncertainty: A new model for spoken dialogue system, Proc 17th Conf on Uncertainty in AI, 2001.

J. Williams, P. Poupart, and S. Young, Partially Observable Markov Decision Processes with Continuous Observations for Dialogue Management, Proceedings of the 6th SigDial Workshop on Discourse and Dialogue, 2005.
DOI : 10.1007/978-1-4020-6821-8_8

J. Williams, P. Poupart, and S. Young, Factored Partially Observable Markov Decision Processes for Dialogue Management, 4th Workshop on Knowledge and Reasoning in Practical Dialog Systems, International Joint Conference on Artificial Intelligence (IJCAI), 2005.

W. Eckert, E. Levin, and R. Pieraccini, User modeling for spoken dialogue system evaluation, 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings, 1997.
DOI : 10.1109/ASRU.1997.658991

O. Pietquin and S. Renals, Asr system modeling for automatic evaluation and optimization of dialogue systems, Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2002.

O. Pietquin, Consistent Goal-Directed User Model for Realisitc Man-Machine Task-Oriented Spoken Dialogue Simulation, 2006 IEEE International Conference on Multimedia and Expo, 2006.
DOI : 10.1109/ICME.2006.262563

O. Pietquin and T. Dutoit, Dynamic Bayesian Networks for NLU Simulation with Applications to Dialog Optimal Strategy Learning, 2006 IEEE International Conference on Acoustics Speed and Signal Processing Proceedings, 2006.
DOI : 10.1109/ICASSP.2006.1659954

H. Cuayáhuitl, S. Renals, O. Lemon, and H. Shimodaira, Human-computer dialogue simulation using hidden Markov models, IEEE Workshop on Automatic Speech Recognition and Understanding, 2005., 2005.
DOI : 10.1109/ASRU.2005.1566485

J. Schatzmann, K. Georgila, and S. Young, Quantitative evaluation of user simulation techniques for spoken dialogue systems, Proc. SIGdial'05, 2005.

R. Jonson, DIALOGUE CONTEXT-BASED RE-RANKING OF ASR HYPOTHESES, 2006 IEEE Spoken Language Technology Workshop, 2006.
DOI : 10.1109/SLT.2006.326845

M. Gabsdil and O. Lemon, Combining acoustic and pragmatic features to predict recognition performance in spoken dialogue systems, Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics , ACL '04, pp.344-351, 2004.
DOI : 10.3115/1218955.1218999

A. Chotimongkol and A. I. Rudnicky, Nbest Speech Hypotheses Reordering Using Linear Regression, Proceedings of EuroSpeech 2001, pp.1829-1832, 2001.

M. Walker, O. Rambow, and M. Rogati, SPoT, Second meeting of the North American Chapter of the Association for Computational Linguistics on Language technologies 2001 , NAACL '01, 2001.
DOI : 10.3115/1073336.1073339

A. Stent, R. Prasad, and M. Walker, Trainable sentence planning for complex information presentation in spoken dialog systems, Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics , ACL '04, 2004.
DOI : 10.3115/1218955.1218966

Y. He and S. Young, Semantic processing using the Hidden Vector State model, Computer Speech & Language, vol.19, issue.1, pp.85-106, 2005.
DOI : 10.1016/j.csl.2004.03.001