Online adaptation of dialogue systems

Abstract : This document is a report on online adaptation of dialogue systems (deliverable 1.5), due at month 36 of the CLASSIC project. It consists of four contributions. First, it demonstrates fast policy adaptation using the GP-SARSA algo- rithm applied to Hidden Information State (HIS) dialogue manager. Second, it describes online adapta- tion of dialogue model parameters using the NBC algorithm within the Belief Update of Dialogue State (BUDS) dialogue manager. Third, it proposes the Kalman Temporal Differences algorithm for manage- ment of uncertainty in estimate of the optimal value function. Finally, it details optimisation techniques for industrial spoken dialogue systems based on compliance-based reinforcement learning. Work related to this deliverable has been published in Gaˇsi'c et al. (2010), Jurˇc'ıˇcek et al. (2010b), Laroche et al. (2010b), and Geist and Pietquin (2010, 2011).
Document type :
Reports
Complete list of metadatas

https://hal-supelec.archives-ouvertes.fr/hal-00652841
Contributor : Sébastien van Luchene <>
Submitted on : Friday, December 16, 2011 - 2:17:47 PM
Last modification on : Wednesday, July 31, 2019 - 4:18:03 PM

Identifiers

  • HAL Id : hal-00652841, version 1

Collections

Citation

Filip Jurcicek, Milica Gašić, Steve Young, Ghislain Putois, Romain Laroche, et al.. Online adaptation of dialogue systems. 2011. ⟨hal-00652841⟩

Share

Metrics

Record views

207