Online adaptation of dialogue systems - Archive ouverte HAL Accéder directement au contenu
Rapport Année : 2011

Online adaptation of dialogue systems

Filip Jurcicek
  • Fonction : Auteur
Steve Young
  • Fonction : Auteur
  • PersonId : 873407
Ghislain Putois
  • Fonction : Auteur
  • PersonId : 865351
Romain Laroche
  • Fonction : Auteur

Résumé

This document is a report on online adaptation of dialogue systems (deliverable 1.5), due at month 36 of the CLASSIC project. It consists of four contributions. First, it demonstrates fast policy adaptation using the GP-SARSA algo- rithm applied to Hidden Information State (HIS) dialogue manager. Second, it describes online adapta- tion of dialogue model parameters using the NBC algorithm within the Belief Update of Dialogue State (BUDS) dialogue manager. Third, it proposes the Kalman Temporal Differences algorithm for manage- ment of uncertainty in estimate of the optimal value function. Finally, it details optimisation techniques for industrial spoken dialogue systems based on compliance-based reinforcement learning. Work related to this deliverable has been published in Gaˇsi'c et al. (2010), Jurˇc'ıˇcek et al. (2010b), Laroche et al. (2010b), and Geist and Pietquin (2010, 2011).
Fichier non déposé

Dates et versions

hal-00652841 , version 1 (16-12-2011)

Identifiants

  • HAL Id : hal-00652841 , version 1

Citer

Filip Jurcicek, Milica Gašić, Steve Young, Ghislain Putois, Romain Laroche, et al.. Online adaptation of dialogue systems. 2011. ⟨hal-00652841⟩
142 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More