A Comprehensive Reinforcement Learning Framework for Dialogue Management Optimisation

Lucie Daubigney; Matthieu Geist; Senthilkumar Chandramohan; Olivier Pietquin

doi:10.1109/JSTSP.2012.2229257

Article Dans Une Revue IEEE Journal of Selected Topics in Signal Processing Année : 2012

A Comprehensive Reinforcement Learning Framework for Dialogue Management Optimisation

(1, 2) , (2) , (2) , (2)

1
2

Lucie Daubigney

Fonction : Auteur
PersonId : 908990

Autonomous intelligent machine

IMS : Information, Multimodalité & Signal

Matthieu Geist

Fonction : Auteur
PersonId : 6945
IdHAL : matthieu-geist

IMS : Information, Multimodalité & Signal

Senthilkumar Chandramohan

Fonction : Auteur
PersonId : 888330

IMS : Information, Multimodalité & Signal

Olivier Pietquin

Fonction : Auteur
PersonId : 4024
IdHAL : olivier-pietquin
ORCID : 0000-0002-5386-465X
IdRef : 142821861

IMS : Information, Multimodalité & Signal

Résumé

Reinforcement learning is now an acknowledged approach for optimising the interaction strategy of spoken dialogue systems. If the first considered algorithms were quite basic (like SARSA), recent works concentrated on more sophisticated methods. More attention has been paid to off-policy learning, dealing with the exploration-exploitation dilemma, sample efficiency or handling non-stationarity. New algorithms have been proposed to address these issues and have been applied to dialogue management. However, each algorithm often solves a single issue at a time, while dialogue systems exhibit all the problems at once. In this paper, we propose to apply the Kalman Temporal Differences (KTD) framework to the problem of dialogue strategy optimisation so as to address all these issues in a comprehensive manner with a single framework. Our claims are illustrated by experiments led on two real-world goal-oriented dialogue management frameworks, DIPPER and HIS.

Domaines

Apprentissage [cs.LG]

Sébastien Van Luchene : Connectez-vous pour contacter le contributeur

https://centralesupelec.hal.science/hal-00771646

Soumis le : mercredi 9 janvier 2013-10:19:58

Dernière modification le : lundi 11 septembre 2023-17:41:18

Dates et versions

hal-00771646 , version 1 (09-01-2013)

Identifiants

HAL Id : hal-00771646 , version 1
DOI : 10.1109/JSTSP.2012.2229257

Citer

Lucie Daubigney, Matthieu Geist, Senthilkumar Chandramohan, Olivier Pietquin. A Comprehensive Reinforcement Learning Framework for Dialogue Management Optimisation. IEEE Journal of Selected Topics in Signal Processing, 2012, 6 (8), pp.891-902. ⟨10.1109/JSTSP.2012.2229257⟩. ⟨hal-00771646⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

SUPELEC CNRS INRIA SUP_IMS CENTRALESUPELEC UNIV-LORRAINE INRIA2 LORIA LORIA-AIS

234 Consultations

0 Téléchargements

A Comprehensive Reinforcement Learning Framework for Dialogue Management Optimisation

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager