Skip to Main content Skip to Navigation
Journal articles

A Comprehensive Reinforcement Learning Framework for Dialogue Management Optimisation

Lucie Daubigney 1, 2 Matthieu Geist 2 Senthilkumar Chandramohan 2 Olivier Pietquin 2
1 MAIA - Autonomous intelligent machine
Inria Nancy - Grand Est, LORIA - AIS - Department of Complex Systems, Artificial Intelligence & Robotics
Abstract : Reinforcement learning is now an acknowledged approach for optimising the interaction strategy of spoken dialogue systems. If the first considered algorithms were quite basic (like SARSA), recent works concentrated on more sophisticated methods. More attention has been paid to off-policy learning, dealing with the exploration-exploitation dilemma, sample efficiency or handling non-stationarity. New algorithms have been proposed to address these issues and have been applied to dialogue management. However, each algorithm often solves a single issue at a time, while dialogue systems exhibit all the problems at once. In this paper, we propose to apply the Kalman Temporal Differences (KTD) framework to the problem of dialogue strategy optimisation so as to address all these issues in a comprehensive manner with a single framework. Our claims are illustrated by experiments led on two real-world goal-oriented dialogue management frameworks, DIPPER and HIS.
Document type :
Journal articles
Complete list of metadata
Contributor : Sébastien van Luchene Connect in order to contact the contributor
Submitted on : Wednesday, January 9, 2013 - 10:19:58 AM
Last modification on : Thursday, January 20, 2022 - 5:27:08 PM



Lucie Daubigney, Matthieu Geist, Senthilkumar Chandramohan, Olivier Pietquin. A Comprehensive Reinforcement Learning Framework for Dialogue Management Optimisation. IEEE Journal of Selected Topics in Signal Processing, IEEE, 2012, 6 (8), pp.891-902. ⟨10.1109/JSTSP.2012.2229257⟩. ⟨hal-00771646⟩



Les métriques sont temporairement indisponibles