An algorithmic Survey of Parametric Value Function Approximation

Matthieu Geist; Olivier Pietquin

doi:10.1109/TNNLS.2013.2247418

Journal article. IEEE Transactions on Neural Networks and Learning Systems. Year: 2013

Matthieu Geist

Role: Author
PersonId: 6945
IdHAL: matthieu-geist

IMS : Information, Multimodalité & Signal

Olivier Pietquin

Role: Author
PersonId: 4024
IdHAL: olivier-pietquin
ORCID: 0000-0002-5386-465X
IdRef: 142821861

IMS : Information, Multimodalité & Signal

Abstract

Reinforcement learning is a machine learning answer to the optimal control problem. It consists of learning an optimal control policy through interactions with the system to be controlled, the quality of this policy being quantified by the so-called value function. A recurring subtopic of reinforcement learning is computing an approximation of this value function when the system is too large for an exact representation. This survey reviews state-of-the-art methods for (parametric) value function approximation by grouping them into three main categories: bootstrapping, residual, and projected fixed-point approaches. Related algorithms are derived by considering one of the associated cost functions and a specific minimization method, generally stochastic gradient descent or a recursive least-squares approach.
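
To make the three categories named in the abstract concrete, the following is a minimal sketch, not the authors' algorithms as stated in the paper. It assumes a linear model V(s) = theta . phi(s), a fixed policy, and a hypothetical three-state deterministic cycle with one-hot features. The bootstrapping and residual variants use stochastic gradient steps. The projected fixed-point variant is solved with a plain batch least-squares solve rather than the recursive least-squares form the abstract mentions. All function names, the toy problem, and the step sizes are illustrative assumptions.

```python
def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def sgd_value_estimate(transitions, features, gamma, alpha, sweeps, residual=False):
    """Stochastic-gradient estimate of theta for V(s) = theta . phi(s).

    residual=False: bootstrapping update (TD(0)-style); the target
                    r + gamma * V(s') is treated as a fixed label.
    residual=True:  residual update; the gradient of the squared Bellman
                    residual also flows through V(s').
    """
    theta = [0.0] * len(features[0])
    for _ in range(sweeps):
        for s, r, s_next in transitions:
            phi, phi_next = features[s], features[s_next]
            # Temporal-difference error for this transition.
            delta = r + gamma * dot(theta, phi_next) - dot(theta, phi)
            if residual:
                direction = [p - gamma * q for p, q in zip(phi, phi_next)]
            else:
                direction = phi
            theta = [t + alpha * delta * d for t, d in zip(theta, direction)]
    return theta


def solve(A, b):
    """Solve A x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b_i] for row, b_i in zip(A, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda i: abs(M[i][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for i in range(n):
            if i != col:
                factor = M[i][col] / M[col][col]
                M[i] = [x - factor * y for x, y in zip(M[i], M[col])]
    return [M[i][n] / M[i][i] for i in range(n)]


def lstd(transitions, features, gamma):
    """Projected fixed-point estimate (batch LSTD-style): solve A theta = b with
    A = sum phi(s) (phi(s) - gamma * phi(s'))^T and b = sum phi(s) * r."""
    k = len(features[0])
    A = [[0.0] * k for _ in range(k)]
    b = [0.0] * k
    for s, r, s_next in transitions:
        phi, phi_next = features[s], features[s_next]
        for i in range(k):
            b[i] += phi[i] * r
            for j in range(k):
                A[i][j] += phi[i] * (phi[j] - gamma * phi_next[j])
    return solve(A, b)


# Hypothetical toy problem: deterministic cycle 0 -> 1 -> 2 -> 0,
# with reward 1 received when leaving state 2. Tuples are (s, r, s_next).
transitions = [(0, 0.0, 1), (1, 0.0, 2), (2, 1.0, 0)]
features = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]  # one-hot
gamma = 0.9

theta_td = sgd_value_estimate(transitions, features, gamma, alpha=0.5, sweeps=5000)
theta_res = sgd_value_estimate(transitions, features, gamma, alpha=0.5, sweeps=5000,
                               residual=True)
theta_lstd = lstd(transitions, features, gamma)

print("bootstrapping:", theta_td)
print("residual:     ", theta_res)
print("fixed point:  ", theta_lstd)
```

On this toy problem the features can represent the value function exactly and the transitions are deterministic, so the three estimates should agree up to numerical error. With approximate features or stochastic transitions they generally differ, which is the setting the survey's categories are meant to distinguish.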

Keywords

Reinforcement learning; survey; value function approximation

Domains

Machine Learning [cs.LG]

Main file

vfa_survey.pdf (540.07 KB)

Origin: Files produced by the author(s)

Contributor: Sébastien Van Luchene

https://centralesupelec.hal.science/hal-00869725

Submitted on: Monday, November 6, 2017, 17:33:08

Last modified on: Monday, February 13, 2023, 08:47:47

Dates and versions

hal-00869725 , version 1 (06-11-2017)

Identifiers

HAL Id: hal-00869725, version 1
DOI: 10.1109/TNNLS.2013.2247418

Cite

Matthieu Geist, Olivier Pietquin. An algorithmic Survey of Parametric Value Function Approximation. IEEE Transactions on Neural Networks and Learning Systems, 2013, 24 (6), pp.845-867. ⟨10.1109/TNNLS.2013.2247418⟩. ⟨hal-00869725⟩

Collections

SUPELEC
