M. Geist, O. Pietquin, and G. Fricout, A Sparse Nonlinear Bayesian Online Kernel Regression, 2008 The Second International Conference on Advanced Engineering Computing and Applications in Sciences, pp.199-204, 2008.
DOI : 10.1109/ADVCOMP.2008.7

URL : https://hal.archives-ouvertes.fr/hal-00327081

C. M. Bishop, Neural Networks for Pattern Recognition, 1995.

B. Scholkopf and A. J. Smola, Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond, 2001.

V. N. Vapnik, Statisical Learning Theory, 1998.

L. Feldkamp and G. Puskorius, A signal processing framework based on dynamic neural networks with application to problems in adaptation, filtering, and classification, Proceedings of the IEEE, pp.2259-2277, 1998.
DOI : 10.1109/5.726790

R. Van-der-merwe, Sigma-Point Kalman Filters for Probabilistic Inference in Dynamic State-Space Models, 2004.

C. M. Bishop and M. E. Tipping, Bayesian Regression and Classification, Advances in Learning Theory: Methods, Models and Applications NATO Science Series III: Computer and Systems Sciences, pp.267-285, 2003.

J. Vermaak, S. J. Godsill, and A. Doucet, Sequential Bayesian Kernel Regression, Advances in Neural Information Processing Systems 16, 2003.

Z. Chen, Bayesian Filtering : From Kalman Filters to Particle Filters, and Beyond, 2003.

Y. Engel, S. Mannor, and R. Meir, The Kernel Recursive Least-Squares Algorithm, IEEE Transactions on Signal Processing, vol.52, issue.8, pp.2275-2285, 1998.
DOI : 10.1109/TSP.2004.830985

M. Geist, O. Pietquin, and G. Fricout, Online Bayesian kernel regression from nonlinear mapping of observations, 2008 IEEE Workshop on Machine Learning for Signal Processing, pp.309-314, 2008.
DOI : 10.1109/MLSP.2008.4685498

URL : https://hal.archives-ouvertes.fr/hal-00335052

R. E. Kalman, A New Approach to Linear Filtering and Prediction Problems, Journal of Basic Engineering, vol.82, issue.1, pp.35-45, 1960.
DOI : 10.1115/1.3662552

S. J. Julier and J. K. Uhlmann, Unscented Filtering and Nonlinear Estimation, Proceedings of the IEEE, pp.401-422, 2004.
DOI : 10.1109/JPROC.2003.823141

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.136.6539

D. Simon, Optimal State Estimation: Kalman, H Infinity, and Nonlinear Approaches, 2006.
DOI : 10.1002/0470045345

A. L. Strehl, L. Li, and M. L. Littman, Incremental model-based learners with formal learning-time guarantees, 22nd Conference on Uncertainty in Artificial Intelligence, pp.485-493, 2006.

I. Szita and A. L?, Kalman Filter Control Embedded into the Reinforcement Learning Framework, Neural Computation, vol.19, issue.5, pp.491-499, 2004.
DOI : 10.1038/nn963

Y. Engel, Algorithms and Representations for Reinforcement Learning, 2005.

C. W. Phua and R. Fitch, Tracking value function dynamics to improve reinforcement learning with piecewise linear function approximation, Proceedings of the 24th international conference on Machine learning, ICML '07, 2007.
DOI : 10.1145/1273496.1273591

D. P. Bertsekas, Dynamic Programming and Optimal Control, Athena Scientific, 1995.

M. A. Carreira-perpinan, Mode-finding for mixtures of Gaussian distributions, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.22, issue.11, pp.1318-1323, 2000.
DOI : 10.1109/34.888716

D. Schneegass, S. Udluft, and T. Martinetz, Kernel Rewards Regression: an Information Efficient Batch Policy Iteration Approach, AIA'06: Proceedings of the 24th IASTED international conference on Artificial intelligence and applications, pp.428-433, 2006.

R. Dearden, N. Friedman, and S. J. Russell, Bayesian Q-learning, Fifteenth National Conference on Artificial Intelligence, pp.761-768, 1998.

A. L. Strehl, L. Li, E. Wiewiora, J. Langford, and M. L. Littman, PAC model-free reinforcement learning, Proceedings of the 23rd international conference on Machine learning , ICML '06, pp.881-888, 2006.
DOI : 10.1145/1143844.1143955

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.120.326