H. Robbins and S. Monro, A Stochastic Approximation Method, The Annals of Mathematical Statistics, vol.22, issue.3, pp.400-407, 1951.
DOI : 10.1214/aoms/1177729586

J. Kiefer and J. Wolfowitz, Stochastic Estimation of the Maximum of a Regression Function, The Annals of Mathematical Statistics, vol.23, issue.3, pp.462-466, 1952.
DOI : 10.1214/aoms/1177729392

H. Tembine, J.-Y. Le Boudec, R. El-Azouzi, and E. Altman, Mean field asymptotics of Markov Decision Evolutionary Games and teams, 2009 International Conference on Game Theory for Networks, 2009.
DOI : 10.1109/GAMENETS.2009.5137395
URL : https://hal.archives-ouvertes.fr/hal-01321123

H. Tembine, Dynamic Robust Games in MIMO Systems, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), vol.41, issue.4, pp.990-1002, 2011.
DOI : 10.1109/TSMCB.2010.2102751
URL : https://hal.archives-ouvertes.fr/hal-00632034

H. Tembine, Distributed strategic learning for wireless engineers, Notes, vol.440, 2010.
DOI : 10.1201/b11896
URL : https://hal.archives-ouvertes.fr/hal-00752209