Reinforcement Learning Real Experiments for Opportunistic Spectrum Access

Abstract : This paper proposes the analysis of experimental results obtained on the first worldwide implementation on real signals of reinforcement learning algorithms used for cognitive radio decision making in an opportunistic spectrum access (OSA) context. Two algorithms, able to act in highly unpredictable conditions, are compared: UCB (Upper Confidence Bound) and WD (Weight Driven). The OSA scenario is played in lab conditions around a couple of USRP N210 platforms. One platform is playing the role of the primary network and generates signals in a set of frequency bands with a pre-defined mean vacancy probability for each. An OFDM modulation scheme is used here, generated with GRC environment (GNU Radio Companion). Another platform runs Simulink in order to play the role of the secondary user (SU) cognitive engine that learns. The experimental results shown in this paper illustrate how the SU learns and predicts the channels' vacancy thanks to UCB and WD algorithms. They validate in real conditions machine learning algorithms capabilities for opportunistic spectrum access context, in terms of learning speed and convergence accuracy. They enable also to compare UCB and WD performance.
Type de document :
Communication dans un congrès
WSR'14, Mar 2014, Karlsruhe, Germany. 10 p., 2014
Liste complète des métadonnées

https://hal-supelec.archives-ouvertes.fr/hal-00994975
Contributeur : Myriam Andrieux <>
Soumis le : jeudi 22 mai 2014 - 14:24:26
Dernière modification le : mercredi 16 mai 2018 - 11:23:47

Identifiants

  • HAL Id : hal-00994975, version 1

Citation

Christophe Moy. Reinforcement Learning Real Experiments for Opportunistic Spectrum Access. WSR'14, Mar 2014, Karlsruhe, Germany. 10 p., 2014. 〈hal-00994975〉

Partager

Métriques

Consultations de la notice

880