Reinforcement Learning Real Experiments for Opportunistic Spectrum Access

Abstract : This paper proposes the analysis of experimental results obtained on the first worldwide implementation on real signals of reinforcement learning algorithms used for cognitive radio decision making in an opportunistic spectrum access (OSA) context. Two algorithms, able to act in highly unpredictable conditions, are compared: UCB (Upper Confidence Bound) and WD (Weight Driven). The OSA scenario is played in lab conditions around a couple of USRP N210 platforms. One platform is playing the role of the primary network and generates signals in a set of frequency bands with a pre-defined mean vacancy probability for each. An OFDM modulation scheme is used here, generated with GRC environment (GNU Radio Companion). Another platform runs Simulink in order to play the role of the secondary user (SU) cognitive engine that learns. The experimental results shown in this paper illustrate how the SU learns and predicts the channels' vacancy thanks to UCB and WD algorithms. They validate in real conditions machine learning algorithms capabilities for opportunistic spectrum access context, in terms of learning speed and convergence accuracy. They enable also to compare UCB and WD performance.
Document type :
Conference papers
Complete list of metadatas

https://hal-supelec.archives-ouvertes.fr/hal-00994975
Contributor : Myriam Andrieux <>
Submitted on : Thursday, May 22, 2014 - 2:24:26 PM
Last modification on : Friday, November 16, 2018 - 1:30:06 AM

Identifiers

  • HAL Id : hal-00994975, version 1

Citation

Christophe Moy. Reinforcement Learning Real Experiments for Opportunistic Spectrum Access. WSR'14, Mar 2014, Karlsruhe, Germany. 10 p. ⟨hal-00994975⟩

Share

Metrics

Record views

907