Experimental Performance Comparison and Analysis for Various MAB Problems under Cognitive Radio Framework

Navikkumar Modi; Christophe Moy; Philippe Mary

Communication Dans Un Congrès Année : 2014

Experimental Performance Comparison and Analysis for Various MAB Problems under Cognitive Radio Framework

(1) , (1) , (2)

1
2

Navikkumar Modi

Fonction : Auteur
PersonId : 7611
IdHAL : navikkumar-modi
IdRef : 221606734

Institut d'Électronique et des Technologies du numéRique

Christophe Moy

Fonction : Auteur
PersonId : 10216
IdHAL : christophe-moy
ORCID : 0000-0001-9639-1648
IdRef : 138021341

Institut d'Électronique et des Technologies du numéRique

Philippe Mary

Fonction : Auteur
PersonId : 933376

SCN

Résumé

This presentation gives a brief overview and experimental performance comparison of different types of the online sequential decision making Multiarmed bandit (MAB) problem for the cognitive radio opportunistic spectrum access. In this work, we consider online learning problem of classical, rested and restless MAB for single user/arm and furthermore, it will be extended for the multiple users/arms. A classical MAB problem assumes independent and identically distributed (i.i.d) rewards, while rested and restless formulation of the MAB assumes Markovian rewards. The fundamental objective of the MAB formulation is to maximize the total rewards obtained by playing the best optimal arm. The classical difficulty of the MAB is a fundamental trade-off between exploration and exploitation, which requires an efficient policy design to achieve optimum performance. The short introduction and performance analysis of the various policies (UCB1, UCB Tuned, KL-UCB, etc.) are done by analyzing regret, which is defined as a reward loss compare to optimal performance. For almost all the algorithms, a detailed theoretical analysis of the regret bound is available, while it's important to analyze the experimental performance of the different policies on various MAB formulations. The experimental performance of different MAB algorithms could be easily assessed case by case of specific problems, but it would be interesting to present a more convincing comparison of their actual experimental performance. The main objective of the presentation is to provide an extensive experimental analysis of existing MAB algorithms along different dimensions such as, expected regret, optimal arm selection, and computational complexity. Furthermore, some experimental measurements under dynamic spectrum access framework are carried out for the validation of the theoretical results.

Mots clés

Machine learning cognitive radio MAB UCB

Domaines

Electronique

Myriam Andrieux : Connectez-vous pour contacter le contributeur

https://centralesupelec.hal.science/hal-01093467

Soumis le : mercredi 10 décembre 2014-16:34:07

Dernière modification le : vendredi 24 mars 2023-14:52:59

Dates et versions

hal-01093467 , version 1 (10-12-2014)

Identifiants

HAL Id : hal-01093467 , version 1

Citer

Navikkumar Modi, Christophe Moy, Philippe Mary. Experimental Performance Comparison and Analysis for Various MAB Problems under Cognitive Radio Framework. WinnComm-Europe 2014, Nov 2014, Rome, Italy. ⟨hal-01093467⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

SUPELEC UNIV-NANTES UNIV-RENNES1 CNRS INSA-RENNES IETR SUP_IETR IETR-INSA IETR_SCEE CENTRALESUPELEC UR1-MATH-STIC UR1-UFR-ISTIC IETR-SYSCOM UNIV-RENNES INSA-GROUPE UR1-MATH-NUM IETR-ASIC IETR-SIGNAL NANTES-UNIVERSITE

145 Consultations

0 Téléchargements

Experimental Performance Comparison and Analysis for Various MAB Problems under Cognitive Radio Framework

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager