Abstract : This paper deals with the learning and decision making issue for cognitive radio (CR). Two reinforcement-learning algorithms proposed in the literature are compared for opportunistic spectrum access (OSA): Upper Confidence Bound (UCB) algorithm and Weight Driven (WD) algorithm. This paper also introduces two new metrics in order to evaluate the machine learning algorithm performance for CR: effective cumulative regret and percentage of successful trials. They provide a fair evaluation means for CR performance.
https://hal-supelec.archives-ouvertes.fr/hal-00994933
Contributor : Myriam Andrieux <>
Submitted on : Thursday, May 22, 2014 - 1:48:21 PM Last modification on : Tuesday, October 6, 2020 - 3:10:03 AM