Wilhelmi F, Bellalta B, Cano C, Jonsson A. Implications of Decentralized Q-learning Resource Allocation in Wireless Networks. arXiv pre-print

List of results published directly linked with the projects co-funded by the Spanish Ministry of Economy and Competitiveness under the María de Maeztu Units of Excellence Program (MDM-2015-0502).

List of publications acknowledging the funding in Scopus.

The record for each publication will include access to postprints (following the Open Access policy of the program), as well as datasets and software used. Ongoing work with UPF Library and Informatics will improve the interface and automation of the retrieval of this information soon.

The MdM Strategic Research Program has its own community in Zenodo for material available in this repository as well as at the UPF e-repository

Back Wilhelmi F, Bellalta B, Cano C, Jonsson A. Implications of Decentralized Q-learning Resource Allocation in Wireless Networks. arXiv pre-print

Wilhelmi F, Bellalta B, Cano C, Jonsson A. Implications of Decentralized Q-learning Resource Allocation in Wireless Networks. arXiv pre-print.

Reinforcement Learning is gaining attention by the wireless networking community due to its potential to learn good-performing configurations only from the observed results. In this work we propose a stateless variation of Q-learning, which we apply to exploit spatial reuse in a wireless network. In particular, we allow networks to modify both their transmission power an dthe channel used solely based on the experienced throughput. We concentrate in a completely decentralized scenario in which no information about neighbouring nodes is available to the learners. Our results show that although the algorithm is able to find the best-performing actions to enhance aggregate throughput, there is high variability in the throughput experienced by the individual networks. We identify the cause of this variability as the adversarial setting of our setup, in which the most played actions provide intermittent good/poor performance depending on the neighbouring decisions. We also evaluate the effect of the intrinsic learning parameters of the algorithm on this variability

Additional material:

Code for simulation (GitHub, commit: eb4042a1830c8ea30b7eae3d72a51afe765a8d86)
Open access version at UPF repository and arXiv pre-print

Link: https://arxiv.org/abs/1705.10508

DTIC MdM Strategic Program: Artificial and Natural Intelligence for ICT and beyond

Wilhelmi F, Bellalta B, Cano C, Jonsson A. Implications of Decentralized Q-learning Resource Allocation in Wireless Networks. arXiv pre-print

Related Assets