Carrascosa M, Bellalta B. Decentralized AP selection using Multi-Armed Bandits: Opportunistic ε-Greedy with Stickiness. Symposium on Computers and Communications IEEE ISCC 2019

Carrascosa M, Bellalta B. Decentralized AP selection using Multi-Armed Bandits: Opportunistic ε-Greedy with Stickiness. arXiv pre-print

WiFi densification leads to the existence of multiple overlapping coverage areas, which allows user stations (STAs) to choose between different Access Points (APs). The standard WiFi association method makes the STAs select the AP with the strongest signal, which in many cases leads to underutilization of some APs while overcrowding others. To mitigate this situation, Reinforcement Learning techniques such as Multi-Armed Bandits can be used to dynamically learn the optimal mapping between APs and STAs, and so redistribute the STAs among the available APs accordingly. This is an especially challenging problem since the network response observed by a given STA depends on the behavior of the others, and so it is very difficult to predict without a global view of the network. In this paper, we focus on solving this problem in a decentralized way, where STAs independently explore the different APs inside their coverage range and select the one that best satisfies their needs. To do so, we propose a novel approach called Opportunistic ε-greedy with Stickiness that halts the exploration when a suitable AP is found. The STA then remains associated with that AP while it is satisfied, only resuming the exploration after several unsatisfactory association periods. With this approach, we significantly reduce the network response variability, improving the ability of the STAs to find a solution faster, as well as achieving a more efficient use of the network resources.
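
The behaviour described in the abstract (explore with ε-greedy, stop once a suitable AP is found, resume only after several unsatisfactory periods) can be illustrated with a minimal Python sketch. This is not the authors' implementation, which is available in the linked repository. The class name `StickyEpsilonGreedy`, the parameters `epsilon`, `stickiness` and `threshold`, and the reward definition (obtained throughput divided by required throughput, with a STA counted as satisfied when the ratio reaches 1.0) are all assumptions made for illustration.

```python
import random


class StickyEpsilonGreedy:
    """Hypothetical per-STA agent; all names and defaults are illustrative."""

    def __init__(self, n_aps, epsilon=0.1, stickiness=3, threshold=1.0, seed=None):
        self.n_aps = n_aps              # APs within coverage range (the arms)
        self.epsilon = epsilon          # probability of a random choice while exploring
        self.stickiness = stickiness    # unsatisfactory periods tolerated before re-exploring
        self.threshold = threshold      # assumed reward level at which the STA is satisfied
        self.rng = random.Random(seed)
        self.counts = [0] * n_aps       # number of association periods per AP
        self.means = [0.0] * n_aps      # running mean reward per AP
        self.current = self.rng.randrange(n_aps)
        self.bad_periods = 0            # consecutive unsatisfactory periods
        self.exploring = True

    def update(self, reward):
        """Record the reward observed on the current AP for one association period."""
        ap = self.current
        self.counts[ap] += 1
        self.means[ap] += (reward - self.means[ap]) / self.counts[ap]
        if reward >= self.threshold:
            # Suitable AP found: halt exploration and stay associated.
            self.bad_periods = 0
            self.exploring = False
        else:
            self.bad_periods += 1
            if self.bad_periods >= self.stickiness:
                # Too many unsatisfactory periods in a row: resume exploration.
                self.exploring = True

    def select(self):
        """Choose the AP for the next association period."""
        if not self.exploring:
            return self.current  # stick to the current AP
        if self.rng.random() < self.epsilon:
            self.current = self.rng.randrange(self.n_aps)
        else:
            self.current = max(range(self.n_aps), key=lambda a: self.means[a])
        return self.current


if __name__ == "__main__":
    # Toy loop with made-up rewards: only AP 2 satisfies the STA.
    agent = StickyEpsilonGreedy(n_aps=3, epsilon=0.5, seed=1)
    for _ in range(20):
        ap = agent.select()
        agent.update(1.0 if ap == 2 else 0.4)
    print(agent.current, agent.exploring)
```

In this sketch a single satisfactory period is enough to stop exploring, and the counter of unsatisfactory periods resets whenever the STA is satisfied again. Both are simplifying choices, and the paper should be consulted for the exact rules and reward function.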

Keywords: IEEE 802.11, WLANs, Reinforcement Learning, Multi-Armed Bandits

Additional material:

arXiv pre-print: https://arxiv.org/abs/1903.00281
Code: https://github.com/wn-upf/Decentralized-AP-selection-using-Multi-Armed-Bandits

DTIC MdM Strategic Program: Artificial and Natural Intelligence for ICT and beyond
