Handover is an inherent part of mobile communication systems: it maintains the connectivity of every User Equipment (UE). The rapid growth in the number of connected UEs and the trend toward densely deployed Base Stations (BSs) raise significant challenges for handover procedures. The OpenRAN framework, with its open architecture, offers a transformative opportunity to apply data-driven approaches, e.g., Reinforcement Learning (RL), to connection management. However, state-of-the-art RL solutions are typically designed to be evaluated directly in running networks, which can degrade network performance. In this paper, we propose the 2OffRAN framework, which combines offline training and Off-Policy Evaluation. 2OffRAN collects Key Performance Metrics and UE-to-BS allocation data from real networks that run well-established handover algorithms. It then uses the collected dataset to train a Deep Q-Learning algorithm for more efficient connection management, and builds a RAN dynamics model based on a Deep Neural Network to evaluate the RL algorithm's performance before deployment. Results show that 2OffRAN outperforms traditional handover strategies, improving throughput, user fairness, and load balancing while enhancing the safety of deploying RL in the RAN.
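
The two-stage idea in the abstract (train on logged data only, then score the learned policy in a learned model before deployment) can be illustrated with a small sketch. This is not the authors' implementation: a tabular Q-learning update stands in for Deep Q-Learning, an empirical count-based transition model stands in for the Deep Neural Network dynamics model, and the toy environment, the function names, and all parameter values are hypothetical.

```python
import numpy as np

def make_logged_dataset(n_steps, n_states, n_actions, rng):
    """Synthetic stand-in for logged (state, action, reward, next_state) tuples."""
    states = rng.integers(n_states, size=n_steps)
    actions = rng.integers(n_actions, size=n_steps)  # assumed behavior policy: uniform
    # Hypothetical reward: choosing "BS" a pays off when a == s % n_actions.
    rewards = (actions == states % n_actions).astype(float)
    next_states = rng.integers(n_states, size=n_steps)
    return states, actions, rewards, next_states

def offline_q_learning(data, n_states, n_actions, gamma=0.9, lr=0.1, epochs=20):
    """Fit a Q-table from the fixed dataset only (no environment interaction)."""
    s, a, r, s2 = data
    q = np.zeros((n_states, n_actions))
    for _ in range(epochs):
        for i in range(len(s)):
            target = r[i] + gamma * q[s2[i]].max()
            q[s[i], a[i]] += lr * (target - q[s[i], a[i]])
    return q

def fit_dynamics_model(data, n_states, n_actions):
    """Estimate transition probabilities and mean rewards from the same log."""
    s, a, r, s2 = data
    counts = np.zeros((n_states, n_actions, n_states))
    reward_sum = np.zeros((n_states, n_actions))
    np.add.at(counts, (s, a, s2), 1.0)
    np.add.at(reward_sum, (s, a), r)
    visits = counts.sum(axis=2)
    safe = np.maximum(visits, 1.0)
    # Unvisited (state, action) pairs fall back to a uniform next-state guess.
    trans = np.where(visits[..., None] > 0, counts / safe[..., None], 1.0 / n_states)
    return trans, reward_sum / safe

def evaluate_in_model(policy, trans, mean_reward, gamma=0.9, iters=500):
    """Score a deterministic policy inside the learned model: V = r + gamma * P V."""
    idx = np.arange(trans.shape[0])
    p_pi, r_pi = trans[idx, policy], mean_reward[idx, policy]
    v = np.zeros(trans.shape[0])
    for _ in range(iters):
        v = r_pi + gamma * p_pi @ v
    return float(v.mean())

rng = np.random.default_rng(0)
N_STATES, N_ACTIONS = 6, 3
data = make_logged_dataset(5000, N_STATES, N_ACTIONS, rng)

q_table = offline_q_learning(data, N_STATES, N_ACTIONS)
learned_policy = q_table.argmax(axis=1)
baseline_policy = np.zeros(N_STATES, dtype=int)  # hypothetical fixed rule: always BS 0

trans, mean_reward = fit_dynamics_model(data, N_STATES, N_ACTIONS)
learned_value = evaluate_in_model(learned_policy, trans, mean_reward)
baseline_value = evaluate_in_model(baseline_policy, trans, mean_reward)
print(f"learned: {learned_value:.2f}  baseline: {baseline_value:.2f}")
```

Both stages read the same fixed log, so the learned policy is compared against the baseline without ever acting in the live system. That is the safety property the abstract emphasizes.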

2OffRAN: Offline Off-Policy Reinforcement Learning for Safe Handover in O-RAN / Navarro, Annalisa; Botta, Alessio; Canonico, Roberto; Wang, Yizhou; Fitzek, Frank H. P.; Nguyen, Giang T. - (2025), pp. 1-6. (2nd IEEE International Conference on Machine Learning for Communication and Networking, ICMLCN 2025, Spain, 2025) [10.1109/icmlcn64995.2025.11140172].

2OffRAN: Offline Off-Policy Reinforcement Learning for Safe Handover in O-RAN

Navarro, Annalisa; Botta, Alessio; Canonico, Roberto
2025

Use this identifier to cite or link to this document: https://hdl.handle.net/11588/1048677