Handling Imbalanced 5G and Beyond Network Tabular Data Using Conditional Generative Models

Jimena Andrade-Hoz, Jose M. Alcaraz-Calero, Qi Wang

Research output: Chapter in Book/Published conference outputConference publication

Abstract

Data-driven machine learning based approaches have been playing increasingly important roles in 5G and beyond (B5G) network management and optimisation. One of the major challenges is that the existence of class imbalance in networking datasets can greatly bias the classifier towards a majority classification. This discrepancy can pose a serious problem for AI-based models, which require large and diverse amounts of data to learn patterns and generate classifications. The generation of synthetic data is a process that has evolved over time from the application of statistical models to a more machine learning-centric approaches. In this research work we present the development, implementation, evaluation and comparison of four generative models for tabular data on a B5G network management dataset. These models have been previously optimised according to certain evaluation metrics of the generated synthetic data. The dataset presents a problem of imbalance between its classes, which is improved by using generative models to enrich it with synthetic data. The results show that machine learning based generative models obtain more accurate data than traditional statistical models, and are much faster in terms of conditional data sampling.
Original languageEnglish
Title of host publicationProceedings of the 20th International Wireless Communications & Mobile Computing Conference
Place of PublicationUnited States
PublisherIEEE
DOIs
Publication statusPublished - 17 Jul 2024
Event20th International Wireless Communications & Mobile Computing Conference - , Cyprus
Duration: 27 May 202431 May 2024
https://iwcmc.net/2024/index.php

Conference

Conference20th International Wireless Communications & Mobile Computing Conference
Abbreviated titleIWCMC 2024
Country/TerritoryCyprus
Period27/05/2431/05/24
Internet address

Keywords

  • generative model
  • synthetic tabular data
  • networking dataset
  • B5G network
  • network optimisation

Fingerprint

Dive into the research topics of 'Handling Imbalanced 5G and Beyond Network Tabular Data Using Conditional Generative Models'. Together they form a unique fingerprint.

Cite this