Comprehensive Analysis of Different Techniques for Data Augmentation and Proposal of New Variants of BOSME and GAN

    Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

    Resumen

    In many environments in which detection of minority class instances is critical, the available data intended for training Machine Learning models is poorly distributed. The data imbalance usually produces deterioration of the trained model by generalisation of instances belonging to minority class predicting as majority class instances. To avoid these, different techniques have been adopted in the literature and expand the original database such as Generative Adversarial Networks (GANs) or Bayesian network-based over-sampling method (BOSME). Starting from these two methods, in this work we propose three new variants of data augmentation to address data imbalance issue. We use traffic event data from three different areas of California divided in two subgroups attending their severity. Experiments show that top performance cases where reached after using our variants. The importance of data augmentation techniques as preprocessing tool has been proved as well, as a consequence of performance drop of systems in which original databases with imbalanced data where used.

    Idioma originalInglés
    Título de la publicación alojadaHybrid Artificial Intelligent Systems - 18th International Conference, HAIS 2023, Proceedings
    EditoresPablo García Bringas, Hilde Pérez García, Francisco Javier Martínez de Pisón, Francisco Martínez Álvarez, Alicia Troncoso Lora, Álvaro Herrero, José Luis Calvo Rolle, Héctor Quintián, Emilio Corchado
    EditorialSpringer Science and Business Media Deutschland GmbH
    Páginas145-155
    Número de páginas11
    ISBN (versión impresa)9783031407246
    DOI
    EstadoPublicada - 2023
    EventoProceedings of the 18th International Conference on Hybrid Artificial Intelligence Systems, HAIS 2023 - Salamanca, Espana
    Duración: 5 sept 20237 sept 2023

    Serie de la publicación

    NombreLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volumen14001 LNAI
    ISSN (versión impresa)0302-9743
    ISSN (versión digital)1611-3349

    Conferencia

    ConferenciaProceedings of the 18th International Conference on Hybrid Artificial Intelligence Systems, HAIS 2023
    País/TerritorioEspana
    CiudadSalamanca
    Período5/09/237/09/23

    Huella

    Profundice en los temas de investigación de 'Comprehensive Analysis of Different Techniques for Data Augmentation and Proposal of New Variants of BOSME and GAN'. En conjunto forman una huella única.

    Citar esto