Jurnal Informatika (Mar 2025)
Performance Analysis of Resampling Techniques for Overcoming Data Imbalance in Multiclass Classification
Abstract
In the digital era, the development of modern technology has brought significant transformation to the medical world. The main objective of this research is to identify the performance of deep learning models in classifying kidney disease. By integrating the Convolutional Neural Network model, the performance of the classification process can be analyzed effectively and efficiently. However, data imbalance dramatically affects the performance evaluation of a model, requiring data resampling techniques. This research applies two resampling techniques, bootstrap-based random oversampling and random undersampling, to training data and adds data augmentation to increase image variations to prevent model overfitting. The architecture uses MobileNetV2, which compares hyperparameter fine-tuning in three optimizers. This research shows that the performance of MobileNetV2, which implements the bootstrap-based random oversampling technique, has the highest accuracy compared to random undersampling and no resampling methods. The oversampling technique with the RMSprop optimizer produced the highest accuracy, namely 95%. With precision, recall, and F-1 score, respectively, 0.93, 0.95, 0.94. The accuracy of oversampling with the Adam and Nadam optimizer is 94%. So, the contribution of this research is by applying bootstrap-based oversampling techniques and adding data augmentation to produce good model performance to be used to classify medical images.
Keywords