Optimizing Diabetic Neuropathy Severity Classification Using Electromyography Signals Through Synthetic Oversampling Techniques

Authors

  • I Ketut Adi Purnawan Departement of Electrical Engineering, Institut Teknologi Sepuluh Nopember
  • Adhi Dharma Wibawa Department of Medical Technology and Engineering, Institut Teknologi Sepuluh Nopember
  • Arik Kurniawati Department of Informatics, Universitas Trunojoyo Madura
  • Mauridhi Hery Purnomo Institut Teknologi Sepuluh Nopember

DOI:

https://doi.org/10.23887/janapati.v13i3.85675

Keywords:

electromyography, diabetes neuropathy, ROS, SMOTE, XGBoost

Abstract

Electromyography signals are electrical signals generated by muscle activity and are very useful for analyzing the health conditions of muscles and nerves. Data imbalance is a prevalent issue in EMG signal data, especially when addressing patients with varied health conditions and restricted data availability. A major difficulty for machine learning models is class imbalance in datasets, which frequently leads to biased predictions favoring the dominant class and neglecting the minority classes. The data augmentation method employs the Synthetic Minority Over Sampling Technique (SMOTE) and Random Over Sampling (ROS) to address data imbalances and enhance the performance of classification models for underrepresented classes. This study employs an oversampling technique to enhance the efficacy of the XG Boost model. SMOTE exhibits better efficacy relative to competing methods; the application of appropriate oversampling techniques allows models to integrate patterns from both majority and often neglected minority data.

References

R. Pop-Busui et al., “Diabetic Neuropathy: A Position Statement by the American Diabetes Association,” Diabetes Care, vol. 40, no. 1, pp. 136–154, Oct. 2017, doi: 10.2337/dc16-2042.

O. S. Purwanti, N. Nursalam, and M. G. R. Pandin, “Early detection of diabetic neuropathy based on health belief model: a scoping review,” Frontiers in Endocrinology, vol. 15, Oct. 2024, doi: 10.3389/fendo.2024.1369699.

J. Carmichael, H. Fadavi, F. Ishibashi, A. C. Shore, and M. Tavakoli, “Advances in Screening, Early Diagnosis and Accurate Staging of Diabetic Neuropathy,” Frontiers in Endocrinology, vol. 12, Oct. 2021, doi: 10.3389/fendo.2021.671257.

F. Shakeel, A. S. Sabhitha, and S. Sharma, “Exploratory review on class imbalance problem: An overview,” in 2017 8th International Conference on Computing, Communication and Networking Technologies (ICCCNT), IEEE, Oct. 2017, pp. 1–8. doi: 10.1109/ICCCNT.2017.8204150.

N. Abdelhamid, A. Padmavathy, D. Peebles, F. Thabtah, and D. Goulder-Horobin, “Data Imbalance in Autism Pre-Diagnosis Classification Systems: An Experimental Study,” Journal of Information & Knowledge Management, vol. 19, no. 01, p. 2040014, Oct. 2020, doi: 10.1142/S0219649220400146.

T. S. Amelia, M. N. S. Hasibuan, and R. Pane, “Comparative analysis of resampling techniques on Machine Learning algorithm,” Sinkron, vol. 7, no. 2, pp. 628–634, Oct. 2022, doi: 10.33395/sinkron.v7i2.11427.

A. S. Ashraf and T. Ahmed, “MACHINE LEARNING SHREWD APPROACH FOR AN IMBALANCED DATASET CONVERSION SAMPLES,” Journal of Engineeringand Technology, vol. 11, no. 1, 2020.

N. U. Niaz, K. M. N. Shahariar, and M. J. A. Patwary, “Class Imbalance Problems in Machine Learning: A Review of Methods And Future Challenges,” in Proceedings of the 2nd International Conference on Computing Advancements, ACM, Oct. 2022, pp. 485–490. doi: 10.1145/3542954.3543024.

E. Sutoyo and M. A. Fadlurrahman, “Penerapan SMOTE untuk Mengatasi Imbalance Class dalam Klasifikasi Television Advertisement Performance Rating Menggunakan Artificial Neural Network,” Jurnal Edukasi dan Penelitian Informatika (JEPIN), vol. 6, no. 3, p. 379, Dec. 2020, doi: 10.26418/jp.v6i3.42896.

S. V. Narwane and S. D. Sawarkar, “Is handling unbalanced datasets for machine learning uplifts system performance?: A case of diabetic prediction,” Diabetes & Metabolic Syndrome: Clinical Research & Reviews, vol. 16, no. 9, p. 102609, Oct. 2022, doi: 10.1016/j.dsx.2022.102609.

M. A. Wiratama and W. M. Pradnya, “Optimasi Algoritma Data Mining Menggunakan Backward Elimination untuk Klasifikasi Penyakit Diabetes,” Jurnal Nasional Pendidikan Teknik Informatika (JANAPATI), vol. 11, no. 1, p. 1, Apr. 2022, doi: 10.23887/janapati.v11i1.45282.

A. Ramezankhani, O. Pournik, J. Shahrabi, F. Azizi, F. Hadaegh, and D. Khalili, “The Impact of Oversampling with SMOTE on the Performance of 3 Classifiers in Prediction of Type 2 Diabetes,” Medical Decision Making, vol. 36, no. 1, pp. 137–144, Jan. 2016, doi: 10.1177/0272989X14560647.

A. T. Akbar, R. Husaini, and H. Prapcoyo, “Preprocessing Using SMOTE and K-Means for Classification by Logistic Regression on Pima Indian Diabetes Dataset,” Telematika, vol. 20, no. 2, p. 238, Jun. 2023, doi: 10.31315/telematika.v20i2.9676.

H. Hairani, K. E. Saputro, and S. Fadli, “K-means-SMOTE for handling class imbalance in the classification of diabetes with C4.5, SVM, and naive Bayes,” Jurnal Teknologi dan Sistem Komputer, vol. 8, no. 2, pp. 89–93, Apr. 2020, doi: 10.14710/jtsiskom.8.2.2020.89-93.

T. Riston et al., “Oversampling Methods for Handling Imbalance Data in Binary Classification,” 2023, pp. 3–23. doi: 10.1007/978-3-031-37108-0_1.

F. Mohd, M. A. Jalil, N. M. M. Noora, S. Ismail, W. F. F. Yahya, and M. Mohamad, “Improving Accuracy of Imbalanced Clinical Data Classification Using Synthetic Minority Over-Sampling Technique,” 2019, pp. 99–110. doi: 10.1007/978-3-030-36365-9_8.

N. M. Nayan, A. Islam, M. U. Islam, E. Ahmed, M. M. Hossain, and M. Z. Alam, “SMOTE Oversampling and Near Miss Undersampling Based Diabetes Diagnosis from Imbalanced Dataset with XAI Visualization,” in 2023 IEEE Symposium on Computers and Communications (ISCC), IEEE, Oct. 2023, pp. 1–6. doi: 10.1109/ISCC58397.2023.10218281.

K. S. Gill, V. Anand, D. Upadhyay, and S. Dangi, “Diabetes Classification Using XG Boost Classification Techniques Through Machine Learning based SMOTE Analysis,” in 2024 3rd International Conference for Innovation in Technology (INOCON), IEEE, Oct. 2024, pp. 1–4. doi: 10.1109/INOCON60754.2024.10512046.

K. D. K. Wardhani and M. Akbar, “Diabetes Risk Prediction Using Extreme Gradient Boosting (XGBoost),” Jurnal Online Informatika, vol. 7, no. 2, pp. 244–250, Oct. 2022, doi: 10.15575/join.v7i2.970.

R. Ismail, “Muscle Power Signal Acquisition Monitoring Using Surface EMG,” J Biomed Res Environ Sci, vol. 3, no. 5, pp. 663–667, May 2022, doi: 10.37871/jbres1493.

M. Arslan, M. Guzel, M. Demirci, and S. Ozdemir, “SMOTE and Gaussian Noise Based Sensor Data Augmentation,” in 2019 4th International Conference on Computer Science and Engineering (UBMK), IEEE, Sep. 2019, pp. 1–5. doi: 10.1109/UBMK.2019.8907003.

M. W. Dwinanda, N. Satyahadewi, and W. Andani, “CLASSIFICATION OF STUDENT GRADUATION STATUS USING XGBOOST ALGORITHM,” BAREKENG: Jurnal Ilmu Matematika dan Terapan, vol. 17, no. 3, pp. 1785–1794, Sep. 2023, doi: 10.30598/barekengvol17iss3pp1785-1794.

Downloads

Published

2024-12-01

How to Cite

Purnawan, I. K. A., Wibawa, A. D., Kurniawati, A., & Purnomo, M. H. (2024). Optimizing Diabetic Neuropathy Severity Classification Using Electromyography Signals Through Synthetic Oversampling Techniques. Jurnal Nasional Pendidikan Teknik Informatika : JANAPATI, 13(3), 681–690. https://doi.org/10.23887/janapati.v13i3.85675

Issue

Section

Articles