Optimizing Diabetic Neuropathy Severity Classification Using Electromyography Signals Through Synthetic Oversampling Techniques
DOI:
https://doi.org/10.23887/janapati.v13i3.85675
Keywords:
electromyography, diabetes neuropathy, ROS, SMOTE, XGBoost
Abstract
Electromyography (EMG) signals are electrical signals generated by muscle activity and are useful for assessing the health of muscles and nerves. Class imbalance is common in EMG datasets, especially when patients have varied health conditions and data availability is limited. Imbalance is a major difficulty for machine learning models because it frequently biases predictions toward the dominant class at the expense of the minority classes. This study applies two oversampling techniques, the Synthetic Minority Over-sampling Technique (SMOTE) and Random Over Sampling (ROS), to address the imbalance and improve the performance of an XGBoost classifier on underrepresented classes. SMOTE performs better than the competing methods; applying an appropriate oversampling technique allows the model to learn patterns from both the majority data and the often neglected minority data.
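The difference between the two oversampling techniques named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the feature values, the severity labels ("mild", "moderate", "severe"), the function names, and the neighbour count `k=3` are all hypothetical, and only the Python standard library is used. ROS duplicates existing minority samples, whereas SMOTE creates new samples by interpolating between a minority sample and one of its nearest same-class neighbours.

```python
import random
from collections import Counter


def random_oversample(X, y, rng):
    """ROS: duplicate randomly chosen rows of each smaller class up to the largest class size."""
    counts = Counter(y)
    target = max(counts.values())
    X_out, y_out = list(X), list(y)
    for label, n in counts.items():
        rows = [x for x, lab in zip(X, y) if lab == label]
        for _ in range(target - n):
            X_out.append(list(rng.choice(rows)))  # exact copy of an existing row
            y_out.append(label)
    return X_out, y_out


def smote_oversample(X, y, rng, k=3):
    """SMOTE-style: synthesize rows on the segment between a row and a near same-class neighbour."""
    counts = Counter(y)
    target = max(counts.values())
    X_out, y_out = list(X), list(y)
    for label, n in counts.items():
        rows = [x for x, lab in zip(X, y) if lab == label]
        if len(rows) < 2:
            continue  # a single-sample class has no neighbour to interpolate with
        for _ in range(target - n):
            base = rng.choice(rows)
            # Up to k nearest same-class neighbours by squared Euclidean distance.
            neighbours = sorted(
                (r for r in rows if r is not base),
                key=lambda r: sum((a - b) ** 2 for a, b in zip(base, r)),
            )[:k]
            other = rng.choice(neighbours)
            gap = rng.random()  # interpolation factor in [0, 1)
            X_out.append([a + gap * (b - a) for a, b in zip(base, other)])
            y_out.append(label)
    return X_out, y_out


rng = random.Random(0)
# Hypothetical toy data: two features per sample and an imbalanced severity label.
X = [[0.1, 0.2], [0.2, 0.1], [0.15, 0.25], [0.3, 0.2], [0.25, 0.3], [0.2, 0.3],
     [0.9, 0.8], [0.8, 0.9],
     [0.5, 1.5], [0.6, 1.4], [0.55, 1.6]]
y = ["mild"] * 6 + ["moderate"] * 2 + ["severe"] * 3

X_ros, y_ros = random_oversample(X, y, rng)
X_sm, y_sm = smote_oversample(X, y, rng)
print(Counter(y_ros))
print(Counter(y_sm))
```

In practice, libraries such as imbalanced-learn provide `SMOTE` and `RandomOverSampler`, and the resampled data would be passed to a classifier such as `xgboost.XGBClassifier`. Oversampling is normally applied to the training split only, so that synthetic or duplicated samples do not leak into the evaluation data.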
License
Copyright (c) 2024 I Ketut Adi Purnawan, Adhi Dharma Wibawa, Arik Kurniawati, Mauridhi Hery Purnomo
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Authors who publish with Janapati agree to the following terms:
- Authors retain copyright and grant the journal the right of first publication with the work simultaneously licensed under a Creative Commons Attribution License (CC BY-SA 4.0) that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work. (See The Effect of Open Access)