Application of Naïve Bayes Algorithm for Diabetes Prediction

  • Laili Najla Salsabila UIN Syarif Hidayatullah Jakarta
  • Muhamad Riyo Dwi Pangga UIN Syarif Hidayatullah Jakarta
  • Syahrul Mauhub Yasser UIN Syarif Hidayatullah Jakarta
  • Nabila Arin Riyani UIN Syarif Hidayatullah Jakarta
  • Siti Aminah UIN Syarif Hidayatullah Jakarta
  • Wahyunengsih Wahyunengsih UIN Syarif Hidayatullah Jakarta
Keywords: Diabetes, Naïve Bayes, Prediction, Classification

Abstract

Diabetes is a chronic disease that is considered a significant health problem worldwide. Early detection and prediction of diabetes is a crucial step to enable early intervention and prevent complications. This study aims to apply the Naïve Bayes algorithm in predicting the probability of someone having diabetes. The dataset used in the study was obtained from the National Institute of Diabetes and Digestive and Kidney Diseases. Attributes such as gender, age, body mass index, glucose level, and others were used as independent variables in the Naïve Bayes algorithm to classify them into two groups: having or not having diabetes. From the research results, it has been shown that the Naïve Bayes algorithm can produce a prediction accuracy of 84.6%, 82.3% precision, and 60.8% recall.

References

[1] Nining Lestari and Burhannudin Ichsan, "Diabetes Melitus sebagai Faktor Risiko Keparahan dan Kematian Pasien Covid-19: Meta-Analisi," Biomedika, vol. 13, pp. 83-94, February 2021.
[2] Aisah Mujahidah Rasunah, Erwin Budi Setiawan, and Isman Kurniawan, "Drug Review-based Diabetes Prediction by Using Naïve Bayes Method," in International Conference Advancement in Data Science, E-learning and Information Systems, Bali, 2021.
[3] Fitriyani , "Prediksi Diabetes Menggunakan Algoritma Naive Bayes dan Greedy Forward Selection," Jurnal Nasional Teknologi dan Sistem Informasi, vol. 07, pp. 61-69, August 2021.
[4] Yuni Wardana and Devni Prima Sari, "Prediksi Penyakit Diabetes Dengan Naïve Bayes," Journal Of Mathematics UNP, vol. 8, pp. 89-97, September 2023.
[5] Fitrokh Nur Ikhromr, Ipin Sugiyarto, Umi Faddillah, and Bibit Sudarsono, "Implementasi Data Mining Untuk Memprediksi Penyakit Diabetes Menggunakan Naive Bayes dan K-Nearest Neighbor," Journal of Informatio Technology and Computer Science, pp. 416-428, May 2023.
[6] Ahmad Zaki Arrayyan, Hendra Setiawan, and Karisma Trinanda Putra, "Naive Bayes for Diabetes Prediction: Developing a Classification Model for Risk Identification in Specific Populations," Semesta Teknika, vol. 27, pp. 28-36, May 2024.
[7] Sherrvy Eva Wijayaningrum. (2023, October) Diabetes - Penyebab, Jenis, Gejala dan Pengobatannya. [Online]. https://www.siloamhospitals.com- /en/informasi-siloam/artikel/diabetes
[8] Mochammad Haldi Widianto. (2019) BINUS University. [Online]. https://binus.ac.id/bandung/2019/12/algoritma-naive-bayes/
[9] Trivusi. (2022, September) Pengertian dan Contoh Algoritma Naive Bayes Classifier. [Online]. https://www.trivusi.web.id/2022/07/algoritma-naive- bayes.html? m=1
[10] Ram D. Joshi and Chandra K. Dhakal, "Predicting Type 2 Diabetes Using Logistic Regression and Machine Learning Approaches," International Journal of Environmental Research and Public Health, vol. 18, pp. 1-17, July 2021.
[11] Ayuni Fachrunisa Lubis et al., "Classification of Diabetes Mellitus Sufferers Eating Patterns Using K-Nearest Neighbors, Naive Bayes and Decission Tree," Public Research Journal of Engineering, Data Technology and Computer Science, vol. 2, pp. 44-51, July 2024.
[12] Mission & Vision - NIDDK | National Institute of Diabetes and Digestive and Kidney Diseases. [Online]. https://www.niddk.nih.gov/about-niddk/meet- director/mission-vision
[13]"Predict diabetes based on diagnostic measures | Kaggle," https://www.kaggle.com/datasets/houcembenmansour/predict-diabetes-based-on-diagnostic-measures (accessed 02 April 2024).
[14] Arif Rahman. (2020, April) Cyberthreat. Id. [Online]. https://cyberthreat.id/read/6089/Yuk-Simak-Data-data-yang-Termasuk-Sensitif-di-Dunia-Medis
[15] Maulia Indriana Ghani. (2022, January) Zenius. [Online]. https://www.zenius.net/blog/teorema-bayes
[16] Maria Susan Anggreany. (2020, November) School of Computer Science | BINUS University. [Online]. https://socs.binus.ac.id/2020/11/01/- confusion-matrix/
[17] Naomi Chatrina Siregar, Riki Ruli A. Siregar, and M. Yoga Distra Sudirman, "Implementasi Metode Naive Bayes Classifier(NBC) Pada Komentar Warga Sekolah Mengenai Pelaksanaan Pembelajaran Jarak Jauh (PJJ)," Jurnal Teknologia, vol. 3, pp. 102-110, August 2020.
[18] Alfan Zainal Macfud, Abdi Pandu Kusuma, and Wahyu Dwi Puspitasari, "Analisis Algoritma Naive Bayes Classifier (NBC) Pada Klasifikasi Tingkat Minat Barang Di Toko Violet Cell," Jurnal Mahasiswa Teknik Informatika, vol. 7, pp. 87-94, February 2023.
[19] What Is Machine Learning (ML) | IBM. [Online]. https://www.ibm.com/topics/machine-learning
[20] Ivan Diryana Sudirman, Data-Driven Entrepreneur: Bisnis Berdaya Saing dengan Data Science dan Rapid Miner. Jakarta Selatan: Penerbit Salemba, 2023.
[21] Trivusi. (2022, September) Data Splitting: Pengertian, Metode, dan Kegunaannya. . [Online]. https://www.trivusi.web.id/2022/08/data-splitting.html
[22] Anthony Anggrawan and Mayadi, "Application of KNN Machine Learning and Fuzzy C-Means to Diagnose Diabetes," Matrik: Jurnal Manajemen, Teknik Informatika, dan Rekayasa Komputer, vol. 22, pp. 405-418, March 2023.
[23] Narendra Mohan and Vinod Jain, "Performance Analysis of Support Vector Machine in Diabetes Prediction," in 2020 4th International Conference on Electronics, Communication and Aerospace Technology (ICECA), Coimbatore, 2020.
[24] Ahmad Afif, "Penerapan Algoritma Naïve Bayes Untuk Klasifikasi Penyakit Diabetes Mellitus di Rumah Sakit Aisyiah," Jurnal Ilmu Komputer dan Matematika, vol. 1, pp. 40-46, 2020.
[25] Sarang Narkhede. (2018, May) Understanding Confusion Matrix | Towards Data Science. [Online]. https://towardsdatascience.com/understanding- confusion-matrix-a9ad42dcfd62
[26] Muhammad Ridhwan Galela. (2023, November) Kemenkeu Learning Center. [Online]. https://klc2.kemenkeu.go.id/kms/knowledge/tidak-perlu-confused- dengan-confusion-matrix-728befa0/detail/
[27] Khafifah Munawaroh and Alamsyah, "Performance Comparison of SVM, Naive Bayes, and KNN Algorithms for Analysis of Public Opinion Sentiment Against COVID-19 Vaccination on Twitter," Journal of Advances in Information Systems and Technology, pp. 113-125, October 2022.
[28] Jwan Kanaan Alwan, Dhulfiqar Saad Jaafar, and Itimad Raheem Ali, "Diabetes Diagnosis Systems Using Modified Naive Bayes Classifier," Indonesian Journal of Electrical Engineering and Computer Science, vol. 28, pp. 1766-1774, September 2022.
[29] Turke Althobaiti, Saad Althobaiti, and Mahmoud M. Selim, "An Optimized Diabetes Mellitus Detection Model for Improved Prediction of Accuracy and Clinical Decision-Making," Alexandria Engineering Journal, pp. 311-324, March 2024.
Published
2024-06-27
How to Cite
Salsabila, L., Dwi Pangga, M., Yasser, S., Riyani, N., Aminah, S., & Wahyunengsih, W. (2024). Application of Naïve Bayes Algorithm for Diabetes Prediction. UJMC (Unisda Journal of Mathematics and Computer Science), 10(1), 56 - 68. https://doi.org/https://doi.org/10.52166/ujmc.v10i1.6886