Analysis and Application of the K-Means Clustering Algorithm to Identify Dominant Diseases Based on Patient Medical Record Data at Prima Melati Clinic

Elsa Ramadhani

Authors

Elsa Ramadhani Muhammadiyah University of North Sumatra

Keywords:

Dominant Diseases, ; K-Means Clustering, Medical Records

Abstract

Transformation Digital transformation in the healthcare sector necessitates optimal utilization of medical record data to facilitate more effective decision-making processes. Prima Melati Clinic continues to experience limitations in managing medical record data, which has not undergone systematic analysis to distinguish prevailing disease patterns. The purpose of this study is to analyze and apply the K-Means Clustering algorithm to identify dominant diseases based on patient medical record data at Prima Melati Clinic. The research methodology used is a quantitative approach that utilizes data mining techniques through the Knowledge Discovery in Database (KDD) stages, which include data preprocessing, application of the K-Means algorithm, and interpretation of clustering results. The dataset used consists of approximately 1000 patient medical records covering the period of January 2025 to May 2025. The data preprocessing phase includes data cleaning, missing value management, and data normalization using the StandardScaler technique. Determining the optimal number of clusters is achieved through the Elbow method, using the Sum of Squared Errors (SSE) calculation. The findings indicate that the K-Means algorithm with a cluster size of 3 (k) effectively categorizes patient data into three main clusters based on disease diagnostic characteristics classified by severity (mild, moderate, and severe). Each cluster reveals distinct dominant disease patterns, providing insight into disease distribution in relation to the severity of the patient's condition. The results of this analysis can be utilized by clinics in developing drug procurement strategies, scheduling medical personnel, and designing more targeted disease prevention strategies. Consequently, the implementation of the K-Means Clustering algorithm has demonstrated effectiveness in identifying dominant disease patterns and in strengthening data-driven decision-making at Prima Melati Clinic.

Downloads

Download data is not yet available.

References

Aljohani, N. R. (2024). Machine learning clustering techniques in healthcare analytics: A comprehensive review. Healthcare Analytics, 6, 100412.

Eken, S. (2020). A data mining approach for healthcare data classification and clustering. Journal of Healthcare Engineering, 2020, 1–12.

Fay, M. P., Smith, J. A., & Brown, T. R. (2023). Data visualization techniques for healthcare analytics and machine learning applications. Applied Sciences, 13(5), Article 2874. https://doi.org/10.3390/app13052874

Han, J., Kamber, M., & Pei, J. (2022). Data mining: Concepts and techniques (4th ed.). Morgan Kaufmann.

Kabir, M. A., Rahman, M. S., & Islam, M. T. (2024). Python-based machine learning applications in healthcare analytics: A systematic review. Healthcare Analytics, 5, 100321.

Kementerian Kesehatan Republik Indonesia. (2022). Peraturan Menteri Kesehatan Republik Indonesia Nomor 24 Tahun 2022 tentang rekam medis. Kementerian Kesehatan Republik Indonesia.

Kruse, C. S., Stein, A., Thomas, H., & Kaur, H. (2018). The use of electronic health records to support population health: A systematic review of the literature. Journal of Medical Systems, 42(11), Article 214. https://doi.org/10.1007/s10916-018-1075-6

Sharma, M., Singh, G., & Singh, R. (2021). Big data analytics in healthcare: A systematic literature review. Journal of Big Data, 8(1), 1–24.

Tsai, C. H., Eghdam, A., Davoody, N., Wright, G., Flowerday, S., & Koch, S. (2020). Effects of electronic health record implementation and barriers to adoption and use: A scoping review. JMIR Medical Informatics, 8(11), e19165. https://doi.org/10.2196/19165

World Health Organization. (2021). Global strategy on digital health 2020–2025. World Health Organization.

Analysis and Application of the K-Means Clustering Algorithm to Identify Dominant Diseases Based on Patient Medical Record Data at Prima Melati Clinic

Authors

Keywords:

Abstract

Downloads

References

Downloads

Published

How to Cite

Issue

Section

License

Similar Articles

Main Menu

RECOMENDED TOOL

JOURNAL TEMPLATE

Chat Us

Keywords

VISITORS

Make a Submission