Predicting the Risk of Hypertension in Adult Patients Using the Random Forest Algorithm
DOI:
https://doi.org/10.61536/ambidextrous.v4i02.497Keywords:
hypertension, risk prediction, random forest, machine learning, healthAbstract
Hypertension remains a persistent and widespread health problem in the adult population, yet many cases go undetected due to limited early symptoms and reliance on conventional clinical assessment. This study aims to develop and evaluate a hypertension risk prediction model in adult patients using the Random Forest algorithm. This study employed a quantitative approach with an exploratory–predictive study design based on electronic secondary data, with a descriptive–analytical framework utilizing data mining techniques. The study population comprised all adult patients registered at selected healthcare facilities, while the sample consisted of 120 adult patients selected by purposive sampling from the hypertension risk dataset on Kaggle. The instrument used was a structured electronic medical record table, including age, gender, body mass index (BMI), blood pressure, and relevant medical history. The data underwent preprocessing and encoding, then were analyzed using the Random Forest algorithm on the Python platform with the scikitlearn library. Model performance was evaluated using accuracy, precision, recall, and F1score metrics. The results showed that the Random Forest model provided an accuracy of 87.5%, precision of 91.7%, recall of 84.6%, and F1 score of 88.0%, indicating a strong hypertension risk classification capability. The study concluded that Random Forest can be utilized as a reliable decision support system for early detection of hypertension risk in adult populations, especially when integrated with electronic medical records.
Downloads
References
Ahmed, S., Hasan, R., & Islam, M. R. (2021). Performance evaluation of Random Forest for medical prediction systems. IEEE Access, 9, 102345–102356. https://doi.org/10.1109/ACCESS.2021.3087654
Benjamin, E. J., Muntner, P., Alonso, A., Bittencourt, M. S., Callaway, C. W., Carson, A. P., et al. (2023). Heart disease and stroke statistics—2023 update. Circulation, 147(8), e93–e621. https://doi.org/10.1161/CIR.0000000000001123
Breiman, L. (2020). Random Forests. Machine Learning, 45(1), 5–32. https://doi.org/10.1023/A:1010933404324
Chen, T., & Guestrin, C. (2022). XGBoost: A scalable tree boosting system. Communications of the ACM, 65(1), 1–10. https://doi.org/10.1145/3152073
Emzir. (2022). Metodologi penelitian pendidikan kuantitatif dan kualitatif. Rajawali Pers.
Han, J., Kamber, M., & Pei, J. (2021). Data mining: Concepts and techniques (3rd ed.). Morgan Kaufmann.
Hidayat, A., Setiawan, A., & Prasetyo, B. (2023). Genetic and psychosocial factors influencing hypertension risk: A machine learning approach. Journal of Biomedical Informatics, 138, 104295. https://doi.org/10.1016/j.jbi.2023.104295
Kastorini, C. M., Milionis, H. J., Goudevenos, J. A., & Panagiotakos, D. B. (2021). Hypertension risk prediction model based on clinical and lifestyle factors. European Journal of Preventive Cardiology, 28(15), 1721–1729. https://doi.org/10.1093/eurjpc/zwaa139
Kementerian Kesehatan Republik Indonesia (Kemenkes RI). (2024). Laporan Survei Kesehatan Indonesia (SKI) 2023. Kementerian Kesehatan Republik Indonesia.
Liaw, A., & Wiener, M. (2021). Classification and regression by Random Forest. R News, 2(3), 18–22.
Mills, K. T., Stefanescu, A., & He, J. (2021). The global epidemiology of hypertension. The Lancet, 398(10304), 987–998. https://doi.org/10.1016/S0140-6736(21)01101-4
Panday, A. (2023). Hypertension risk prediction dataset [Data set]. Kaggle. https://www.kaggle.com/datasets/ankushpanday1/hypertension-risk-prediction-dataset
Pal, S. K., & Mitra, S. (2022). Application of Random Forest in medical data analysis. Journal of Medical Systems, 46(1), 1–12. https://doi.org/10.1007/s10916-021-01787-2
Putri, F., & Arianto, D. B. (2024). Perbandingan performa Random Forest dan Gradient Boosting dalam prediksi pada dataset Customer Shopping Trends. Kohesi: Jurnal Sains dan Teknologi, 5(11), 1–10.
Rahman, M. H., Yusuf, S., & Amin, M. R. (2023). Random Forest based model for hypertension risk prediction. BMC Medical Informatics and Decision Making, 23(1), 1–11. https://doi.org/10.1186/s12911-023-02325-1
Rahmawati, S., Suryani, N., & Wulandari, R. (2021). Prevalence and determinants of hypertension among young adults: A cross sectional study. Journal of Human Hypertension, 35(1), 45–52. https://doi.org/10.1038/s41371-020-0367-2
Rahmawati, S., Suryani, N., & Wulandari, R. (2022). Risk factor clustering and machine learning based prediction of hypertension in young adults. Journal of Clinical Medicine, 11(14), 4012. https://doi.org/10.3390/jcm11144012
Roth, G. A., Mensah, G. A., Johnson, C. O., Hripcsak, G., Tleyjeh, I. M., & Hillis, S. D. (2023). Global burden of cardiovascular diseases and risk factors, 1990–2020. Journal of the American College of Cardiology, 81(2), 121–143. https://doi.org/10.1016/j.jacc.2022.11.008
Shanthamallu, S., Little, L. L., & Forzley, S. (2021). Preprocessing and encoding techniques for healthcare data in machine learning. Journal of Biomedical Informatics, 115, 103–115. https://doi.org/10.1016/j.jbi.2021.103678
Singh, A. K., Shankar, S., Singh, R., Whittle, R., Kapoor, S., & Singh, S. (2021). Machine learning approaches for chronic disease prediction. Healthcare Informatics Research, 27(2), 102–110. https://doi.org/10.4258/hir.2021.27.2.102
Suwandy, S., Arief, M., & Wijaya, D. (2022). Multivariate risk factor modeling of hypertension using cross sectional data in Indonesia. International Journal of Environmental Research and Public Health, 19(10), 5932. https://doi.org/10.3390/ijerph19105932
Sudaryono. (2021). Metodologi penelitian kuantitatif dan kualitatif: Teori dan aplikasi. Deepublish.
Sugiyono. (2023). Metode penelitian kuantitatif, kualitatif, dan R&D. Alfabeta.
Syahdi, R. R., Sari, D. P., & Wijaya, A. (2023). Lifestyle related determinants of hypertension among adults in Indonesia. Journal of Epidemiology and Global Health, 13(2), 89–97. https://doi.org/10.2991/jegh.kh.23.0024
Syahdi, R. R., Sari, D. P., & Wijaya, A. (2024). Application of data mining for early detection of hypertension in adults. Journal of Medical Systems, 48(1), 1–10. https://doi.org/10.1007/s10916-023-02025-9
World Health Organization (WHO). (2023). Hypertension. World Health Organization. https://www.who.int/news-room/fact-sheets/detail/hypertension
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Santinah, Dede Brahma Arianto

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.










