Predicting the Risk of Hypertension in Adult Patients Using the Random Forest Algorithm

Authors

  • Santinah Faletehan University
  • Dede Brahma Arianto Faletehan University

DOI:

https://doi.org/10.61536/ambidextrous.v4i02.497

Keywords:

hypertension, risk prediction, random forest, machine learning, health

Abstract

Hypertension remains a persistent and widespread health problem in the adult population, yet many cases go undetected due to limited early symptoms and reliance on conventional clinical assessment. This study aims to develop and evaluate a hypertension risk prediction model in adult patients using the Random Forest algorithm. This study employed a quantitative approach with an exploratory–predictive study design based on electronic secondary data, with a descriptive–analytical framework utilizing data mining techniques. The study population comprised all adult patients registered at selected healthcare facilities, while the sample consisted of 120 adult patients selected by purposive sampling from the hypertension risk dataset on Kaggle. The instrument used was a structured electronic medical record table, including age, gender, body mass index (BMI), blood pressure, and relevant medical history. The data underwent preprocessing and encoding, then were analyzed using the Random Forest algorithm on the Python platform with the scikitlearn library. Model performance was evaluated using accuracy, precision, recall, and F1score metrics. The results showed that the Random Forest model provided an accuracy of 87.5%, precision of 91.7%, recall of 84.6%, and F1 score of 88.0%, indicating a strong hypertension risk classification capability. The study concluded that Random Forest can be utilized as a reliable decision support system for early detection of hypertension risk in adult populations, especially when integrated with electronic medical records.

Downloads

Download data is not yet available.

References

Ahmed, S., Hasan, R., & Islam, M. R. (2021). Performance evaluation of Random Forest for medical prediction systems. IEEE Access, 9, 102345–102356. https://doi.org/10.1109/ACCESS.2021.3087654

Benjamin, E. J., Muntner, P., Alonso, A., Bittencourt, M. S., Callaway, C. W., Carson, A. P., et al. (2023). Heart disease and stroke statistics—2023 update. Circulation, 147(8), e93–e621. https://doi.org/10.1161/CIR.0000000000001123

Breiman, L. (2020). Random Forests. Machine Learning, 45(1), 5–32. https://doi.org/10.1023/A:1010933404324

Chen, T., & Guestrin, C. (2022). XGBoost: A scalable tree boosting system. Communications of the ACM, 65(1), 1–10. https://doi.org/10.1145/3152073

Emzir. (2022). Metodologi penelitian pendidikan kuantitatif dan kualitatif. Rajawali Pers.

Han, J., Kamber, M., & Pei, J. (2021). Data mining: Concepts and techniques (3rd ed.). Morgan Kaufmann.

Hidayat, A., Setiawan, A., & Prasetyo, B. (2023). Genetic and psychosocial factors influencing hypertension risk: A machine learning approach. Journal of Biomedical Informatics, 138, 104295. https://doi.org/10.1016/j.jbi.2023.104295

Kastorini, C. M., Milionis, H. J., Goudevenos, J. A., & Panagiotakos, D. B. (2021). Hypertension risk prediction model based on clinical and lifestyle factors. European Journal of Preventive Cardiology, 28(15), 1721–1729. https://doi.org/10.1093/eurjpc/zwaa139

Kementerian Kesehatan Republik Indonesia (Kemenkes RI). (2024). Laporan Survei Kesehatan Indonesia (SKI) 2023. Kementerian Kesehatan Republik Indonesia.

Liaw, A., & Wiener, M. (2021). Classification and regression by Random Forest. R News, 2(3), 18–22.

Mills, K. T., Stefanescu, A., & He, J. (2021). The global epidemiology of hypertension. The Lancet, 398(10304), 987–998. https://doi.org/10.1016/S0140-6736(21)01101-4

Panday, A. (2023). Hypertension risk prediction dataset [Data set]. Kaggle. https://www.kaggle.com/datasets/ankushpanday1/hypertension-risk-prediction-dataset

Pal, S. K., & Mitra, S. (2022). Application of Random Forest in medical data analysis. Journal of Medical Systems, 46(1), 1–12. https://doi.org/10.1007/s10916-021-01787-2

Putri, F., & Arianto, D. B. (2024). Perbandingan performa Random Forest dan Gradient Boosting dalam prediksi pada dataset Customer Shopping Trends. Kohesi: Jurnal Sains dan Teknologi, 5(11), 1–10.

Rahman, M. H., Yusuf, S., & Amin, M. R. (2023). Random Forest based model for hypertension risk prediction. BMC Medical Informatics and Decision Making, 23(1), 1–11. https://doi.org/10.1186/s12911-023-02325-1

Rahmawati, S., Suryani, N., & Wulandari, R. (2021). Prevalence and determinants of hypertension among young adults: A cross sectional study. Journal of Human Hypertension, 35(1), 45–52. https://doi.org/10.1038/s41371-020-0367-2

Rahmawati, S., Suryani, N., & Wulandari, R. (2022). Risk factor clustering and machine learning based prediction of hypertension in young adults. Journal of Clinical Medicine, 11(14), 4012. https://doi.org/10.3390/jcm11144012

Roth, G. A., Mensah, G. A., Johnson, C. O., Hripcsak, G., Tleyjeh, I. M., & Hillis, S. D. (2023). Global burden of cardiovascular diseases and risk factors, 1990–2020. Journal of the American College of Cardiology, 81(2), 121–143. https://doi.org/10.1016/j.jacc.2022.11.008

Shanthamallu, S., Little, L. L., & Forzley, S. (2021). Preprocessing and encoding techniques for healthcare data in machine learning. Journal of Biomedical Informatics, 115, 103–115. https://doi.org/10.1016/j.jbi.2021.103678

Singh, A. K., Shankar, S., Singh, R., Whittle, R., Kapoor, S., & Singh, S. (2021). Machine learning approaches for chronic disease prediction. Healthcare Informatics Research, 27(2), 102–110. https://doi.org/10.4258/hir.2021.27.2.102

Suwandy, S., Arief, M., & Wijaya, D. (2022). Multivariate risk factor modeling of hypertension using cross sectional data in Indonesia. International Journal of Environmental Research and Public Health, 19(10), 5932. https://doi.org/10.3390/ijerph19105932

Sudaryono. (2021). Metodologi penelitian kuantitatif dan kualitatif: Teori dan aplikasi. Deepublish.

Sugiyono. (2023). Metode penelitian kuantitatif, kualitatif, dan R&D. Alfabeta.

Syahdi, R. R., Sari, D. P., & Wijaya, A. (2023). Lifestyle related determinants of hypertension among adults in Indonesia. Journal of Epidemiology and Global Health, 13(2), 89–97. https://doi.org/10.2991/jegh.kh.23.0024

Syahdi, R. R., Sari, D. P., & Wijaya, A. (2024). Application of data mining for early detection of hypertension in adults. Journal of Medical Systems, 48(1), 1–10. https://doi.org/10.1007/s10916-023-02025-9

World Health Organization (WHO). (2023). Hypertension. World Health Organization. https://www.who.int/news-room/fact-sheets/detail/hypertension

Published

2026-05-08

How to Cite

Santinah, & Dede Brahma Arianto. (2026). Predicting the Risk of Hypertension in Adult Patients Using the Random Forest Algorithm. Ambidextrous Journal of Innovation Efficiency and Technology in Organization, 4(02), 104–113. https://doi.org/10.61536/ambidextrous.v4i02.497

Similar Articles

<< < 1 2 

You may also start an advanced similarity search for this article.