*Result*: Development of a risk assessment model for multimorbidity of diabetes, hypertension, and coronary heart disease with XGBoost in primary care in Shanghai, China: a retrospective study.
Am J Hypertens. 2017 Jan;30(1):42-50. (PMID: 27557861)
Int J Environ Res Public Health. 2021 Feb 09;18(4):. (PMID: 33572441)
Arch Gerontol Geriatr. 2018 Jan;74:133-140. (PMID: 29096228)
Health Inf Sci Syst. 2022 Dec 30;11(1):2. (PMID: 36593862)
J Affect Disord. 2022 Dec 15;319:260-266. (PMID: 36162655)
J Am Coll Cardiol. 2016 Feb 2;67(4):407-416. (PMID: 26821629)
Age Ageing. 2017 Nov 1;46(6):882-888. (PMID: 28985248)
Am J Hypertens. 1994 May;7(5):390-5. (PMID: 8060570)
Arthritis Care Res (Hoboken). 2023 Feb;75(2):220-230. (PMID: 35588095)
Artif Intell Med. 1999 May;16(1):3-23. (PMID: 10225344)
Diabetes Care. 2009 Mar;32(3):424-6. (PMID: 19074997)
Prev Med. 2020 Dec;141:106260. (PMID: 33017600)
J Am Heart Assoc. 2018 Nov 20;7(22):e010723. (PMID: 30571489)
Am J Epidemiol. 2012 Mar 15;175(6):584-93. (PMID: 22328705)
Int J Med Inform. 2023 Jul;175:105088. (PMID: 37156169)
Scand J Clin Lab Invest. 2013 Oct;73(7):569-75. (PMID: 24094290)
BMJ Open. 2019 Aug 30;9(8):e029594. (PMID: 31471439)
Clin Exp Hypertens. 2023 Dec 31;45(1):2224942. (PMID: 37337961)
Intern Med. 2014;53(7):655-60. (PMID: 24694472)
BMC Endocr Disord. 2022 Mar 8;22(1):59. (PMID: 35260113)
PLoS Med. 2018 May 21;15(5):e1002571. (PMID: 29782486)
BMC Med Inform Decis Mak. 2021 Jul 21;21(1):222. (PMID: 34289843)
Biomed Res Int. 2021 Jul 3;2021:6621785. (PMID: 34327233)
J Gerontol A Biol Sci Med Sci. 2015 Mar;70(3):332-8. (PMID: 25060316)
J Clin Epidemiol. 2002 Oct;55(10):1054-5. (PMID: 12464384)
BMC Cardiovasc Disord. 2014 Feb 25;14:26. (PMID: 24568132)
Sci Total Environ. 2023 Jan 10;855:158896. (PMID: 36150596)
Sci Rep. 2018 Feb 28;8(1):3806. (PMID: 29491441)
BMC Prim Care. 2022 Oct 3;23(1):258. (PMID: 36192706)
Circulation. 2004 Jul 6;110(1):32-5. (PMID: 15210602)
Clin Exp Hypertens. 2023 Dec 31;45(1):2271190. (PMID: 37983187)
Healthcare (Basel). 2020 Jul 31;8(3):. (PMID: 32751894)
Diabetologia. 2015 May;58(5):961-7. (PMID: 25680582)
Am J Epidemiol. 2014 Mar 15;179(6):684-91. (PMID: 24418684)
Front Public Health. 2023 Feb 01;11:1118628. (PMID: 36817881)
Clin Exp Hypertens. 2023 Dec 31;45(1):2277653. (PMID: 37939250)
Am J Epidemiol. 2018 Jan 1;187(1):103-112. (PMID: 29309518)
J Gerontol A Biol Sci Med Sci. 2019 Jul 12;74(8):1265-1270. (PMID: 30169580)
Front Big Data. 2020 Apr 30;3:15. (PMID: 33693389)
BMJ Open. 2023 Feb 9;13(2):e064275. (PMID: 36759029)
Math Biosci Eng. 2022 Feb 7;19(4):3597-3608. (PMID: 35341266)
Curr Geriatr Rep. 2018 Dec;7(4):201-209. (PMID: 30984516)
BMC Med. 2017 Aug 24;15(1):152. (PMID: 28835246)
BMJ Open. 2022 Jan 21;12(1):e049133. (PMID: 35063955)
Wellcome Open Res. 2020 Oct 8;5:233. (PMID: 33215050)
IEEE/ACM Trans Comput Biol Bioinform. 2020 Nov-Dec;17(6):2131-2140. (PMID: 30998478)
J Clin Med. 2020 Dec 10;9(12):. (PMID: 33321977)
PLoS One. 2022 Oct 7;17(10):e0275619. (PMID: 36206287)
Sci Rep. 2020 Oct 9;10(1):16879. (PMID: 33037233)
Lancet Public Health. 2017 May 19;2(6):e277-e285. (PMID: 28626830)
Front Cardiovasc Med. 2022 Sep 09;9:966217. (PMID: 36158847)
Public Health. 2022 Apr;205:63-71. (PMID: 35240390)
EClinicalMedicine. 2023 Feb 16;57:101860. (PMID: 36864977)
Sci Rep. 2023 Jan 12;13(1):672. (PMID: 36635398)
BMC Geriatr. 2018 Jan 16;18(1):16. (PMID: 29338690)
BMC Geriatr. 2018 Oct 4;18(1):236. (PMID: 30286727)
*Further Information*
*Background: Multimorbidity has emerged as a growing global health concern. Within its heterogeneous patterns, the cardiometabolic cluster is notably among the most common. Assessing the risk of such multimorbidity from a general practice perspective has become a priority in primary care. This study aimed to develop a comprehensive risk assessment model for the multimorbidity of diabetes, hypertension, and coronary heart disease among older adults in the community, utilizing large-scale data from Shanghai, China.
Methods: Retrospective data spanning 2017 to 2019 were collected from 40,261 residents across 47 community health centers. These data comprised residents' health records, health examination results, hospital information system (HIS) records, imaging databases, and lifestyle information. The XGBoost machine learning algorithm was utilized to construct a comprehensive risk assessment model for the multimorbidity of diabetes, hypertension, and coronary heart disease. Model performance was evaluated using the area under the receiver operating characteristic curve, accuracy, precision, recall, and the F1 Score.
Results: The dataset was split into training (80%) and testing (20%) sets. A total of 46 features were incorporated into the final comprehensive risk assessment model for the multimorbidity of diabetes, hypertension, and coronary heart disease. The optimal XGBoost model achieved a micro-average AUC of 0.822, a macro-average AUC of 0.795, and a weighted-average AUC of 0.784. These parameters demonstrate the high superiority of the constructed model.
Conclusions: The XGBoost-based risk assessment model for the multimorbidity of diabetes, hypertension, and coronary heart disease, integrated clinical and public health data from community residents. It identifies multidimensional predictors across four dimensions, underscoring its practical value in supporting integrated risk assessment and informing targeted health management strategies for individuals with multimorbidity.
Clinical Trial Number: Not applicable.
(© 2026. The Author(s).)*
*Declarations. Ethics approval and consent to participate: All methods were carried out in accordance with relevant guidelines and regulations. The research was conducted in accordance with the Declaration of Helsinki, with a continuous concern to protect the dignity, integrity and privacy of the participants, including the confidentiality of their data. The studies involving human participants were reviewed and approved by the Ethics Committee of Tongji University (ref: LL-2016-ZRKX-017). The projects adopted a centralised, project-level consent model. Under the premise of adhering to the initially agreed research scope, it was able to conduct large-scale participant recruitment in an ethical and efficient manner without the need to repeat the single consent process. All the patient data used in this study were strictly anonymized before analysis. Given the large scale and centralized nature of the data processing, it was really impossible to re-identify individuals. Participants provided their written informed consent before participating in this study. Consent for publication: Not applicable. Competing interests: The authors declare no competing interests.*