Show simple item record

dc.contributor.authorLjubic, Branimir
dc.contributor.authorHai, Ameen Abdel
dc.contributor.authorStanojevic, Marija
dc.contributor.authorDiaz, Wilson
dc.contributor.authorPolimac, Daniel
dc.contributor.authorPavlovski, Martin
dc.contributor.authorObradovic, Zoran
dc.date.accessioned2022-06-06T11:54:11Z
dc.date.available2022-06-06T11:54:11Z
dc.date.issued2020-09-01
dc.identifier.citationLjubic, B., Hai, A. A., Stanojevic, M., Diaz, W., Polimac, D., Pavlovski, M., & Obradovic, Z. (2020). Predicting complications of diabetes mellitus using advanced machine learning algorithms. Journal of the American Medical Informatics Association, 27(9), 1343–1351. doi:10.1093/jamia/ocaa120
dc.identifier.issn1527-974X
dc.identifier.issn1067-5027
dc.identifier.doi10.1093/jamia/ocaa120
dc.identifier.urihttp://hdl.handle.net/10754/678696
dc.description.abstractObjective: We sought to predict if patients with type 2 diabetes mellitus (DM2) would develop 10 selected complications. Accurate prediction of complications could help with more targeted measures that would prevent or slow down their development. Materials and Methods: Experiments were conducted on the Healthcare Cost and Utilization Project State Inpatient Databases of California for the period of 2003 to 2011. Recurrent neural network (RNN) long short-term memory (LSTM) and RNN gated recurrent unit (GRU) deep learning methods were designed and compared with random forest and multilayer perceptron traditional models. Prediction accuracy of selected complications were compared on 3 settings corresponding to minimum number of hospitalizations between diabetes diagnosis and the diagnosis of complications. Results: The diagnosis domain was used for experiments. The best results were achieved with RNN GRU model, followed by RNN LSTM model. The prediction accuracy achieved with RNN GRU model was between 73% (myocardial infarction) and 83% (chronic ischemic heart disease), while accuracy of traditional models was between 66% – 76%. Discussion: The number of hospitalizations was an important factor for the prediction accuracy. Experiments with 4 hospitalizations achieved significantly better accuracy than with 2 hospitalizations. To achieve improved accuracy deep learning models required training on at least 1000 patients and accuracy significantly dropped if training datasets contained 500 patients. The prediction accuracy of complications decreases over time period. Considering individual complications, the best accuracy was achieved on depressive disorder and chronic ischemic heart disease. Conclusions: The RNN GRU model was the best choice for electronic medical record type of data, based on the achieved results.
dc.description.sponsorshipThis research was supported in part by the King Abdullah University of Science and Technology Center Partnership Fund and Pennsylvania Department of Health CURE Health Data Science Research Project.
dc.publisherOxford University Press (OUP)
dc.relation.urlhttps://academic.oup.com/jamia/article/27/9/1343/5899846
dc.subjectdiabetes mellitus
dc.subjectdiabetes mellitus complications
dc.subjectdeep learning
dc.subjectmachine learning
dc.subjectRNN models
dc.titlePredicting complications of diabetes mellitus using advanced machine learning algorithms
dc.typeArticle
dc.identifier.journalJOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION
dc.identifier.wosutWOS:000593113300002
dc.contributor.institutionCenter for Data Analytics and Biomedical Informatics, Temple University, Philadelphia, Pennsylvania, USA
dc.identifier.volume27
dc.identifier.issue9
dc.identifier.pages1343-1351
dc.identifier.eid2-s2.0-85091890855
kaust.acknowledged.supportUnitCenter Partnership Fund


This item appears in the following Collection(s)

Show simple item record