Journal Articles

Publication Date



AMIA Joint Summits on Translational Science Proceedings


Phenotyping for Type 2 Diabetes (T2DM) is needed due to the increasing demand for T2DM research on electronic health records (EHRs). eMERGE is a reliable and interpretable rule-based algorithm for the identification of T2DM cases and controls in EHRs. MIMIC-IV, an extension of MIMIC-III, contains more than 520,000 hospital admissions and has become a valuable EHR database for secondary medical research. However, there was no prior work to extract T2DM cases and controls from MIMIC-IV, which requires a comprehensive knowledge of the database. Our work provided insight into the structure and data elements in MIMIC-IV and adapted eMERGE to accomplish the task. The results included MIMIC-IV's data tables and elements used, 12,735 cases and 9,828 controls of T2DM, and summary statistics of the cohorts in comparison with those on other EHR databases. They could be used for the development of statistical and machine learning models in future studies about the disease.



To view the content in your browser, please download Adobe Reader or, alternately,
you may Download the file to your hard drive.

NOTE: The latest versions of Adobe Reader do not support viewing PDF files within Firefox on Mac OS and if you are using a modern (Intel) Mac, there is no official plugin for viewing PDF files within the browser window.