Faculty, Staff and Student Publications
Language
English
Publication Date
1-10-2026
Journal
npj Digital Medicine
DOI
10.1038/s41746-025-02271-0
PMID
41513815
PMCID
PMC12852785
PubMedCentral® Posted Date
1-10-2026
PubMedCentral® Full Text Version
Post-print
Abstract
Scattered between many healthcare providers across the US, Electronic Health Records (EHR) are extensively used for research purposes. Collaboration and sharing of EHRs between multiple institutions often provide access to more diverse datasets and a chance to conduct comprehensive studies. However, these collaboration efforts are usually hindered by privacy issues that render the pooling of such data at a centralized database impossible. Furthermore, EHRs are often incomplete and require statistical imputation prior to the study. To enable collaborative studies on top of incomplete, private EHRs, here we provide a provably secure solution built with secure multiparty computation (SMC) that provides practical runtimes and accuracy on par with the state-of-the-art, non-secure equivalents. Our solution enables the utilization of distributed datasets as a whole to impute the missing data and conduct collective studies between non-trusting private data proprietors. We demonstrate its effectiveness on various synthetic and real-world datasets, and show that our solution can significantly improve the classification of high-risk patient outcomes during ICU admission.
Keywords
atabases, Software, Statistical methods
Published Open-Access
yes
Recommended Citation
Smajlović, Haris; Lian, Yi; Long, Qi; et al., "Secure Distributed Multiple Imputation Enables Missing Data Inference for Private Data Proprietors" (2026). Faculty, Staff and Student Publications. 740.
https://digitalcommons.library.tmc.edu/uthshis_docs/740