Faculty, Staff and Student Publications

Data Encoding For Healthcare Data Democratization and Information Leakage Prevention

Language

English

Publication Date

2-21-2024

Journal

Nature Communications

DOI

10.1038/s41467-024-45777-z

PMID

38383571

PMCID

PMC10882022

PubMedCentral® Posted Date

February 2024

PubMedCentral® Full Text Version

Post-print

Abstract

The lack of data democratization and information leakage from trained models hinder the development and acceptance of robust deep learning-based healthcare solutions. This paper argues that irreversible data encoding can provide an effective solution to achieve data democratization without violating the privacy constraints imposed on healthcare data and clinical models. An ideal encoding framework transforms the data into a new space where it is imperceptible to a manual or computational inspection. However, encoded data should preserve the semantics of the original data such that deep learning models can be trained effectively. This paper hypothesizes the characteristics of the desired encoding framework and then exploits random projections and random quantum encoding to realize this framework for dense and longitudinal or time-series data. Experimental evaluation highlights that models trained on encoded time-series data effectively uphold the information bottleneck principle and hence, exhibit lesser information leakage from trained models.

Keywords

Health care, Medical research

Published Open-Access

yes

Recommended Citation

Thakur, Anshul; Zhu, Tingting; Abrol, Vinayak; et al., "Data Encoding For Healthcare Data Democratization and Information Leakage Prevention" (2024). Faculty, Staff and Student Publications. 1912.
https://digitalcommons.library.tmc.edu/uthmed_docs/1912

Download

Included in

Laboratory Medicine Commons, Medical Pathology Commons

COinS

Faculty, Staff and Student Publications

Data Encoding For Healthcare Data Democratization and Information Leakage Prevention

Language

Publication Date

Journal

DOI

PMID

PMCID

PubMedCentral® Posted Date

PubMedCentral® Full Text Version

Abstract

Keywords

Published Open-Access

Recommended Citation

Included in

Search

Browse

Author Corner

More Info

Library

Faculty, Staff and Student Publications

Data Encoding For Healthcare Data Democratization and Information Leakage Prevention

Authors

Language

Publication Date

Journal

DOI

PMID

PMCID

PubMedCentral® Posted Date

PubMedCentral® Full Text Version

Abstract

Keywords

Published Open-Access

Recommended Citation

Included in

Share

Search

Browse

Author Corner

More Info

Library