Faculty, Staff and Student Publications

An Open Natural Language Processing (NLP) Framework For Ehr-Based Clinical Research: A Case Demonstration Using The National Covid Cohort Collaborative (N3C)

Language

English

Publication Date

11-17-2023

Journal

Journal of the American Medical Informatics Association

DOI

10.1093/jamia/ocad134

PMID

37555837

PMCID

PMC10654844

PubMedCentral® Posted Date

August 2023

PubMedCentral® Full Text Version

Post-print

Abstract

Despite recent methodology advancements in clinical natural language processing (NLP), the adoption of clinical NLP models within the translational research community remains hindered by process heterogeneity and human factor variations. Concurrently, these factors also dramatically increase the difficulty in developing NLP models in multi-site settings, which is necessary for algorithm robustness and generalizability. Here, we reported on our experience developing an NLP solution for Coronavirus Disease 2019 (COVID-19) signs and symptom extraction in an open NLP framework from a subset of sites participating in the National COVID Cohort (N3C). We then empirically highlight the benefits of multi-site data for both symbolic and statistical methods, as well as highlight the need for federated annotation and evaluation to resolve several pitfalls encountered in the course of these efforts.

Keywords

Humans, Natural Language Processing, Electronic Health Records, COVID-19, Algorithms

Published Open-Access

yes

Recommended Citation

Liu, Sijia; Wen, Andrew; Wang, Liwei; et al., "An Open Natural Language Processing (NLP) Framework For Ehr-Based Clinical Research: A Case Demonstration Using The National Covid Cohort Collaborative (N3C)" (2023). Faculty, Staff and Student Publications. 165.
https://digitalcommons.library.tmc.edu/uthshis_docs/165

Download

Included in

Bioinformatics Commons, Biomedical Informatics Commons, Data Science Commons

COinS

Faculty, Staff and Student Publications

An Open Natural Language Processing (NLP) Framework For Ehr-Based Clinical Research: A Case Demonstration Using The National Covid Cohort Collaborative (N3C)

Language

Publication Date

Journal

DOI

PMID

PMCID

PubMedCentral® Posted Date

PubMedCentral® Full Text Version

Abstract

Keywords

Published Open-Access

Recommended Citation

Included in

Search

Browse

Author Corner

More Info

Library

Faculty, Staff and Student Publications

An Open Natural Language Processing (NLP) Framework For Ehr-Based Clinical Research: A Case Demonstration Using The National Covid Cohort Collaborative (N3C)

Authors

Language

Publication Date

Journal

DOI

PMID

PMCID

PubMedCentral® Posted Date

PubMedCentral® Full Text Version

Abstract

Keywords

Published Open-Access

Recommended Citation

Included in

Share

Search

Browse

Author Corner

More Info

Library