Faculty, Staff and Student Publications

Deep Learning In Clinical Natural Language Processing: A Methodical Review

Stephen Wu, University of Texas Health Science Center at Houston, School of Health Information Sciences, Houston TX, USA
Kirk Roberts
Surabhi Datta
Jingcheng Du
Zongcheng Ji
Yuqi Si
Sarvesh Soni
Qiong Wang
Qiang Wei
Yang Xiang
Bo Zhao
Hua Xu

Publication Date

3-1-2020

Journal

Journal of the American Medical Informatics Association

Abstract

OBJECTIVE: This article methodically reviews the literature on deep learning (DL) for natural language processing (NLP) in the clinical domain, providing quantitative analysis to answer 3 research questions concerning methods, scope, and context of current research.

MATERIALS AND METHODS: We searched MEDLINE, EMBASE, Scopus, the Association for Computing Machinery Digital Library, and the Association for Computational Linguistics Anthology for articles using DL-based approaches to NLP problems in electronic health records. After screening 1,737 articles, we collected data on 25 variables across 212 papers.

RESULTS: DL in clinical NLP publications more than doubled each year, through 2018. Recurrent neural networks (60.8%) and word2vec embeddings (74.1%) were the most popular methods; the information extraction tasks of text classification, named entity recognition, and relation extraction were dominant (89.2%). However, there was a "long tail" of other methods and specific tasks. Most contributions were methodological variants or applications, but 20.8% were new methods of some kind. The earliest adopters were in the NLP community, but the medical informatics community was the most prolific.

DISCUSSION: Our analysis shows growing acceptance of deep learning as a baseline for NLP research, and of DL-based NLP in the medical community. A number of common associations were substantiated (eg, the preference of recurrent neural networks for sequence-labeling named entity recognition), while others were surprisingly nuanced (eg, the scarcity of French language clinical NLP with deep learning).

CONCLUSION: Deep learning has not yet fully penetrated clinical NLP and is growing rapidly. This review highlighted both the popular and unique trends in this active field.

Keywords

Bibliometrics, Deep Learning, Electronic Health Records, Humans, Natural Language Processing

DOI

10.1093/jamia/ocz200

PMID

31794016

PMCID

PMC7025365

PubMedCentral® Posted Date

June 2023

PubMedCentral® Full Text Version

Post-Print

Published Open-Access

yes

Download

Included in

Bioinformatics Commons, Biomedical Informatics Commons, Data Science Commons

COinS

Faculty, Staff and Student Publications

Deep Learning In Clinical Natural Language Processing: A Methodical Review

Publication Date

Journal

Abstract

Keywords

DOI

PMID

PMCID

PubMedCentral® Posted Date

PubMedCentral® Full Text Version

Published Open-Access

Included in

Search

Browse

Author Corner

More Info

Library

Faculty, Staff and Student Publications

Deep Learning In Clinical Natural Language Processing: A Methodical Review

Authors

Publication Date

Journal

Abstract

Keywords

DOI

PMID

PMCID

PubMedCentral® Posted Date

PubMedCentral® Full Text Version

Published Open-Access

Included in

Share

Search

Browse

Author Corner

More Info

Library