Faculty, Staff and Student Publications

Use GPT-J Prompt Generation With RoBERTa For NER Models On Diagnosis Extraction of Periodontal Diagnosis From Electronic Dental Records

Language

English

Publication Date

1-1-2023

Journal

AMIA Annual Symposium Proceedings

PMID

38222409

PMCID

PMC10785852

PubMedCentral® Posted Date

January 2024

PubMedCentral® Full Text Version

Post-print

Abstract

This study explored the usability of prompt generation for named entity recognition (NER) tasks and how performance varies across prompt settings. Prompt generation with GPT-J models was used both to test directly against the gold standard and to generate seed data, which was then fed to a RoBERTa model through the spaCy package. In the direct test, a lower ratio of negative examples combined with a higher number of examples in the prompt achieved the best results, with an F1 score of 0.72. After training with the RoBERTa model, performance was consistent across all settings, with F1 scores of 0.92-0.97. The study highlighted the importance of seed quality rather than quantity in feeding NER models. This research reports an efficient and accurate way to mine clinical notes for periodontal diagnoses, allowing researchers to build an NER model easily and quickly with the prompt generation approach.
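
The abstract names two prompt settings, the number of examples and the ratio of negative examples. The following is a minimal Python sketch of how a few-shot prompt could be assembled with those two settings varied. It is not the authors' implementation: the prompt template, the `build_prompt` function, and the example sentences are all hypothetical, and the call to GPT-J itself is left out.

```python
import random
from typing import List, Optional, Tuple

# Hypothetical example pool: (sentence, diagnosis span or None for a negative example).
Example = Tuple[str, Optional[str]]

POSITIVES: List[Example] = [
    ("Dx: generalized moderate chronic periodontitis.", "generalized moderate chronic periodontitis"),
    ("Assessment shows localized severe periodontitis.", "localized severe periodontitis"),
    ("Patient has plaque-induced gingivitis.", "plaque-induced gingivitis"),
]
NEGATIVES: List[Example] = [
    ("Patient presents for routine cleaning.", None),
    ("Reviewed medical history with patient.", None),
]


def build_prompt(
    positives: List[Example],
    negatives: List[Example],
    n_examples: int,
    negative_ratio: float,
    query: str,
    seed: int = 0,
) -> str:
    """Assemble a few-shot prompt with a chosen share of negative examples."""
    if not 0.0 <= negative_ratio <= 1.0:
        raise ValueError("negative_ratio must be between 0 and 1")
    n_neg = round(n_examples * negative_ratio)
    n_pos = n_examples - n_neg

    # Sample without replacement, then shuffle so negatives are not grouped at the end.
    rng = random.Random(seed)
    chosen = rng.sample(positives, n_pos) + rng.sample(negatives, n_neg)
    rng.shuffle(chosen)

    # Assumed template: each shot is a sentence followed by its diagnosis (or "none").
    blocks = [f"Sentence: {text}\nDiagnosis: {label or 'none'}" for text, label in chosen]
    # The query is left open so the language model completes the diagnosis.
    blocks.append(f"Sentence: {query}\nDiagnosis:")
    return "\n\n".join(blocks)


if __name__ == "__main__":
    print(build_prompt(POSITIVES, NEGATIVES, n_examples=4, negative_ratio=0.25,
                       query="Exam consistent with generalized severe periodontitis."))
```

Under these assumptions, a sweep over `n_examples` and `negative_ratio` would produce one prompt per setting, which could then be sent to a text-generation model and scored against a gold standard.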

Keywords

Humans, Dental Records, Natural Language Processing

Published Open-Access

yes

Recommended Citation

Chuang, Yao-Shun; Jiang, Xiaoqian; Lee, Chun-Teh; et al., "Use GPT-J Prompt Generation With RoBERTa For NER Models On Diagnosis Extraction of Periodontal Diagnosis From Electronic Dental Records" (2023). Faculty, Staff and Student Publications. 297.
https://digitalcommons.library.tmc.edu/uthshis_docs/297

Included in

Bioinformatics Commons, Biomedical Informatics Commons, Data Science Commons, Dentistry Commons, Medical Sciences Commons
