Language

English

Publication Date

11-17-2023

Journal

BMC Medical Informatics and Decision Making

DOI

10.1186/s12911-023-02361-7

PMID

37978498

PMCID

PMC10656827

PubMedCentral® Posted Date

November 2023

PubMedCentral® Full Text Version

Post-print

Abstract

BACKGROUND: Child abuse and neglect (CAN) is prevalent, associated with long-term adversities, and often undetected. Primary care settings offer a unique opportunity to identify CAN and facilitate referrals, when warranted. Electronic health records (EHR) contain extensive information to support healthcare decisions, yet time constraints preclude most providers from thorough EHR reviews that could indicate CAN. Strategies that summarize EHR data to identify CAN and convey this to providers have the potential to mitigate CAN-related sequelae. This study used expert review/consensus and Natural Language Processing (NLP) to develop and test a lexicon to characterize children who have experienced or are at risk for CAN and compared machine learning methods to the lexicon + NLP approach to determine the algorithm's performance for identifying CAN.

METHODS: Study investigators identified 90 CAN terms and invited an interdisciplinary group of child abuse experts for review and validation. We then used NLP to develop pipelines to finalize the CAN lexicon. Data for pipeline development and refinement were drawn from a randomly selected sample of EHR from patients seen at pediatric primary care clinics within a U.S. academic health center. To explore a machine learning approach for CAN identification, we used Support Vector Machine algorithms.

RESULTS: The investigator-generated list of 90 CAN terms was reviewed and validated by 25 invited experts, resulting in a final pool of 133 terms. NLP utilized a randomly selected sample of 14,393 clinical notes from 153 patients to test the lexicon, and 0.03% of notes were identified as CAN positive. CAN identification varied by clinical note type, with few differences found by provider type (physicians versus nurses, social workers, etc.). An evaluation of the final NLP pipelines indicated 93.8% positive CAN rate for the training set and 71.4% for the test set, with decreased precision attributed primarily to false positives. For the machine learning approach, SVM pipeline performance was 92% for CAN+ and 100% for non-CAN, indicating higher sensitivity than specificity.

CONCLUSIONS: The NLP algorithm's development and refinement suggest that innovative tools can identify youth at risk for CAN. The next key step is to refine the NLP algorithm to eventually funnel this information to care providers to guide clinical decision making.

Keywords

Automated clinical summaries, Child abuse and neglect, Screening, Primary care, Pediatrics

Published Open-Access

yes

Recommended Citation

Hanson, Rochelle F; Zhu, Vivienne; Are, Funlola; et al., "Initial Development of Tools To Identify Child Abuse and Neglect In Pediatric Primary Care" (2023). Faculty, Staff and Student Publications. 2467.
https://digitalcommons.library.tmc.edu/uthmed_docs/2467

Included in

Internal Medicine Commons, Medical Sciences Commons, Mental and Social Health Commons, Psychiatry and Psychology Commons, Psychology Commons

Faculty, Staff and Student Publications

Initial Development of Tools To Identify Child Abuse and Neglect In Pediatric Primary Care
