Publication Date
9-1-2021
Journal
Chest
DOI
10.1016/j.chest.2021.04.048
PMID
33932466
PMCID
PMC8727846
PubMedCentral® Posted Date
4-28-2021
PubMedCentral® Full Text Version
Post-print
Published Open-Access
yes
Keywords
Biopsy, Fine-Needle, Bronchoscopy, Calibration, Carcinoma, Non-Small-Cell Lung, Endosonography, Female, Humans, Image-Guided Biopsy, Lung Neoplasms, Lymphatic Metastasis, Male, Mediastinum, Middle Aged, Neoplasm Staging, Patient Selection, Predictive Value of Tests, Prognosis, United States, endobronchial ultrasound, lung cancer, lung cancer staging, mediastinal adenopathy
Abstract
BACKGROUND: Two models, the Help with the Assessment of Adenopathy in Lung cancer (HAL) and Help with Oncologic Mediastinal Evaluation for Radiation (HOMER), were recently developed to estimate the probability of nodal disease in patients with non-small cell lung cancer (NSCLC) as determined by endobronchial ultrasound-transbronchial needle aspiration (EBUS-TBNA). The objective of this study was to prospectively externally validate both models at multiple centers.
RESEARCH QUESTION: Are the HAL and HOMER models valid across multiple centers?
STUDY DESIGN AND METHODS: This multicenter prospective observational cohort study enrolled consecutive patients with PET-CT clinical-radiographic stages T1-3, N0-3, M0 NSCLC undergoing EBUS-TBNA staging. HOMER was used to predict the probability of N0 vs N1 vs N2 or N3 (N2|3) disease, and HAL was used to predict the probability of N2|3 (vs N0 or N1) disease. Model discrimination was assessed using the area under the receiver operating characteristics curve (ROC-AUC), and calibration was assessed using the Brier score, calibration plots, and the Hosmer-Lemeshow test.
RESULTS: Thirteen centers enrolled 1,799 patients. HAL and HOMER demonstrated good discrimination: HAL ROC-AUC = 0.873 (95%CI, 0.856-0.891) and HOMER ROC-AUC = 0.837 (95%CI, 0.814-0.859) for predicting N1 disease or higher (N1|2|3) and 0.876 (95%CI, 0.855-0.897) for predicting N2|3 disease. Brier scores were 0.117 and 0.349, respectively. Calibration plots demonstrated good calibration for both models. For HAL, the difference between forecast and observed probability of N2|3 disease was +0.012; for HOMER, the difference for N1|2|3 was -0.018 and for N2|3 was +0.002. The Hosmer-Lemeshow test was significant for both models (P = .034 and .002), indicating a small but statistically significant calibration error.
INTERPRETATION: HAL and HOMER demonstrated good discrimination and calibration in multiple centers. Although calibration error was present, the magnitude of the error is small, such that the models are informative.
Included in
Medical Sciences Commons, Neoplasms Commons, Oncology Commons, Pulmonology Commons, Respiratory Tract Diseases Commons
Comments
Associated Data