Faculty, Staff and Student Publications
Language
English
Publication Date
8-1-2024
Journal
Database and Expert Systems Applications
DOI
10.1007/978-3-031-68309-1_20
PMID
39463781
PMCID
PMC11503500
PubMedCentral® Posted Date
8-18-2025
PubMedCentral® Full Text Version
Author MSS
Abstract
This study addresses the prevalent issue of missing data in patient-reported outcome datasets, particularly focusing on head and neck cancer patient symptom ratings sourced from the MD Anderson Symptom Inventory. Given that many data mining and machine learning algorithms necessitate complete datasets, the accurate imputation of missing data as an initial step becomes crucial. In this study we propose, for the first time, the use of collaborative filtering for imputing missing head and neck cancer patient symptom ratings. Two configurations of collaborative filtering, namely patient-based and symptom-based, leverage known ratings to infer the missing ones. Additionally, this study compares the performance of collaborative filtering with alternative imputation methods such as Multiple Imputation by Chained Equations, Nearest Neighbor Imputation, and Linear interpolation. Performance is compared using Root Mean Squared Error and Mean Absolute Error metrics. Findings demonstrate that collaborative filtering is a viable and comparatively superior approach for imputing missing patient symptom data.
Keywords
Head and Neck Cancer, Imputation, Collaborative Filtering
Published Open-Access
yes
Recommended Citation
Anyimadu, Eric Ababio; Fuller, Clifton David; Zhang, Xinhua; et al., "Collaborative Filtering for the Imputation of Patient Reported Outcomes" (2024). Faculty, Staff and Student Publications. 5669.
https://digitalcommons.library.tmc.edu/uthgsbs_docs/5669
Included in
Bioinformatics Commons, Biomedical Informatics Commons, Genetic Phenomena Commons, Medical Genetics Commons, Oncology Commons