Language

English

Publication Date

1-28-2025

Journal

Cancers

DOI

10.3390/cancers17030445

PMID

39941811

PMCID

PMC11815886

PubMedCentral® Posted Date

1-28-2025

PubMedCentral® Full Text Version

Post-print

Abstract

Background: Previous studies have described sex-specific patient subtyping in glioblastoma. The cluster labels associated with these "legacy data" were used to train a predictive model capable of recapitulating this clustering in contemporary contexts.

Methods: We used robust ensemble machine learning to train a model using gene microarray data to perform multi-platform predictions including RNA-seq and potentially scRNA-seq.

Results: The engineered feature set was composed of many previously reported genes that are associated with patient prognosis. Interestingly, these well-known genes formed a predictive signature only for female patients, and the application of the predictive signature to male patients produced unexpected results.

Conclusions: This work demonstrates how annotated "legacy data" can be used to build robust predictive models capable of multi-target predictions across multiple platforms.

Keywords

clustering, disease subtyping, machine learning, glioblastoma multiforme, GBM, feature engineering, gene expression signatures, female

Published Open-Access

yes

Share

COinS
 
 

To view the content in your browser, please download Adobe Reader or, alternately,
you may Download the file to your hard drive.

NOTE: The latest versions of Adobe Reader do not support viewing PDF files within Firefox on Mac OS and if you are using a modern (Intel) Mac, there is no official plugin for viewing PDF files within the browser window.