Publication Date
8-14-2023
Journal
Cancer Cell
DOI
10.1016/j.ccell.2023.06.009
PMID
37582339
PMCID
PMC10506762
PubMedCentral® Posted Date
8-14-2024
PubMedCentral® Full Text Version
Author MSS
Published Open-Access
yes
Keywords
Humans, Proteogenomics, Proteomics, Genomics, Neoplasms, Gene Expression Profiling, pan-cancer, proteogenomics, data harmonization, multi-omics, open data, CPTAC
Abstract
The National Cancer Institute's Clinical Proteomic Tumor Analysis Consortium (CPTAC) investigates tumors from a proteogenomic perspective, creating rich multi-omics datasets connecting genomic aberrations to cancer phenotypes. To facilitate pan-cancer investigations, we have generated harmonized genomic, transcriptomic, proteomic, and clinical data for >1000 tumors in 10 cohorts to create a cohesive and powerful dataset for scientific discovery. We outline efforts by the CPTAC pan-cancer working group in data harmonization, data dissemination, and computational resources for aiding biological discoveries. We also discuss challenges for multi-omics data integration and analysis, specifically the unique challenges of working with both nucleotide sequencing and mass spectrometry proteomics data.
Included in
Biological Phenomena, Cell Phenomena, and Immunity Commons, Biomedical Informatics Commons, Genetics and Genomics Commons, Medical Genetics Commons, Medical Molecular Biology Commons, Neoplasms Commons, Oncology Commons