Student and Faculty Publications
Publication Date
12-20-2022
Journal
BMC Genomics
Abstract
BACKGROUND: RNA-sequencing has become a standard tool for analyzing gene activity in bulk samples and at the single-cell level. By increasing sample sizes and cell counts, this technique can uncover substantial information about cellular transcriptional states. Beyond quantification of gene expression, RNA-seq can be used for detecting variants, including single nucleotide polymorphisms, small insertions/deletions, and larger variants, such as copy number variants. Notably, joint analysis of variants with cellular transcriptional states may provide insights into the impact of mutations, especially for complex and heterogeneous samples. However, this analysis is often challenging due to a prohibitively high number of variants and cells, which are difficult to summarize and visualize. Further, there is a dearth of methods that assess and summarize the association between detected variants and cellular transcriptional states.
RESULTS: Here, we introduce XCVATR (eXpressed Clusters of Variant Alleles in Transcriptome pRofiles), a method that identifies variants and detects local enrichment of expressed variants within embedding of samples and cells in single-cell and bulk RNA-seq datasets. XCVATR visualizes local "clumps" of small and large-scale variants and searches for patterns of association between each variant and cellular states, as described by the coordinates of cell embedding, which can be computed independently using any type of distance metrics, such as principal component analysis or t-distributed stochastic neighbor embedding. Through simulations and analysis of real datasets, we demonstrate that XCVATR can detect enrichment of expressed variants and provide insight into the transcriptional states of cells and samples. We next sequenced 2 new single cell RNA-seq tumor samples and applied XCVATR. XCVATR revealed subtle differences in CNV impact on tumors.
CONCLUSIONS: XCVATR is publicly available to download from https://github.com/harmancilab/XCVATR .
Keywords
Gene Expression Profiling, High-Throughput Nucleotide Sequencing, Transcriptome, RNA-Seq, Sequence Analysis, RNA, RNA, Single-Cell Analysis
Included in
Bioinformatics Commons, Biomedical Informatics Commons, Genetic Structures Commons, Genomics Commons, Medical Genetics Commons, Oncology Commons
Comments
Supplementary Materials
PMID: 36539717