Faculty, Staff and Student Publications
Language
English
Publication Date
2-1-2026
Journal
PLOS Computational Biology
DOI
10.1371/journal.pcbi.1013956
PMID
41671295
PMCID
PMC12912703
PubMedCentral® Posted Date
2-11-2026
PubMedCentral® Full Text Version
Post-print
Abstract
Spatial transcriptomics (ST) provides unprecedented insights into gene expression patterns while retaining spatial context, making it a valuable tool for understanding complex tissue architectures, such as those found in cancers. Seurat, by far the most popular tool for analyzing ST data, uses the Wilcoxon rank-sum test by default for differential expression analysis. However, as a nonparametric method that disregards spatial correlations, the Wilcoxon test can lead to inflated false positive rates and misleading findings. This limitation highlights the need for a more robust statistical approach that effectively incorporates spatial correlations. To this end, we propose a Generalized Estimating Equations (GEE) framework as a robust solution for differential gene expression analysis in ST. We conducted a comprehensive comparison of the GEE-based tests with existing methods, including the Wilcoxon rank-sum test and z-test. By appropriately accounting for spatial correlations, extensive simulations showed that the GEE test with robust standard error, referred to as the Independent GEE, demonstrated superior Type I error control and comparable power relative to other methods. Applications to ST datasets from breast and prostate cancer showed poor calibration of the p-values and potential false positive findings from the Wilcoxon rank-sum test. Our comparative study based on simulations and real data applications suggests that the Independent GEE test is well-suited for ST data, offering more accurate identification of biologically relevant gene expression changes and complementing the Wilcoxon rank-sum test. We have implemented the proposed method in R package "SpatialGEE", available on GitHub.
Keywords
Humans, Gene Expression Profiling, Transcriptome, Breast Neoplasms, Prostatic Neoplasms, Computational Biology, Male, Female, Computer Simulation, Algorithms, Models, Statistical
Published Open-Access
yes
Recommended Citation
Wang, Yishan; Zang, Chenxuan; Li, Ziyi; et al., "A Comparative Study of Statistical Methods for Identifying Differentially Expressed Genes in Spatial Transcriptomics" (2026). Faculty, Staff and Student Publications. 6098.
https://digitalcommons.library.tmc.edu/uthgsbs_docs/6098
Included in
Bioinformatics Commons, Biomedical Informatics Commons, Genetic Phenomena Commons, Medical Genetics Commons, Oncology Commons