Publication Date
8-1-2025
Journal
Annals of the Rheumatic Diseases
DOI
10.1016/j.ard.2025.05.009
PMID
40514331
PMCID
PMC12715722
PubMedCentral® Posted Date
12-20-2025
PubMedCentral® Full Text Version
Author MSS
Abstract
Objectives: Systemic sclerosis (SSc) is a complex autoimmune disease with both known and unidentified genetic contributors. While genome-wide association studies (GWAS) have implicated multiple loci, many reside in noncoding regions. We aimed to identify novel protein-coding variants and pathogenic pathways using exome sequencing (ES) integrated with an Evolutionary Action-Machine Learning (EAML) framework, single-cell RNA sequencing (scRNA-seq), and expression quantitative trait locus (eQTL) analysis.
Methods: GWAS was conducted in 2,559 SSc cases and 893 controls of Caucasian ancestry, with replication in 9,846 cases and 18,333 controls of European ancestry. EAML prioritized genes with high-impact missense variants predictive of disease. Public scRNA-seq data from SSc and control skin biopsies were analyzed to localize gene expression across cell types. Whole blood eQTL data were used to identify regulatory effects of risk variants.
Results: A novel SSc risk locus at MICB (rs2516497, P = 3.66 × 10-13) was identified and replicated. EAML highlighted 284 genes enriched in interferon signaling. scRNA-seq localized MICB and NOTCH4 to fibroblasts and endothelial cells, while HLA class II genes were enriched in macrophages and fibroblasts. eQTL analysis confirmed regulatory effects at MICB, NOTCH4, and other prioritized genes, linking SSc-associated variants to transcriptional dysregulation.
Conclusions: This integrative genomic study identifies novel risk loci and mechanistic pathways in SSc, highlighting MICB, NOTCH4, and interferon-related genes. The findings provide insight into the cellular and regulatory architecture of SSc and support the utility of combining ES, machine learning, scRNA-seq, and eQTL data in complex disease genetics.
Keywords
Humans, Genome-Wide Association Study, Quantitative Trait Loci, Scleroderma, Systemic, Machine Learning, Exome Sequencing, Genetic Predisposition to Disease, Interferons, Male, Female, Case-Control Studies, Signal Transduction, Polymorphism, Single Nucleotide, White People, Middle Aged
Published Open-Access
yes
Recommended Citation
Ketkar, Shamika; Dai, Hongzheng; Burrage, Lindsay; et al., "Integrative Exome Sequencing and Machine Learning Identify Micb and Interferon Pathway Genes as Contributors to SSc Risk" (2025). Faculty, Staff and Students Publications. 6251.
https://digitalcommons.library.tmc.edu/baylor_docs/6251