Research Article: Integrated transcriptome and single-cell RNA sequencing identifies small GTPase-associated biomarkers in ulcerative colitis
Abstract:
Ulcerative colitis (UC) is a chronic, multifactorial inflammatory bowel disease. The involvement of small GTPase-related genes (SGRGs) in UC remains poorly defined. This study aims to identify SGRGs associated with UC and elucidate the molecular mechanisms by which they regulate the pathological progression of UC.
Multiple transcriptomic datasets were integrated to identify candidate SGRGs in UC. Four machine learning algorithms were utilized for biomarker screening, followed by the construction of a diagnostic nomogram. The robustness of the identified biomarkers and the nomogram was rigorously evaluated through external validation in multiple independent cohorts. Furthermore, the associations between biomarkers and clinical severity, as well as their capacity for differential diagnosis (UC vs. Crohn’s disease (CD)), were assessed. Functional enrichment, immune infiltration, and cross-dataset single-cell RNA sequencing (scRNA-seq) were employed for in-depth mechanistic investigation. In vivo experiments validated core gene expression.
Three biomarkers ( ARHGEF3 , S100A8 , and RHOU ) were successfully identified and validated across multiple independent cohorts. The nomogram constructed based on these biomarkers exhibited excellent diagnostic performance (AUC = 0.991 in the training set, 0.938 and 0.968 in the external validation cohort). Notably, the expression levels of these biomarkers significantly correlated with clinical severity and pathological mucosal states, demonstrating their capacity to reflect disease activity and monitor progression. Furthermore, among the identified biomarkers, ARHGEF3 significantly differentiated UC from CD (p < 0.05), while all three markers exhibited discriminative potential with AUC values exceeding 0.6. Functional annotation illustrated that they were commonly enriched within pathways encompassing “Tight junction” and “Leukocyte transendothelial migration”. Immune infiltration assessment demonstrated 27 differentially expressed immune cells (DEICs) among the UC and control groups. Cross-dataset scRNA-seq analysis further confirmed that macrophages were the key cells, exhibiting significant metabolic reprogramming and functional heterogeneity in UC. In vivo experiments confirmed the expression of above biomarkers in UC.
This study systematically identifies ARHGEF3 , S100A8 , and RHOU as novel UC diagnostic biomarkers, implicating them in disease progression via immune regulation and macrophage function. These findings provide a new basis for precise diagnosis and targeted therapy.
Introduction:
Ulcerative colitis (UC) is a chronic, multifactorial inflammatory bowel disease. The involvement of small GTPase-related genes (SGRGs) in UC remains poorly defined. This study aims to identify SGRGs associated with UC and elucidate the molecular mechanisms by which they regulate the pathological progression of UC.
Read more