

Graphical Abstract — Cell
Spaceflight imposes substantial physiological stress on reproductive systems, yet relevant transcriptomic data remain fragmented across repositories. To address this need, we developed ASTROREPOMICS, a web-based platform that integrates 17 rigorously normalized and batch-corrected transcriptomic datasets spanning multiple species and reproductive tissues. The platform supports reproducible cross-study and cross-species analyses through standardized metadata and an intuitive user interface.
We highlight its utility through two example analyses: (1) irradiated mouse sperm exhibited suppression of RNA splicing and protein-processing pathways alongside activation of interferon- and GPCR-associated programs; and (2) a multi-species intersected-DEG assessment between irradiated rat mammary tissue and microgravity-exposed zebrafish embryos uncovered conserved signatures involving RNA metabolism, cytokine signaling, and angiogenesis.
By consolidating dispersed datasets and offering tailored analytical capabilities, ASTROREPOMICS provides a centralized resource for hypothesis generation and strengthens the research infrastructure needed to advance reproductive health studies in space, supporting long-term efforts to safeguard fertility during deep-space exploration.

The ASTROREPOMICS workflow — Cell
(A) Transcriptomic datasets were collected from multiple public and institutional repositories, including NASA’s GeneLab, the Gene Expression Omnibus (GEO), the Integrated Biobank for Space Life Science (ibSLS), and the Spermatogenesis-Related Multi-Omics Database (SperMD).
(B) Candidate datasets were retrieved using reproduction-related keywords (e.g., sperm, oocyte, zygote, embryo, and gonadal tissue) and spaceflight-relevant conditions (e.g., microgravity and radiation).
(C) Studies were included only if they were both reproduction- and space-relevant, with metadata completeness confirmed through manual review. Curated datasets were organized within a MySQL-based relational database containing normalized gene expression values and structured sample metadata, including species, tissue or cell type, and exposure condition.
(D) The interface enables user-level access for data filtering, batch correction, and cross-study integration using methods such as per-dataset Z score normalization or ComBat, with principal-component analysis (PCA) provided for assessing and visualizing batch-effect removal. Users can then perform differential gene expression analysis with customizable pseudocounts and statistical tests, followed by downstream visualization. Outputs include cross-species DEG comparison plots, volcano plots, heatmaps, pathway enrichment plots, and pathway-to-gene network diagrams derived from gene set enrichment analysis (GSEA) and over-representation analysis (ORA).
Astrobiology, Genomics, bioinformatics,






