ExplorerBiotechnologyBiology
Research PaperResearchia:202606.09019

Integrating gene regulatory priors into Transformer attention with scTransformer for interpretable scRNA-seq analysis

Mikele Milia

Abstract

Motivation: Transformer-based models are increasingly applied to large-scale single-cell transcriptomics, showing strong performance through self-supervised learning on millions of cells. However, most existing approaches treat genes as independent features, and largely ignore prior biological knowledge, which limits interpretability and robustness. In this paper, we explore whether explicitly incorporating gene regulatory information can improve both model performance and biological insight. Re...

Submitted: June 9, 2026Subjects: Biology; Biotechnology

Description / Details

Motivation: Transformer-based models are increasingly applied to large-scale single-cell transcriptomics, showing strong performance through self-supervised learning on millions of cells. However, most existing approaches treat genes as independent features, and largely ignore prior biological knowledge, which limits interpretability and robustness. In this paper, we explore whether explicitly incorporating gene regulatory information can improve both model performance and biological insight. Results: We present scTransformer, the first Transformer-based approach that builds a priori knowledge of biological mechanisms into the model's attention patterns. By constraining information flow according to known regulatory structures, the model learns representations that are more biologically meaningful. We evaluate scTransformer on a disease-relevant single-nucleus RNA-seq dataset using supervised cell-type classification. Compared to standard Transformers, our approach improves classification accuracy, enhances separation of cell types in embedding space, and produces attention patterns consistent with known regulatory programs. Overall, our results demonstrate that embedding biological structure into Transformer models can enhance interpretability without sacrificing performance, offering a principled step toward biologically grounded foundation models for single-cell omics.


Source: arXiv:2606.09558v1 - http://arxiv.org/abs/2606.09558v1 PDF: https://arxiv.org/pdf/2606.09558v1 Original Link: http://arxiv.org/abs/2606.09558v1

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Access Paper
View Source PDF
Submission Info
Date:
Jun 9, 2026
Topic:
Biotechnology
Area:
Biology
Comments:
0
Bookmark
Integrating gene regulatory priors into Transformer attention with scTransformer for interpretable scRNA-seq analysis | Researchia