ExplorerBiotechnologyBiology
Research PaperResearchia:202605.20020

Informational blueprints reveal condition-dependent gene regulatory architectures

Doruk Efe Gökmen

Abstract

While coding regions in the genome have a direct interpretation in terms of protein products, significant fractions are non-coding and yet control essential biological functions. Unlike the genetic code, there is no "lookup table" that identifies where regulatory proteins, known as transcription factors (TFs), bind. Here, we extract these binding sites by distilling sequences of nucleotide letters into collective coordinates (hyperletters) representing the binding sites that are active under spe...

Submitted: May 20, 2026Subjects: Biology; Biotechnology

Description / Details

While coding regions in the genome have a direct interpretation in terms of protein products, significant fractions are non-coding and yet control essential biological functions. Unlike the genetic code, there is no "lookup table" that identifies where regulatory proteins, known as transcription factors (TFs), bind. Here, we extract these binding sites by distilling sequences of nucleotide letters into collective coordinates (hyperletters) representing the binding sites that are active under specific environmental conditions. Going beyond local information footprints between individual bases and expression levels, our information blueprint\textit{information blueprint} algorithm compresses the global information by optimising filters that simultaneously scan an entire promoter sequence. Inspired by renormalisation-group techniques, we identify TF binding sites as coarse-grained variables combining groups of correlated mutations with the highest collective impact on gene expression. We validate our approach on experimental data for E. coli\textit{E. coli} and discover novel regulatory elements illustrating its deployment at scale across growth conditions.


Source: arXiv:2605.19071v1 - http://arxiv.org/abs/2605.19071v1 PDF: https://arxiv.org/pdf/2605.19071v1 Original Link: http://arxiv.org/abs/2605.19071v1

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Access Paper
View Source PDF
Submission Info
Date:
May 20, 2026
Topic:
Biotechnology
Area:
Biology
Comments:
0
Bookmark