ExplorerBiotechnologyBiology
Research PaperResearchia:202606.11016

A mathematical framework for centromere-aware evaluation of human genome assemblies

Luca Franco

Abstract

Accurate evaluation of genome assemblies within highly repetitive regions, such as centromeres, remains a major open challenge in genomics. Conventional benchmarking relies on sequence alignment, which becomes problematic in regions of high homogeneity and divergence. Here, we framed centromere assembly evaluation as a comparative distribution problem in a compact centeny representation by computing genomic distances between functional motifs, rather than relying on nucleotide sequence. Our dist...

Submitted: June 11, 2026Subjects: Biology; Biotechnology

Description / Details

Accurate evaluation of genome assemblies within highly repetitive regions, such as centromeres, remains a major open challenge in genomics. Conventional benchmarking relies on sequence alignment, which becomes problematic in regions of high homogeneity and divergence. Here, we framed centromere assembly evaluation as a comparative distribution problem in a compact centeny representation by computing genomic distances between functional motifs, rather than relying on nucleotide sequence. Our distribution-based metric assesses agreement between a query and a target chromosome by comparing their centromeric inter-motif distances rendered by KL divergence. When applied genome-wide to currently available human telomere-to-telomere (T2T) genomes, this approach yields an accuracy ranking for the entire assembly and for each individual chromosome. Altogether, we present a rapid and robust scoring system based on genomes numerical rendering of inter-motif distances, that provides a quantitative standard of assembly integrity in repetitive DNA regions and establishes a bona fide framework for chromosome-level genome-to-genome comparison.


Source: arXiv:2606.11276v1 - http://arxiv.org/abs/2606.11276v1 PDF: https://arxiv.org/pdf/2606.11276v1 Original Link: http://arxiv.org/abs/2606.11276v1

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Access Paper
View Source PDF
Submission Info
Date:
Jun 11, 2026
Topic:
Biotechnology
Area:
Biology
Comments:
0
Bookmark
A mathematical framework for centromere-aware evaluation of human genome assemblies | Researchia