Back to Explorer
Research PaperResearchia:202601.21010[Genomics > Biology]

Efficient Imputation for Patch-based Missing Single-cell Data via Cluster-regularized Optimal Transport

Yuyu Liu

Abstract

Missing data in single-cell sequencing datasets poses significant challenges for extracting meaningful biological insights. However, existing imputation approaches, which often assume uniformity and data completeness, struggle to address cases with large patches of missing data. In this paper, we present CROT, an optimal transport-based imputation algorithm designed to handle patch-based missing data in tabular formats. Our approach effectively captures the underlying data structure in the presence of significant missingness. Notably, it achieves superior imputation accuracy while significantly reducing runtime, demonstrating its scalability and efficiency for large-scale datasets. This work introduces a robust solution for imputation in heterogeneous, high-dimensional datasets with structured data absence, addressing critical challenges in both biological and clinical data analysis. Our code is available at Anomalous Github.


Source: arXiv:2601.14653v1 - http://arxiv.org/abs/2601.14653v1 PDF: https://arxiv.org/pdf/2601.14653v1 Original Link: http://arxiv.org/abs/2601.14653v1

Submission:1/21/2026
Comments:0 comments
Subjects:Biology; Genomics
Original Source:
View Original PDF
arXiv: This paper is hosted on arXiv, an open-access repository
Was this helpful?

Discussion (0)

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!