Research Paper, Researchia:202601.27009 [Biomolecules > Biochemistry]

PCEvo: Path-Consistent Molecular Representation via Virtual Evolutionary

Kun Li

Abstract

Molecular representation learning aims to learn vector embeddings that capture molecular structure and geometry, enabling property prediction and downstream scientific applications. In many AI-for-science tasks, labeled data are expensive to obtain and therefore scarce. In the few-shot setting, models trained with scarce supervision often learn brittle structure-property relationships, resulting in substantially higher prediction errors and reduced generalization to unseen molecules. To address this limitation, we propose PCEvo, a path-consistent representation method that learns from virtual paths through dynamic structural evolution. First, PCEvo retrieves pairs of similar molecules and enumerates multiple chemically feasible edit paths between each pair under topological dependency constraints. Second, it transforms the labels of the two molecules into stepwise supervision along each virtual evolutionary path. Third, it introduces a path-consistency objective that enforces prediction invariance across alternative paths connecting the same two molecules. Comprehensive experiments on the QM9 and MoleculeNet datasets demonstrate that PCEvo substantially improves the few-shot generalization performance of baseline methods. The code is available at https://anonymous.4open.science/r/PCEvo-4BF2.
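The abstract does not spell out how paths are enumerated, how labels become stepwise supervision, or what form the consistency objective takes. The sketch below is one possible reading, not the authors' implementation: edits are treated as opaque names, the dependency constraint is a prerequisite map, stepwise labels are assumed to be linear interpolations between the two endpoint labels, and consistency is assumed to mean that the summed per-step predicted changes agree across paths. All function names (`enumerate_paths`, `stepwise_labels`, `path_consistency_loss`) and the edit names in the usage note are hypothetical.

```python
from itertools import permutations
from statistics import pvariance


def enumerate_paths(edits, depends_on):
    """Yield every ordering of `edits` that respects the prerequisite map.

    `depends_on[e]` is the set of edits that must be applied before `e`
    (a stand-in for the paper's topological dependency constraints).
    Brute force over permutations, so only suitable for a handful of edits.
    """
    for order in permutations(edits):
        applied = set()
        for edit in order:
            if not depends_on.get(edit, set()) <= applied:
                break  # a prerequisite is missing, so this ordering is infeasible
            applied.add(edit)
        else:
            yield list(order)


def stepwise_labels(y_src, y_dst, n_edits):
    """Labels for the n_edits + 1 states on a path.

    Assumption: linear interpolation between the two endpoint labels.
    """
    return [y_src + (y_dst - y_src) * k / n_edits for k in range(n_edits + 1)]


def path_consistency_loss(step_deltas_per_path):
    """Variance, across paths, of the total predicted property change.

    Each inner list holds a model's predicted per-step changes along one path.
    The loss is zero exactly when every path predicts the same total change.
    """
    totals = [sum(deltas) for deltas in step_deltas_per_path]
    return pvariance(totals)
```

For example, with three hypothetical edits where `"close_ring"` requires both `"add_C"` and `"add_O"`, `enumerate_paths` yields the two orderings that end in `"close_ring"`. Any permutation-invariant penalty would serve the same role as the variance used here; variance is chosen only because it is simple to state.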


Source: arXiv:2601.19257v1 - http://arxiv.org/abs/2601.19257v1
PDF: https://arxiv.org/pdf/2601.19257v1

Submission:1/27/2026
Subjects:Biochemistry; Biomolecules