Back to Explorer
Research PaperResearchia:202602.05028[Biotechnology > Biochemistry]

Mechanisms of AI Protein Folding in ESMFold

Kevin Lu

Abstract

How do protein structure prediction models fold proteins? We investigate this question by tracing how ESMFold folds a beta hairpin, a prevalent structural motif. Through counterfactual interventions on model latents, we identify two computational stages in the folding trunk. In the first stage, early blocks initialize pairwise biochemical signals: residue identities and associated biochemical features such as charge flow from sequence representations into pairwise representations. In the second stage, late blocks develop pairwise spatial features: distance and contact information accumulate in the pairwise representation. We demonstrate that the mechanisms underlying structural decisions of ESMFold can be localized, traced through interpretable representations, and manipulated with strong causal effects.


Source: arXiv:2602.06020v1 - http://arxiv.org/abs/2602.06020v1 PDF: https://arxiv.org/pdf/2602.06020v1 Original Article: View on arXiv

Submission:2/5/2026
Comments:0 comments
Subjects:Biochemistry; Biotechnology
Original Source:
View Original PDF
arXiv: This paper is hosted on arXiv, an open-access repository
Was this helpful?

Discussion (0)

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!