Explorerβ€ΊChemistryβ€ΊChemistry
Research PaperResearchia:202603.05037

Information Routing in Atomistic Foundation Models: How Equivariance Creates Linearly Disentangled Representations

Joshua Steier

Abstract

What do atomistic foundation models encode in their intermediate representations, and how is that information organized? We introduce Composition Projection Decomposition (CPD), which uses QR projection to linearly remove composition signal from learned representations and probes the geometric residual. Across eight models from five architectural families on QM9 molecules and Materials Project crystals, we find a disentanglement gradient: tensor product equivariant architectures (MACE) produce r...

Submitted: March 5, 2026Subjects: Chemistry; Chemistry

Description / Details

What do atomistic foundation models encode in their intermediate representations, and how is that information organized? We introduce Composition Projection Decomposition (CPD), which uses QR projection to linearly remove composition signal from learned representations and probes the geometric residual. Across eight models from five architectural families on QM9 molecules and Materials Project crystals, we find a disentanglement gradient: tensor product equivariant architectures (MACE) produce representations where geometry is almost fully linearly accessible after composition removal (Rgeom2=0.782R^2_{\text{geom}} = 0.782 for HOMO-LUMO gap), while handcrafted descriptors (ANI-2x) entangle the same information nonlinearly (Rgeom2=βˆ’0.792R^2_{\text{geom}} = -0.792 under Ridge; R2=+0.784R^2 = +0.784 under MLP). MACE routes target-specific signal through irreducible representation channels -- dipole to L=1L = 1, HOMO-LUMO gap to L=0L = 0 -- a pattern not observed in ViSNet's vector-scalar architecture under the same probe. We show that gradient boosted tree probes on projected residuals are systematically inflated, recovering R2=0.68R^2 = 0.68--0.950.95 on a purely compositional target, and recommend linear probes as the primary metric. Linearly disentangled representations are more sample-efficient under linear probing, suggesting a practical advantage for equivariant architectures beyond raw prediction accuracy.


Source: arXiv:2603.03155v1 - http://arxiv.org/abs/2603.03155v1 PDF: https://arxiv.org/pdf/2603.03155v1 Original Link: http://arxiv.org/abs/2603.03155v1

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Access Paper
View Source PDF
Submission Info
Date:
Mar 5, 2026
Topic:
Chemistry
Area:
Chemistry
Comments:
0
Bookmark
Information Routing in Atomistic Foundation Models: How Equivariance Creates Linearly Disentangled Representations | Researchia