Explorerβ€ΊData Scienceβ€ΊStatistics
Research PaperResearchia:202607.01025

Signed-Permutation Coordinate Transport for RMSNorm Transformers

John Sweeney

Abstract

Modern LLM workflows move coordinate-indexed objects across checkpoints: steering vectors, sparse autoencoders, top-$k$ neuron sets, attribution lists, and merge alignments. This is only well posed after fixing the model's residual-stream gauge, which we show is architecture-dependent: LayerNorm residual charts have permutation gauge $S_d$ (up to a global sign flip), while RMSNorm charts with generic per-channel gain have signed-permutation gauge $B_d = S_d \ltimes \{\pm 1\}^d$. Permutation-only...

Submitted: July 1, 2026Subjects: Statistics; Data Science

Description / Details

Modern LLM workflows move coordinate-indexed objects across checkpoints: steering vectors, sparse autoencoders, top-kk neuron sets, attribution lists, and merge alignments. This is only well posed after fixing the model's residual-stream gauge, which we show is architecture-dependent: LayerNorm residual charts have permutation gauge SdS_d (up to a global sign flip), while RMSNorm charts with generic per-channel gain have signed-permutation gauge Bd=Sd⋉{Β±1}dB_d = S_d \ltimes \{\pm 1\}^d. Permutation-only alignment is therefore symmetry-incomplete for RMSNorm models. We introduce sign-marginalized Hungarian matching and prove a sharp failure mode: with decorrelated coordinates, raw signed-correlation matching has a structural permutation-accuracy ceiling at the positive-sign fraction of the true gauge, which sign-marginalization removes. We then make coordinate-preserving transport, not function-level merging, the primary object: composing saved-checkpoint local BdB_d gauges along same-base fine-tuning trajectories recovers 91.1% of cross-run coordinates at 1500 steps versus 60.3% for endpoint matching, and the gain is not explained by merely routing through the base. The recovered gauge transfers tools that permutation-only alignment breaks: TinyLlama SAE reconstruction has NMSE 0.004 under BdB_d versus 1.08 under SdS_d; Qwen sentiment steering preserves 95.8% of its effect versus 17.2%; refusal steering reverses sign under SdS_d; coordinate-preserving merges behave the same way. The same covariance governs stateful training: signed transport of AdamW state preserves the resumed trajectory, while permutation-only state follows a different one from a functionally identical checkpoint. Finally, gauge-sweep audits show index-level interpretability claims are reproducible only relative to an explicit gauge.


Source: arXiv:2606.31963v1 - http://arxiv.org/abs/2606.31963v1 PDF: https://arxiv.org/pdf/2606.31963v1 Original Link: http://arxiv.org/abs/2606.31963v1

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Access Paper
View Source PDF
Submission Info
Date:
Jul 1, 2026
Topic:
Data Science
Area:
Statistics
Comments:
0
Bookmark
Signed-Permutation Coordinate Transport for RMSNorm Transformers | Researchia