Research Paper
Researchia:202606.19011

The Token Is a Group Element: On Lie-Algebra Attention over Matrix Lie Groups

Przemyslaw Musialski


Submitted: June 19, 2026
Subjects: Robotics

Description / Details

We place the attention token on the group: a token is an element $g_i$ of a matrix Lie group $G$, a bare transformation with no feature payload and no external action $\rho(g)$ carrying it. To our knowledge this is the first attention construction whose tokens are bare matrix Lie group elements: their score is the closed-form algebra norm of the relative pose rather than a learned kernel, and it reaches the affine full-frame groups that every irrep- or surjective-exp-based method must exclude. We call it Lie-Algebra Attention. Once tokens are group elements, the rest follows with none of the usual representation-theoretic machinery. The relative geometry of a pair is canonical, $g_i^{-1} g_j$, so the pairwise invariant $w_{ij} = \log(g_i^{-1} g_j)$ is intrinsic rather than designed; equivariance under the diagonal $G$-action is tautological, and the cocycle condition holds automatically. The attention score is the negative squared algebra norm, $s_{ij} = -\|\log(g_i^{-1} g_j)\|_\lambda^2/\tau$: the canonical proximity kernel under a block-weighted Frobenius inner product, with no irreducible representations, spherical harmonics, Clebsch-Gordan products, or learned kernel. The construction applies to any matrix Lie group on a chosen logarithm chart containing the relative poses, including the non-compact non-abelian affine groups with scale and shear that no vector-token attention method reaches: neither the irrep tradition nor surjective-exp methods. Three sequence-completion experiments, on SE(2), SO(3), and Aff(2), bear this out: the closed-form score matches a learned MLP kernel on the same invariant and outperforms it on SE(2), using 50 to 80× fewer score parameters, while a vector-token baseline breaks invariance by five to twelve orders of magnitude.
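The score formula in the abstract can be illustrated with a small sketch on SE(2). This is not the paper's implementation. The function names (`se2`, `se2_log`, `scores`, `softmax_rows`) are hypothetical, and the way the weight $\lambda$ enters the norm is an assumption made here for illustration: the rotation block is given weight 1 and the translation block weight `lam`. The sketch computes $s_{ij} = -\|\log(g_i^{-1} g_j)\|_\lambda^2/\tau$ with the closed-form SE(2) logarithm on the principal chart, then checks that the scores do not change when every token is moved by the same group element.

```python
import numpy as np

def se2(theta, tx, ty):
    """Build a 3x3 homogeneous SE(2) matrix from an angle and a translation."""
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[c, -s, tx], [s, c, ty], [0.0, 0.0, 1.0]])

def se2_log(g):
    """Closed-form principal logarithm of an SE(2) matrix (angle in (-pi, pi])."""
    theta = np.arctan2(g[1, 0], g[0, 0])
    J = np.array([[0.0, -1.0], [1.0, 0.0]])
    if abs(theta) < 1e-9:
        V = np.eye(2)  # small-angle limit of the matrix below
    else:
        V = (np.sin(theta) / theta) * np.eye(2) + ((1.0 - np.cos(theta)) / theta) * J
    w = np.zeros((3, 3))
    w[:2, :2] = theta * J                      # rotation block of the algebra element
    w[:2, 2] = np.linalg.solve(V, g[:2, 2])    # translation block
    return w

def scores(tokens, lam=1.0, tau=1.0):
    """Pairwise scores s_ij = -||log(g_i^{-1} g_j)||^2 / tau.

    Assumption: the block-weighted squared norm is
    ||rotation block||_F^2 + lam * ||translation block||^2.
    """
    n = len(tokens)
    s = np.zeros((n, n))
    for i in range(n):
        gi_inv = np.linalg.inv(tokens[i])
        for j in range(n):
            w = se2_log(gi_inv @ tokens[j])
            sq = np.sum(w[:2, :2] ** 2) + lam * np.sum(w[:2, 2] ** 2)
            s[i, j] = -sq / tau
    return s

def softmax_rows(s):
    """Row-wise softmax turning scores into attention weights."""
    e = np.exp(s - s.max(axis=1, keepdims=True))
    return e / e.sum(axis=1, keepdims=True)

# Three example tokens and an arbitrary group element h applied to all of them.
tokens = [se2(0.1, 0.0, 0.0), se2(0.4, 1.0, -0.5), se2(-0.3, 0.2, 0.7)]
h = se2(1.2, 3.0, -2.0)

S = scores(tokens)
S_moved = scores([h @ g for g in tokens])   # diagonal left action by h
A = softmax_rows(S)

print(np.allclose(S, S_moved))  # invariance check under the diagonal action
```

The invariance holds because $(h g_i)^{-1}(h g_j) = g_i^{-1} g_j$, so the score never sees $h$. For groups without a convenient closed-form logarithm, a general matrix logarithm such as `scipy.linalg.logm` could stand in for `se2_log`, provided the relative poses stay inside the chosen logarithm chart.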


Source: arXiv:2606.20547v1 - http://arxiv.org/abs/2606.20547v1 PDF: https://arxiv.org/pdf/2606.20547v1 Original Link: http://arxiv.org/abs/2606.20547v1
