Fixed-Point Reasoners: Stable and Adaptive Deep Looped Transformers
Description / Details
Looped architectures provide an inductive bias toward learning step-by-step procedures for tasks that require compositional reasoning. The number of effective layers reached by looping determines the quality of the solution these models find. Like deep architectures, looped architectures are prone to a signal propagation problem induced by depth as the halting decision is postponed. In this paper, we address this signal propagation issue using pre-norm layers and residual scaling. Building on these architectural modifications, we propose FPRM, a Transformer-based Fixed-Point Reasoning Model that uses fixed-point convergence as an end-to-end halting mechanism in a looped architecture. We show that fixed-point halting allows FPRM to adapt its compute to task difficulty. FPRM is effective on common reasoning benchmarks, namely Sudoku, Maze, state-tracking, and ARC-AGI.
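The halting idea described above can be illustrated with a minimal sketch. This is not FPRM itself: the Transformer block is replaced by a hypothetical toy update `block` (a scaled residual step toward a fixed `target`), pre-norm is omitted, and the names `run_until_fixed_point`, `alpha`, `tol`, and `max_steps` are assumptions made for illustration. The sketch only shows how applying the same block repeatedly and stopping once the state stops changing lets the number of loop iterations vary with how far the input is from its solution.

```python
import math

def block(x, target, alpha=0.5):
    """Toy stand-in for the shared (weight-tied) block.

    Applies a scaled residual update x + alpha * f(x) with f(x) = target - x.
    This is an illustrative contraction, not the paper's Transformer layer.
    """
    return [xi + alpha * (ti - xi) for xi, ti in zip(x, target)]

def run_until_fixed_point(x, target, tol=1e-6, max_steps=1000):
    """Loop the same block until successive states differ by less than tol."""
    for step in range(1, max_steps + 1):
        x_next = block(x, target)
        # Euclidean distance between successive states is the halting signal.
        delta = math.sqrt(sum((a - b) ** 2 for a, b in zip(x_next, x)))
        x = x_next
        if delta < tol:
            return x, step
    # Budget exhausted without meeting the tolerance.
    return x, max_steps

target = [1.0, -2.0, 0.5]
# "Easy" input: starts close to the fixed point.
easy_state, easy_steps = run_until_fixed_point([0.9, -1.9, 0.4], target)
# "Hard" input: starts far from the fixed point.
hard_state, hard_steps = run_until_fixed_point([50.0, 80.0, -30.0], target)
print(easy_steps, hard_steps)
```

In this toy setting the input that starts farther from the fixed point needs more loop iterations before the halting test fires, which is the sense in which a convergence-based stopping rule adapts compute to difficulty.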
Source: arXiv:2606.18206v1 - http://arxiv.org/abs/2606.18206v1 PDF: https://arxiv.org/pdf/2606.18206v1
Jun 17, 2026
Artificial Intelligence