ExplorerComputer VisionComputer Vision
Research PaperResearchia:202602.25006

tttLRM: Test-Time Training for Long Context and Autoregressive 3D Reconstruction

Chen Wang

Abstract

We propose tttLRM, a novel large 3D reconstruction model that leverages a Test-Time Training (TTT) layer to enable long-context, autoregressive 3D reconstruction with linear computational complexity, further scaling the model's capability. Our framework efficiently compresses multiple image observations into the fast weights of the TTT layer, forming an implicit 3D representation in the latent space that can be decoded into various explicit formats, such as Gaussian Splats (GS) for downstream ap...

Submitted: February 25, 2026Subjects: Computer Vision; Computer Vision

Description / Details

We propose tttLRM, a novel large 3D reconstruction model that leverages a Test-Time Training (TTT) layer to enable long-context, autoregressive 3D reconstruction with linear computational complexity, further scaling the model's capability. Our framework efficiently compresses multiple image observations into the fast weights of the TTT layer, forming an implicit 3D representation in the latent space that can be decoded into various explicit formats, such as Gaussian Splats (GS) for downstream applications. The online learning variant of our model supports progressive 3D reconstruction and refinement from streaming observations. We demonstrate that pretraining on novel view synthesis tasks effectively transfers to explicit 3D modeling, resulting in improved reconstruction quality and faster convergence. Extensive experiments show that our method achieves superior performance in feedforward 3D Gaussian reconstruction compared to state-of-the-art approaches on both objects and scenes.


Source: arXiv:2602.20160v1 - http://arxiv.org/abs/2602.20160v1 PDF: https://arxiv.org/pdf/2602.20160v1 Original Link: http://arxiv.org/abs/2602.20160v1

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Access Paper
View Source PDF
Submission Info
Date:
Feb 25, 2026
Topic:
Computer Vision
Area:
Computer Vision
Comments:
0
Bookmark