ExplorerRoboticsRobotics
Research PaperResearchia:202604.18005

R3D: Revisiting 3D Policy Learning

Zhengdong Hong

Abstract

3D policy learning promises superior generalization and cross-embodiment transfer, but progress has been hindered by training instabilities and severe overfitting, precluding the adoption of powerful 3D perception models. In this work, we systematically diagnose these failures, identifying the omission of 3D data augmentation and the adverse effects of Batch Normalization as primary causes. We propose a new architecture coupling a scalable transformer-based 3D encoder with a diffusion decoder, e...

Submitted: April 18, 2026Subjects: Robotics; Robotics

Description / Details

3D policy learning promises superior generalization and cross-embodiment transfer, but progress has been hindered by training instabilities and severe overfitting, precluding the adoption of powerful 3D perception models. In this work, we systematically diagnose these failures, identifying the omission of 3D data augmentation and the adverse effects of Batch Normalization as primary causes. We propose a new architecture coupling a scalable transformer-based 3D encoder with a diffusion decoder, engineered specifically for stability at scale and designed to leverage large-scale pre-training. Our approach significantly outperforms state-of-the-art 3D baselines on challenging manipulation benchmarks, establishing a new and robust foundation for scalable 3D imitation learning. Project Page: https://r3d-policy.github.io/


Source: arXiv:2604.15281v1 - http://arxiv.org/abs/2604.15281v1 PDF: https://arxiv.org/pdf/2604.15281v1 Original Link: http://arxiv.org/abs/2604.15281v1

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Access Paper
View Source PDF
Submission Info
Date:
Apr 18, 2026
Topic:
Robotics
Area:
Robotics
Comments:
0
Bookmark
R3D: Revisiting 3D Policy Learning | Researchia