Back to Explorer
Research PaperResearchia:202603.13012[Robotics > Robotics]

HandelBot: Real-World Piano Playing via Fast Adaptation of Dexterous Robot Policies

Amber Xie

Abstract

Mastering dexterous manipulation with multi-fingered hands has been a grand challenge in robotics for decades. Despite its potential, the difficulty of collecting high-quality data remains a primary bottleneck for high-precision tasks. While reinforcement learning and simulation-to-real-world transfer offer a promising alternative, the transferred policies often fail for tasks demanding millimeter-scale precision, such as bimanual piano playing. In this work, we introduce HandelBot, a framework that combines a simulation policy and rapid adaptation through a two-stage pipeline. Starting from a simulation-trained policy, we first apply a structured refinement stage to correct spatial alignments by adjusting lateral finger joints based on physical rollouts. Next, we use residual reinforcement learning to autonomously learn fine-grained corrective actions. Through extensive hardware experiments across five recognized songs, we demonstrate that HandelBot can successfully perform precise bimanual piano playing. Our system outperforms direct simulation deployment by a factor of 1.8x and requires only 30 minutes of physical interaction data.


Source: arXiv:2603.12243v1 - http://arxiv.org/abs/2603.12243v1 PDF: https://arxiv.org/pdf/2603.12243v1 Original Link: http://arxiv.org/abs/2603.12243v1

Submission:3/13/2026
Comments:0 comments
Subjects:Robotics; Robotics
Original Source:
View Original PDF
arXiv: This paper is hosted on arXiv, an open-access repository
Was this helpful?

Discussion (0)

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

HandelBot: Real-World Piano Playing via Fast Adaptation of Dexterous Robot Policies | Researchia