Research Paper (Researchia:202603.11010)

TiPToP: A Modular Open-Vocabulary Planning System for Robotic Manipulation

William Shen

Submitted: March 11, 2026
Subjects: Robotics

Description / Details

We present TiPToP, an extensible modular system that combines pretrained vision foundation models with an existing Task and Motion Planner (TAMP) to solve multi-step manipulation tasks directly from input RGB images and natural-language instructions. Our system aims to be simple and easy to use: it can be installed and run on a standard DROID setup in under one hour and adapted to new embodiments with minimal effort. We evaluate TiPToP, which requires zero robot data, over 28 tabletop manipulation tasks in simulation and the real world and find that it matches or outperforms $\pi_{0.5}$-DROID, a vision-language-action (VLA) model fine-tuned on 350 hours of embodiment-specific demonstrations. TiPToP's modular architecture enables us to analyze the system's failure modes at the component level. We analyze results from an evaluation of 173 trials and identify directions for improvement. We release TiPToP as open source to further research on modular manipulation systems and tighter integration between learning and planning. Project website and code: https://tiptop-robot.github.io


Source: arXiv:2603.09971v1 (http://arxiv.org/abs/2603.09971v1)
PDF: https://arxiv.org/pdf/2603.09971v1
