ExplorerRoboticsRobotics
Research PaperResearchia:202606.15099

AERMANI-PLACE: Language Guided Object Placement with Aerial Manipulators

Sarthak Mishra

Abstract

Object placement is a fundamental component of aerial manipulation tasks, yet existing systems typically require the desired placement position to be specified explicitly in metric coordinates. Such interfaces are not intuitive and require users to reason about coordinate frames and scene geometry, making them difficult to use in practical deployments. In contrast, humans often communicate spatial goals through a combination of language and pointing gestures. Inspired by this observation, we pre...

Submitted: June 15, 2026Subjects: Robotics; Robotics

Description / Details

Object placement is a fundamental component of aerial manipulation tasks, yet existing systems typically require the desired placement position to be specified explicitly in metric coordinates. Such interfaces are not intuitive and require users to reason about coordinate frames and scene geometry, making them difficult to use in practical deployments. In contrast, humans often communicate spatial goals through a combination of language and pointing gestures. Inspired by this observation, we present AERMANI-PLACE, a framework for language-guided object placement with aerial manipulators. Given a scene image and a natural language instruction, an image editing model generates a modified version of the scene containing a visual marker that indicates where the object should be placed. This marker is then grounded into the physical environment using depth observations to recover a metric place point, after which a placement trajectory is generated and executed by the aerial manipulator. We evaluate the proposed approach on a test set of 100 language-guided placement tasks and demonstrate successful execution on a real aerial manipulation platform. Experimental results show that the proposed method reliably infers placement locations from language instructions with an average success rate of 87% on the test-set and transfers effectively to real-world aerial manipulation with an average success rate of 72%. Video: https://youtu.be/SgwwgLBsv0g


Source: arXiv:2606.14531v1 - http://arxiv.org/abs/2606.14531v1 PDF: https://arxiv.org/pdf/2606.14531v1 Original Link: http://arxiv.org/abs/2606.14531v1

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Access Paper
View Source PDF
Submission Info
Date:
Jun 15, 2026
Topic:
Robotics
Area:
Robotics
Comments:
0
Bookmark