ExplorerArtificial IntelligenceAI
Research PaperResearchia:202606.17001

Visual Verification Enables Inference-time Steering and Autonomous Policy Improvement

Mingtong Zhang

Abstract

Robots deployed in the real world should learn from their experience and improve over time. This requires a mechanism of practicing and learning from feedback. In this paper, we propose VERITAS, a generator-verifier framework for generalist robot policies for inference-time policy steering and self-improvement. We use a pre-trained generalist robot policy as a generator'' and pair it with a gradient-free visual verifier'' that evaluates actions at inference time. This framework enables inference...

Submitted: June 17, 2026Subjects: AI; Artificial Intelligence

Description / Details

Robots deployed in the real world should learn from their experience and improve over time. This requires a mechanism of practicing and learning from feedback. In this paper, we propose VERITAS, a generator-verifier framework for generalist robot policies for inference-time policy steering and self-improvement. We use a pre-trained generalist robot policy as a generator'' and pair it with a gradient-free visual verifier'' that evaluates actions at inference time. This framework enables inference-time steering that improves policy performance without additional training. We demonstrate that inference-time verification consistently outperforms vanilla generalists without training on additional demonstration data. Additionally, we demonstrate that the verified rollouts provide effective supervision for offline policy improvement: policies fine-tuned on verified self-generated trajectories achieve consistent performance gains. Notably, we find that post-training with verified rollouts achieves comparable efficiency to expert demonstrations, while requiring no human interventions. Our results highlight inference-time verification as a practical and scalable mechanism for improving robotic policies during deployment.


Source: arXiv:2606.18247v1 - http://arxiv.org/abs/2606.18247v1 PDF: https://arxiv.org/pdf/2606.18247v1 Original Link: http://arxiv.org/abs/2606.18247v1

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Access Paper
View Source PDF
Submission Info
Date:
Jun 17, 2026
Topic:
Artificial Intelligence
Area:
AI
Comments:
0
Bookmark
Visual Verification Enables Inference-time Steering and Autonomous Policy Improvement | Researchia