ExplorerRoboticsRobotics
Research PaperResearchia:202606.16090

When Should a Robot Replan? Regret-Guided Update Scheduling in Time-Varying MDPs

Negin Musavi

Abstract

Robots operating in non-stationary environments must continually adapt their policies as the dynamics drift, but onboard energy and compute budgets cap how often a full state estimation and re-planning step can be performed. This raises a question: \emph{when}, along a horizon, should a robot spend its limited budget? We formulate this problem in time-varying Markov decision processes (TVMDPs) with a known bound on the rate of transition drift. We model execution as a \emph{skip-update} scheme i...

Submitted: June 16, 2026Subjects: Robotics; Robotics

Description / Details

Robots operating in non-stationary environments must continually adapt their policies as the dynamics drift, but onboard energy and compute budgets cap how often a full state estimation and re-planning step can be performed. This raises a question: \emph{when}, along a horizon, should a robot spend its limited budget? We formulate this problem in time-varying Markov decision processes (TVMDPs) with a known bound on the rate of transition drift. We model execution as a \emph{skip-update} scheme in which, at chosen update times, the agent estimates the transition kernel by maximum likelihood and computes a finite-horizon policy, and between updates reuses this policy under a propagated state estimate. We analyze the dynamic regret of this scheme and show how it grows during skip intervals in terms of the properties of the TVMDP and the skip lengths; the resulting bound answers the opening question via an online, regret-guided update rule that allocates the budget adaptively. We evaluate the rule in a simulated Mars-rover navigation task with time-varying slip dynamics and on a Crazyflie quadrotor in indoor obstacle fields. Adaptive allocation outperforms other budgeted baselines.


Source: arXiv:2606.16972v1 - http://arxiv.org/abs/2606.16972v1 PDF: https://arxiv.org/pdf/2606.16972v1 Original Link: http://arxiv.org/abs/2606.16972v1

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Access Paper
View Source PDF
Submission Info
Date:
Jun 16, 2026
Topic:
Robotics
Area:
Robotics
Comments:
0
Bookmark
When Should a Robot Replan? Regret-Guided Update Scheduling in Time-Varying MDPs | Researchia