Conservative Continuous-Time Treatment Optimization
Abstract
We develop a conservative continuous-time stochastic control framework for treatment optimization from irregularly sampled patient trajectories. The unknown patient dynamics are modeled as a controlled stochastic differential equation with treatment as a continuous-time control. Naive model-based optimization can exploit model errors and propose out-of-support controls, so optimizing the estimated dynamics may not optimize the true dynamics. To limit extrapolation, we add a consistent signature-based MMD regularizer on path space that penalizes treatment plans whose induced trajectory distribution deviates from observed trajectories. The resulting objective minimizes a computable upper bound on the true cost. Experiments on benchmark datasets show improved robustness and performance compared to non-conservative baselines.
Source: arXiv:2603.16789v1 - http://arxiv.org/abs/2603.16789v1 PDF: https://arxiv.org/pdf/2603.16789v1 Original Link: http://arxiv.org/abs/2603.16789v1