Online Convex Optimization with Sublinear Noisy Probes
Abstract
We study Online Convex Optimization (OCO) over a convex set $K\subseteq \mathbb R^d$, where in each round $t$ the learner selects $x_t\in K$ and then observes a convex loss $f_t:K\to[0,1]$, with the goal of minimizing regret to the best fixed decision in hindsight. We introduce a unified probing model that generalizes two recent lines of work: sublinear best-expert queries in the experts setting, and pairwise (comparison-based) feedback available every round in OCO. In our framework, the learner...
Description / Details
We study Online Convex Optimization (OCO) over a convex set , where in each round the learner selects and then observes a convex loss , with the goal of minimizing regret to the best fixed decision in hindsight. We introduce a unified probing model that generalizes two recent lines of work: sublinear best-expert queries in the experts setting, and pairwise (comparison-based) feedback available every round in OCO. In our framework, the learner has a budget of pairwise probes; on a probed round it may query two points and learn which one has smaller loss. Our main result shows that even a sublinear and noisy probe budget can provably improve worst-case regret in the full feedback OCO regime. With -noisy pairwise probes, we obtain: , which is tight (up to logarithmic factors in ) across , and . Specifically regarding the noise parameter , the regret guarantee smoothly degrades as the oracle response approaches a coin flip, i.e., is close to . When applying the same techniques to a finite for the prediction with experts setting, the resulting rates are instead completely tight in all parameters, including . Our analysis gives a streamlined treatment of pairwise probing in OCO by quantifying the benefit of probing via a variance reduction effect, combined with a second-order (variance-based) analysis of Continuous Exponential Weights.
Source: arXiv:2606.14640v1 - http://arxiv.org/abs/2606.14640v1 PDF: https://arxiv.org/pdf/2606.14640v1 Original Link: http://arxiv.org/abs/2606.14640v1
Please sign in to join the discussion.
No comments yet. Be the first to share your thoughts!
Jun 15, 2026
Data Science
Machine Learning
0