Explorerβ€ΊData Scienceβ€ΊData Science
Research PaperResearchia:202601.0765f980

Online Learning with Limited Information in the Sliding Window Model

Vladimir Braverman

Abstract

Motivated by recent work on the experts problem in the streaming model, we consider the experts problem in the sliding window model. The sliding window model is a well-studied model that captures applications such as traffic monitoring, epidemic tracking, and automated trading, where recent information is more valuable than older data. Formally, we have $n$ experts, $T$ days, the ability to query the predictions of $q$ experts on each day, a limited amount of memory, and should achieve the (near...

Submitted: January 7, 2026Subjects: Data Science; Data Science

Description / Details

Motivated by recent work on the experts problem in the streaming model, we consider the experts problem in the sliding window model. The sliding window model is a well-studied model that captures applications such as traffic monitoring, epidemic tracking, and automated trading, where recent information is more valuable than older data. Formally, we have nn experts, TT days, the ability to query the predictions of qq experts on each day, a limited amount of memory, and should achieve the (near-)optimal regret nWpolylog(nT)\sqrt{nW}\text{polylog}(nT) regret over any window of the last WW days. While it is impossible to achieve such regret with 11 query, we show that with 22 queries we can achieve such regret and with only polylog(nT)\text{polylog}(nT) bits of memory. Not only are our algorithms optimal for sliding windows, but we also show for every interval I\mathcal{I} of days that we achieve n∣I∣polylog(nT)\sqrt{n|\mathcal{I}|}\text{polylog}(nT) regret with 22 queries and only polylog(nT)\text{polylog}(nT) bits of memory, providing an exponential improvement on the memory of previous interval regret algorithms. Building upon these techniques, we address the bandit problem in data streams, where q=1q=1, achieving nT2/3polylog(T)n T^{2/3}\text{polylog}(T) regret with polylog(nT)\text{polylog}(nT) memory, which is the first sublinear regret in the streaming model in the bandit setting with polylogarithmic memory; this can be further improved to the optimal O(nT)\mathcal{O}(\sqrt{nT}) regret if the best expert's losses are in a random order.

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Access Paper
View Source PDF
Submission Info
Date:
Jan 7, 2026
Topic:
Data Science
Area:
Data Science
Comments:
0
Bookmark