ExplorerData ScienceMachine Learning
Research PaperResearchia:202604.07021

Data Attribution in Adaptive Learning

Amit Kiran Rege

Abstract

Machine learning models increasingly generate their own training data -- online bandits, reinforcement learning, and post-training pipelines for language models are leading examples. In these adaptive settings, a single training observation both updates the learner and shifts the distribution of future data the learner will collect. Standard attribution methods, designed for static datasets, ignore this feedback. We formalize occurrence-level attribution for finite-horizon adaptive learning via ...

Submitted: April 7, 2026Subjects: Machine Learning; Data Science

Description / Details

Machine learning models increasingly generate their own training data -- online bandits, reinforcement learning, and post-training pipelines for language models are leading examples. In these adaptive settings, a single training observation both updates the learner and shifts the distribution of future data the learner will collect. Standard attribution methods, designed for static datasets, ignore this feedback. We formalize occurrence-level attribution for finite-horizon adaptive learning via a conditional interventional target, prove that replay-side information cannot recover it in general, and identify a structural class in which the target is identified from logged data.


Source: arXiv:2604.04892v1 - http://arxiv.org/abs/2604.04892v1 PDF: https://arxiv.org/pdf/2604.04892v1 Original Link: http://arxiv.org/abs/2604.04892v1

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Access Paper
View Source PDF
Submission Info
Date:
Apr 7, 2026
Topic:
Data Science
Area:
Machine Learning
Comments:
0
Bookmark