ExplorerBiomedical EngineeringEngineering
Research PaperResearchia:202607.02031

Group-invariant Coresets for Data-efficient Active Learning

L. C. Ayres

Abstract

Active learning reduces labeling cost by querying the most informative unlabeled samples, but standard coreset methods ignore known data symmetries and can waste budget on transformed versions of the same instance. We propose GRINCO, a group-invariant coreset framework that performs acquisition in the quotient space induced by a transformation group, so that selection operates on orbits rather than raw samples. The method uses either canonical representatives or learned orbit-separating invarian...

Submitted: July 2, 2026Subjects: Engineering; Biomedical Engineering

Description / Details

Active learning reduces labeling cost by querying the most informative unlabeled samples, but standard coreset methods ignore known data symmetries and can waste budget on transformed versions of the same instance. We propose GRINCO, a group-invariant coreset framework that performs acquisition in the quotient space induced by a transformation group, so that selection operates on orbits rather than raw samples. The method uses either canonical representatives or learned orbit-separating invariant embeddings to define practical quotient metrics, and combines quotient-space k-center selection with invariant training through an orbit-averaged loss. We further derive a generalization bound that relates excess orbit-averaged risk to quotient-space coverage, label uncertainty, and intra-orbit variability. Experiments on synthetic scale-invariant data and image benchmarks with rotation-induced redundancy show that GRINCO improves orbit coverage and achieves stronger label efficiency than conventional coreset baselines, especially when group-induced redundancy is substantial.


Source: arXiv:2607.01089v1 - http://arxiv.org/abs/2607.01089v1 PDF: https://arxiv.org/pdf/2607.01089v1 Original Link: http://arxiv.org/abs/2607.01089v1

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Access Paper
View Source PDF
Submission Info
Date:
Jul 2, 2026
Topic:
Biomedical Engineering
Area:
Engineering
Comments:
0
Bookmark
Group-invariant Coresets for Data-efficient Active Learning | Researchia