Back to Explorer
Research PaperResearchia:202602.10047[Data Science > Machine Learning]

ShapeCond: Fast Shapelet-Guided Dataset Condensation for Time Series Classification

Sijia Peng

Abstract

Time series data supports many domains (e.g., finance and climate science), but its rapid growth strains storage and computation. Dataset condensation can alleviate this by synthesizing a compact training set that preserves key information. Yet most condensation methods are image-centric and often fail on time series because they miss time-series-specific temporal structure, especially local discriminative motifs such as shapelets. In this work, we propose ShapeCond, a novel and efficient condensation framework for time series classification that leverages shapelet-based dataset knowledge via a shapelet-guided optimization strategy. Our shapelet-assisted synthesis cost is independent of sequence length: longer series yield larger speedups in synthesis (e.g., 29×\times faster over prior state-of-the-art method CondTSC for time-series condensation, and up to 10,000×\times over naively using shapelets on the Sleep dataset with 3,000 timesteps). By explicitly preserving critical local patterns, ShapeCond improves downstream accuracy and consistently outperforms all prior state-of-the-art time series dataset condensation methods across extensive experiments. Code is available at https://github.com/lunaaa95/ShapeCond.


Source: arXiv:2602.09008v1 - http://arxiv.org/abs/2602.09008v1 PDF: https://arxiv.org/pdf/2602.09008v1 Original Link: http://arxiv.org/abs/2602.09008v1

Submission:2/10/2026
Comments:0 comments
Subjects:Machine Learning; Data Science
Original Source:
View Original PDF
arXiv: This paper is hosted on arXiv, an open-access repository
Was this helpful?

Discussion (0)

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

ShapeCond: Fast Shapelet-Guided Dataset Condensation for Time Series Classification | Researchia | Researchia