ExplorerData ScienceMachine Learning
Research PaperResearchia:202606.15070

HumP-KD: A Hybrid Uncertainty-Aware Multi-Stage Progressive Knowledge Distillation Framework for Efficient Fire Classification

Mohammed Arif Mainuddin

Abstract

Real-time fire classification systems require models that are simultaneously accurate, computationally efficient, and deployable on resource-constrained hardware. This work proposes \textbf{HumP-KD}, a Hybrid Uncertainty-aware Multi-stage Progressive Knowledge Distillation framework for efficient fire classification. Two datasets, FlameVision and Dataset-II, containing 8,600 and 31,309 images, are used. Various CNN and transformer baselines are applied under standard preprocessing, online augmen...

Submitted: June 15, 2026Subjects: Machine Learning; Data Science

Description / Details

Real-time fire classification systems require models that are simultaneously accurate, computationally efficient, and deployable on resource-constrained hardware. This work proposes \textbf{HumP-KD}, a Hybrid Uncertainty-aware Multi-stage Progressive Knowledge Distillation framework for efficient fire classification. Two datasets, FlameVision and Dataset-II, containing 8,600 and 31,309 images, are used. Various CNN and transformer baselines are applied under standard preprocessing, online augmentation, Gaussian noise and motion blur robustness conditions. The proposed HumP-KD model distills knowledge from two frozen heterogeneous transformer teachers, Swin-Tiny and ViT-Base, along with their Meta-MLP ensemble, into a lightweight MobileViT-S student via three tightly integrated components. Hierarchical Progressive Knowledge Distillation employs a Hierarchical Feature Builder. It generates a fused spatial attention mask to guide distillation toward discriminative regions selectively. Multi-Stage Knowledge Distillation progressively activates three distillation stages across training. On Dataset-II, HumP-KD achieves a mean F1 score of 0.9876±0.00630.9876 \pm 0.0063 across 10 independent trials, significantly outperforming the MobileViT-S baseline trained without distillation (0.9537±0.03510.9537 \pm 0.0351), with statistical significance confirmed by both independent t-test (p=0.0195p = 0.0195) and Wilcoxon signed-rank test (W=1W = 1, p=0.0039p = 0.0039). The proposed method also demonstrates strong generalization across datasets and robustness under degraded visual conditions. The student model retains only 4.94M parameters and 19.01Mb model size, representing a 5.7×5.7\times parameter reduction over Swin-Tiny and a 17.5×17.5\times reduction over ViT-Base, while achieving 37.72 CPU FPS, making it suitable for real-time deployment.


Source: arXiv:2606.14684v1 - http://arxiv.org/abs/2606.14684v1 PDF: https://arxiv.org/pdf/2606.14684v1 Original Link: http://arxiv.org/abs/2606.14684v1

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Access Paper
View Source PDF
Submission Info
Date:
Jun 15, 2026
Topic:
Data Science
Area:
Machine Learning
Comments:
0
Bookmark
HumP-KD: A Hybrid Uncertainty-Aware Multi-Stage Progressive Knowledge Distillation Framework for Efficient Fire Classification | Researchia