Research Paper, Researchia:202606.15064

From Self-Supervised Speech Models to Mixture-of-Experts for Robust Anti-Spoofing

Hugo Daumain

Submitted: June 15, 2026
Subjects: AI; Artificial Intelligence

Description / Details

Recent advances in speech generation have significantly improved the naturalness of synthetic speech, making spoofing detection increasingly challenging. A key weakness of current anti-spoofing systems is their limited robustness to unseen synthesis methods. In this work, we transform a self-supervised speech representation model into a Mixture-of-Experts (MoE) architecture to improve generalization. Feed-forward blocks in selected encoder layers are replaced by multiple expert networks controlled by a layer-wise gating mechanism, allowing experts to capture complementary acoustic patterns while preserving the representations learned during self-supervised pretraining. We further analyze the architectural choices affecting the performance of this MoE conversion and investigate the activation behavior of the experts. Evaluated on 14 spoofing datasets, the proposed approach reduces the macro-averaged equal error rate (EER) from 5.46% to 4.81%, an 11.9% relative improvement over the baseline.
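To make the described conversion concrete, the following is a minimal sketch of replacing one feed-forward block with a gated set of experts. It is not the paper's implementation: the abstract does not say how experts are initialized, whether the gate is dense or sparse (top-k), or which activation is used. The sketch assumes that every expert starts as a copy of the pretrained block, that the gate is a dense softmax initialized to zero, and that the block is a bias-free two-layer network with ReLU. The names `FeedForward` and `MoEFeedForward` are hypothetical.

```python
import math
import random


def matvec(W, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]


def softmax(z):
    """Numerically stable softmax over a list of logits."""
    m = max(z)
    e = [math.exp(a - m) for a in z]
    s = sum(e)
    return [a / s for a in e]


class FeedForward:
    """Simplified position-wise feed-forward block: W2 @ relu(W1 @ x)."""

    def __init__(self, W1, W2):
        self.W1, self.W2 = W1, W2

    def __call__(self, x):
        hidden = [max(0.0, h) for h in matvec(self.W1, x)]
        return matvec(self.W2, hidden)

    def copy(self):
        return FeedForward([r[:] for r in self.W1], [r[:] for r in self.W2])


class MoEFeedForward:
    """Hypothetical MoE replacement for one pretrained feed-forward block."""

    def __init__(self, pretrained_ffn, num_experts, dim):
        # Assumption: each expert starts as a copy of the pretrained block.
        self.experts = [pretrained_ffn.copy() for _ in range(num_experts)]
        # Assumption: this layer's gate is zero-initialized, so the initial
        # mixture weights are uniform across experts.
        self.Wg = [[0.0] * dim for _ in range(num_experts)]

    def gate(self, x):
        return softmax(matvec(self.Wg, x))

    def __call__(self, x):
        weights = self.gate(x)
        outputs = [expert(x) for expert in self.experts]
        # Weighted sum of expert outputs, one output dimension at a time.
        return [
            sum(w * out[i] for w, out in zip(weights, outputs))
            for i in range(len(outputs[0]))
        ]


random.seed(0)
dim, hidden = 4, 8
ffn = FeedForward(
    [[random.gauss(0, 1) for _ in range(dim)] for _ in range(hidden)],
    [[random.gauss(0, 1) for _ in range(hidden)] for _ in range(dim)],
)
moe = MoEFeedForward(ffn, num_experts=4, dim=dim)
x = [random.gauss(0, 1) for _ in range(dim)]
```

Under these assumptions, the converted layer reproduces the pretrained block's output at initialization, since all experts are identical and the gate weights sum to one. That is one simple way to keep the pretrained representations intact before fine-tuning lets the experts diverge.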


Source: arXiv:2606.14639v1 - http://arxiv.org/abs/2606.14639v1
PDF: https://arxiv.org/pdf/2606.14639v1
