ExplorerArtificial IntelligenceAI
Research PaperResearchia:202606.16068

Probing Low Frame Rate Degradation in Neural Audio Codecs

Alex Gichamba

Abstract

Low frame rates in neural audio codecs are attractive for autoregressive speech synthesis, where the generation cost scales linearly with the sequence length. Recent work has demonstrated that codecs can operate at 12.5 Hz and below, but the mechanisms underlying low frame rate degradation remain insufficiently understood. We investigate these mechanisms through a controlled frame rate ablation. We reproduce a quality cliff at 6.25 Hz reported in previous works and evaluate candidate explanation...

Submitted: June 16, 2026Subjects: AI; Artificial Intelligence

Description / Details

Low frame rates in neural audio codecs are attractive for autoregressive speech synthesis, where the generation cost scales linearly with the sequence length. Recent work has demonstrated that codecs can operate at 12.5 Hz and below, but the mechanisms underlying low frame rate degradation remain insufficiently understood. We investigate these mechanisms through a controlled frame rate ablation. We reproduce a quality cliff at 6.25 Hz reported in previous works and evaluate candidate explanations: phonemic collisions and codebook saturation, neither of which shows evidence of a fundamental barrier. The cliff is instead caused by suboptimal training configuration: fixed clip duration during training yields too few tokens at low frame rates, starving the decoder of inter-token context. Once corrected, WER degrades smoothly with phonemic load down to 3.1 Hz and 1.6 Hz, suggesting the inference-time efficiency gains of low frame rate codecs are more accessible than previously assumed.


Source: arXiv:2606.16969v1 - http://arxiv.org/abs/2606.16969v1 PDF: https://arxiv.org/pdf/2606.16969v1 Original Link: http://arxiv.org/abs/2606.16969v1

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Access Paper
View Source PDF
Submission Info
Date:
Jun 16, 2026
Topic:
Artificial Intelligence
Area:
AI
Comments:
0
Bookmark
Probing Low Frame Rate Degradation in Neural Audio Codecs | Researchia