ExplorerMathematicsMathematics
Research PaperResearchia:202605.12027

Switching-Geometry Analysis of Deflated Q-Value Iteration

Donghwan Lee

Abstract

This paper develops a joint spectral radius (JSR) framework for analyzing rank-one deflated Q-value iteration (Q-VI) in discounted Markov decision process control. Focusing on an all-ones residual correction, we interpret the resulting algorithm through the geometry of switching systems and, to the best of our knowledge, give the first JSR-based convergence analysis of deflated Q-VI for policy optimization problems. Our analysis reveals that the standard Q-VI switching system model has JSR exact...

Submitted: May 12, 2026Subjects: Mathematics; Mathematics

Description / Details

This paper develops a joint spectral radius (JSR) framework for analyzing rank-one deflated Q-value iteration (Q-VI) in discounted Markov decision process control. Focusing on an all-ones residual correction, we interpret the resulting algorithm through the geometry of switching systems and, to the best of our knowledge, give the first JSR-based convergence analysis of deflated Q-VI for policy optimization problems. Our analysis reveals that the standard Q-VI switching system model has JSR exactly the discount factor γ(0,1)γ\in (0,1), since all admissible subsystems share the all-ones vector as an invariant direction. By passing to the quotient space that removes this direction, we obtain a projected switching system model whose JSR governs the relevant error dynamics and may be strictly smaller than γγ. Therefore, the deflated Q-VI admits a potentially sharper convergence-rate characterization than the ambient-space γγ-bound. Finally, we prove that the correction is equivalent to a scalar recentering of standard Q-VI. Hence, the projected trajectory, and therefore the greedy-policy sequence, is unchanged relative to standard Q-VI initialized from the same point. The benefit of deflation is not a change in the induced decision-making problem, but a more precise JSR-based description of the convergence geometry after the redundant all-ones component is removed.


Source: arXiv:2605.10811v1 - http://arxiv.org/abs/2605.10811v1 PDF: https://arxiv.org/pdf/2605.10811v1 Original Link: http://arxiv.org/abs/2605.10811v1

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Access Paper
View Source PDF
Submission Info
Date:
May 12, 2026
Topic:
Mathematics
Area:
Mathematics
Comments:
0
Bookmark