ExplorerMathematicsMathematics
Research PaperResearchia:202605.11032

Penalty-Based First-Order Methods for Bilevel Optimization with Minimax and Constrained Lower-Level Problems

Yiyang Shen

Abstract

We study a class of bilevel optimization problems in which both the upper- and lower-level problems have minimax structures. This setting captures a broad range of emerging applications. Despite the extensive literature on bilevel optimization and minimax optimization separately, existing methods mainly focus on bilevel optimization with lower-level minimization problems, often under strong convexity assumptions, and are not directly applicable to the minimax lower-level setting considered here....

Submitted: May 11, 2026Subjects: Mathematics; Mathematics

Description / Details

We study a class of bilevel optimization problems in which both the upper- and lower-level problems have minimax structures. This setting captures a broad range of emerging applications. Despite the extensive literature on bilevel optimization and minimax optimization separately, existing methods mainly focus on bilevel optimization with lower-level minimization problems, often under strong convexity assumptions, and are not directly applicable to the minimax lower-level setting considered here. To address this gap, we develop penalty-based first-order methods for bilevel minimax optimization without requiring strong convexity of the lower-level problem. In the deterministic setting, we establish that the proposed method finds an εε-KKT point with O~(ε4)\tilde{O}(ε^{-4}) oracle complexity. We further show that bilevel problems with convex constrained lower-level minimization can be reformulated as special cases of our framework via Lagrangian duality, leading to an O~(ε4)\tilde{O}(ε^{-4}) complexity bound that improves upon the existing O~(ε7)\tilde{O}(ε^{-7}) result. Finally, we extend our approach to the stochastic setting, where only stochastic gradient oracles are available, and prove that the proposed stochastic method finds a nearly εε-KKT point with O~(ε9)\tilde{O}(ε^{-9}) oracle complexity.


Source: arXiv:2605.08006v1 - http://arxiv.org/abs/2605.08006v1 PDF: https://arxiv.org/pdf/2605.08006v1 Original Link: http://arxiv.org/abs/2605.08006v1

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Access Paper
View Source PDF
Submission Info
Date:
May 11, 2026
Topic:
Mathematics
Area:
Mathematics
Comments:
0
Bookmark