Back to Explorer
Research PaperResearchia:202602.27038[Data Science > Machine Learning]

Zeroth-Order Stackelberg Control in Combinatorial Congestion Games

Saeed Masiha

Abstract

We study Stackelberg (leader--follower) tuning of network parameters (tolls, capacities, incentives) in combinatorial congestion games, where selfish users choose discrete routes (or other combinatorial strategies) and settle at a congestion equilibrium. The leader minimizes a system-level objective (e.g., total travel time) evaluated at equilibrium, but this objective is typically nonsmooth because the set of used strategies can change abruptly. We propose ZO-Stackelberg, which couples a projection-free Frank--Wolfe equilibrium solver with a zeroth-order outer update, avoiding differentiation through equilibria. We prove convergence to generalized Goldstein stationary points of the true equilibrium objective, with explicit dependence on the equilibrium approximation error, and analyze subsampled oracles: if an exact minimizer is sampled with probability κmκ_m, then the Frank--Wolfe error decays as O(1/(κmT))\mathcal{O}(1/(κ_m T)). We also propose stratified sampling as a practical way to avoid a vanishing κmκ_m when the strategies that matter most for the Wardrop equilibrium concentrate in a few dominant combinatorial classes (e.g., short paths). Experiments on real-world networks demonstrate that our method achieves orders-of-magnitude speedups over a differentiation-based baseline while converging to follower equilibria.


Source: arXiv:2602.23277v1 - http://arxiv.org/abs/2602.23277v1 PDF: https://arxiv.org/pdf/2602.23277v1 Original Link: http://arxiv.org/abs/2602.23277v1

Submission:2/27/2026
Comments:0 comments
Subjects:Machine Learning; Data Science
Original Source:
View Original PDF
arXiv: This paper is hosted on arXiv, an open-access repository
Was this helpful?

Discussion (0)

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!