ExplorerData ScienceMachine Learning
Research PaperResearchia:202602.27038

Zeroth-Order Stackelberg Control in Combinatorial Congestion Games

Saeed Masiha

Abstract

We study Stackelberg (leader--follower) tuning of network parameters (tolls, capacities, incentives) in combinatorial congestion games, where selfish users choose discrete routes (or other combinatorial strategies) and settle at a congestion equilibrium. The leader minimizes a system-level objective (e.g., total travel time) evaluated at equilibrium, but this objective is typically nonsmooth because the set of used strategies can change abruptly. We propose ZO-Stackelberg, which couples a projec...

Submitted: February 27, 2026Subjects: Machine Learning; Data Science

Description / Details

We study Stackelberg (leader--follower) tuning of network parameters (tolls, capacities, incentives) in combinatorial congestion games, where selfish users choose discrete routes (or other combinatorial strategies) and settle at a congestion equilibrium. The leader minimizes a system-level objective (e.g., total travel time) evaluated at equilibrium, but this objective is typically nonsmooth because the set of used strategies can change abruptly. We propose ZO-Stackelberg, which couples a projection-free Frank--Wolfe equilibrium solver with a zeroth-order outer update, avoiding differentiation through equilibria. We prove convergence to generalized Goldstein stationary points of the true equilibrium objective, with explicit dependence on the equilibrium approximation error, and analyze subsampled oracles: if an exact minimizer is sampled with probability κmκ_m, then the Frank--Wolfe error decays as O(1/(κmT))\mathcal{O}(1/(κ_m T)). We also propose stratified sampling as a practical way to avoid a vanishing κmκ_m when the strategies that matter most for the Wardrop equilibrium concentrate in a few dominant combinatorial classes (e.g., short paths). Experiments on real-world networks demonstrate that our method achieves orders-of-magnitude speedups over a differentiation-based baseline while converging to follower equilibria.


Source: arXiv:2602.23277v1 - http://arxiv.org/abs/2602.23277v1 PDF: https://arxiv.org/pdf/2602.23277v1 Original Link: http://arxiv.org/abs/2602.23277v1

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Access Paper
View Source PDF
Submission Info
Date:
Feb 27, 2026
Topic:
Data Science
Area:
Machine Learning
Comments:
0
Bookmark