Back to Explorer
Research PaperResearchia:202603.04055[Artificial Intelligence > AI]

Boltzmann-based Exploration for Robust Decentralized Multi-Agent Planning

Nhat Nguyen

Abstract

Decentralized Monte Carlo Tree Search (Dec-MCTS) is widely used for cooperative multi-agent planning but struggles in sparse or skewed reward environments. We introduce Coordinated Boltzmann MCTS (CB-MCTS), which replaces deterministic UCT with a stochastic Boltzmann policy and a decaying entropy bonus for sustained yet focused exploration. While Boltzmann exploration has been studied in single-agent MCTS, applying it in multi-agent systems poses unique challenges. CB-MCTS is the first to address this. We analyze CB-MCTS in the simple-regret setting and show in simulations that it outperforms Dec-MCTS in deceptive scenarios and remains competitive on standard benchmarks, providing a robust solution for multi-agent planning.


Source: arXiv:2603.02154v1 - http://arxiv.org/abs/2603.02154v1 PDF: https://arxiv.org/pdf/2603.02154v1 Original Link: http://arxiv.org/abs/2603.02154v1

Submission:3/4/2026
Comments:0 comments
Subjects:AI; Artificial Intelligence
Original Source:
View Original PDF
arXiv: This paper is hosted on arXiv, an open-access repository
Was this helpful?

Discussion (0)

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Boltzmann-based Exploration for Robust Decentralized Multi-Agent Planning | Researchia | Researchia