ExplorerArtificial IntelligenceAI
Research PaperResearchia:202606.15002

Learning Coordinated Preference for Multi-Objective Multi-Agent Reinforcement Learning

Pengxin Wang

Abstract

Cooperative multi-objective multi-agent reinforcement learning (MOMARL) models team decision making under multiple, potentially conflicting objectives. In this setting, conflicts arise not only across objectives but also across agents with different observations, roles, and contributions. We propose Preference Coordinated Multi-agent Policy Optimization (PCMA), which learns coordinated agent-specific preferences to enable complementary trade-offs among agents. Theoretically, we formulate coopera...

Submitted: June 15, 2026Subjects: AI; Artificial Intelligence

Description / Details

Cooperative multi-objective multi-agent reinforcement learning (MOMARL) models team decision making under multiple, potentially conflicting objectives. In this setting, conflicts arise not only across objectives but also across agents with different observations, roles, and contributions. We propose Preference Coordinated Multi-agent Policy Optimization (PCMA), which learns coordinated agent-specific preferences to enable complementary trade-offs among agents. Theoretically, we formulate cooperative MOMARL as a team-optimal game and show that, under suitable conditions, preference diversity can induce team improvement through a first-order improvement decomposition. Experiments on multiple cooperative MOMA environments and a practical traffic-control scenario show that PCMA improves both performance and trade-off coordination.


Source: arXiv:2606.14693v1 - http://arxiv.org/abs/2606.14693v1 PDF: https://arxiv.org/pdf/2606.14693v1 Original Link: http://arxiv.org/abs/2606.14693v1

Please sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Access Paper
View Source PDF
Submission Info
Date:
Jun 15, 2026
Topic:
Artificial Intelligence
Area:
AI
Comments:
0
Bookmark