ASN: action semantics network for multiagent reinforcement learning

In multiagent systems (MASs), each agent makes individual decisions but all contribute globally to the system’s evolution. Learning in MASs is difficult since each agent’s selection of actions must take place in the presence of other co-learning agents. Moreover, the environmental stochasticity and...

Full description

Saved in:
Bibliographic Details
Published in:Autonomous agents and multi-agent systems Vol. 37; no. 2
Main Authors: Yang, Tianpei, Wang, Weixun, Hao, Jianye, Taylor, Matthew E., Liu, Yong, Hao, Xiaotian, Hu, Yujing, Chen, Yingfeng, Fan, Changjie, Ren, Chunxu, Huang, Ye, Zhu, Jiangcheng, Gao, Yang
Format: Journal Article
Language:English
Published: New York Springer US 01-12-2023
Springer Nature B.V
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:In multiagent systems (MASs), each agent makes individual decisions but all contribute globally to the system’s evolution. Learning in MASs is difficult since each agent’s selection of actions must take place in the presence of other co-learning agents. Moreover, the environmental stochasticity and uncertainties increase exponentially with the number of agents. Previous works borrow various multiagent coordination mechanisms for use in deep learning architectures to facilitate multiagent coordination. However, none of them explicitly consider that different actions can have different influence on other agents, which we call the action semantics. In this paper, we propose a novel network architecture, named Action Semantics Network (ASN), that explicitly represents such action semantics between agents. ASN characterizes different actions’ influence on other agents using neural networks based on the action semantics between them. ASN can be easily combined with existing deep reinforcement learning (DRL) algorithms to boost their performance. Experimental results on StarCraft II micromanagement and Neural MMO show that ASN significantly improves the performance of state-of-the-art DRL approaches, compared with several other network architectures. We also successfully deploy ASN to a popular online MMORPG game called Justice Online, which indicates a promising future for ASN to be applied in even more complex scenarios.
ISSN:1387-2532
1573-7454
DOI:10.1007/s10458-023-09628-3