Bioinspired actor-critic algorithm for reinforcement learning interpretation with Levy–Brown hybrid exploration strategy

Currently, reinforcement learning, the interpretability of the algorithm is a challenge. The lack of interpretability limits the use of reinforcement learning limited when facing agents in the physical world. To improve the interpretability of reinforcement learning, this study proposes a Levy-Brown...

Full description

Saved in:

Bibliographic Details
Published in:	Neurocomputing (Amsterdam) Vol. 574; p. 127291
Main Authors:	Wang, Xiao, Li, Dazi
Format:	Journal Article
Language:	English
Published:	Elsevier B.V 14-03-2024
Subjects:	Actor-critic Interpretable reinforcement learning Levy motion Pareto distribution Levy motion Actor-critic Interpretable reinforcement learning Pareto distribution
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	Currently, reinforcement learning, the interpretability of the algorithm is a challenge. The lack of interpretability limits the use of reinforcement learning limited when facing agents in the physical world. To improve the interpretability of reinforcement learning, this study proposes a Levy-Brown hybrid strategy to improve the working of the traditional Actor-Critic algorithm. The proposed strategy is bioinspired from the Brown motion and Levy motion in nature; therefore, it can explain the process of data acquisition in the learning process from biological principles. The main idea of this new strategy is to map the Gaussian strategy to the biological Brown motion, and introduce the biological Levy strategy to improve the exploration efficiency. By combining the two strategies, it effectively takes advantage of the Levy strategy to improve exploration speed and the Brown strategy to improve exploration stability. The experiments demonstrate the advantages of the proposed Levy-Brown hybrid strategy, which effectively make best use of the advantages and overcomes the disadvantages of the two strategies. The code is available at https://www.researchgate.net/publication/377014427_LevyBrown_Hyribd_strategy.
ISSN:	0925-2312 1872-8286
DOI:	10.1016/j.neucom.2024.127291