Search Results - "Castellini, Jacopo"
1
Analysing factorizations of action-value networks for cooperative multi-agent reinforcement learning
Published in Autonomous agents and multi-agent systems (2021) “…Recent years have seen the application of deep reinforcement learning techniques to cooperative multi-agent systems, with great empirical success. However,…”
Journal Article
2
Krylov iterative methods for the geometric mean of two matrices times a vector
Published in Numerical algorithms (01-02-2017) “…In this work, we are presenting an efficient way to compute the geometric mean of two positive definite matrices times a vector. For this purpose, we are…”
Journal Article
3
Difference rewards policy gradients
Published in Neural computing & applications (11-11-2022) “…Policy gradient methods have become one of the most popular classes of algorithms for multi-agent reinforcement learning. A key challenge, however,…”
Journal Article
4
Improved Representations for Cooperative Multi-Agent Reinforcement Learning
Published 01-01-2022 “…Multi-agent systems [33, 136] are an ubiquitous presence in our everyday life: our entire society could be seen as a huge multi-agent system in which each…”
Dissertation
5
Learning Numeracy: Binary Arithmetic with Neural Turing Machines
Published 04-04-2019 “…One of the main problems encountered so far with recurrent neural networks is that they struggle to retain long-time information dependencies in their…”
Journal Article
6
Krylov Iterative Methods for the Geometric Mean of Two Matrices Times a Vector
Published 04-03-2019 “…Numerical Algorithms 74(2), 561-571, Springer, 2017. In this work, we are presenting an efficient way to compute the geometric mean of two positive definite…”
Journal Article
7
On Convex Optimal Value Functions For POSGs
Published 15-11-2023 “…Multi-agent planning and reinforcement learning can be challenging when agents cannot see the state of the world or communicate with each other due to…”
Journal Article
8
Analysing Factorizations of Action-Value Networks for Cooperative Multi-Agent Reinforcement Learning
Published 09-11-2023 “…Auton Agent Multi-Agent Syst 35, 25 (2021). Recent years have seen the application of deep reinforcement learning techniques to cooperative multi-agent systems,…”
Journal Article
9
Difference Rewards Policy Gradients
Published 09-11-2023 “…Neural Comput & Applic (2022). Policy gradient methods have become one of the most popular classes of algorithms for multi-agent reinforcement learning. A key…”
Journal Article
10
Optimally Solving Simultaneous-Move Dec-POMDPs: The Sequential Central Planning Approach
Published 23-08-2024 “…Centralized training for decentralized execution paradigm emerged as the state-of-the-art approach to epsilon-optimally solving decentralized partially…”
Journal Article