Search Results - "Castellini, Jacopo"

1
Analysing factorizations of action-value networks for cooperative multi-agent reinforcement learning by Castellini, Jacopo, Oliehoek, Frans A., Savani, Rahul, Whiteson, Shimon

Published in Autonomous agents and multi-agent systems (2021)
“…Recent years have seen the application of deep reinforcement learning techniques to cooperative multi-agent systems, with great empirical success. However,…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
2
Krylov iterative methods for the geometric mean of two matrices times a vector by Castellini, Jacopo

Published in Numerical algorithms (01-02-2017)
“…In this work, we are presenting an efficient way to compute the geometric mean of two positive definite matrices times a vector. For this purpose, we are…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
3
Difference rewards policy gradients by Castellini, Jacopo, Devlin, Sam, Oliehoek, Frans A., Savani, Rahul

Published in Neural computing & applications (11-11-2022)
“…Abstract Policy gradient methods have become one of the most popular classes of algorithms for multi-agent reinforcement learning. A key challenge, however,…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
4
Improved Representations for Cooperative Multi-Agent Reinforcement Learning by Castellini, Jacopo

Published 01-01-2022
“…Multi-agent systems [33, 136] are an ubiquitous presence in our everyday life: our entire society could be seen as a huge multi-agent system in which each…”

Get full text

Dissertation
QR Code
Save to List

Saved in:
5
Learning Numeracy: Binary Arithmetic with Neural Turing Machines by Castellini, Jacopo

Published 04-04-2019
“…One of the main problems encountered so far with recurrent neural networks is that they struggle to retain long-time information dependencies in their…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
6
Krylov Iterative Methods for the Geometric Mean of Two Matrices Times a Vector by Castellini, Jacopo

Published 04-03-2019
“…Numerical Algorithms 74(2), 561-571, Springer, 2017 In this work, we are presenting an efficient way to compute the geometric mean of two positive definite…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
7
On Convex Optimal Value Functions For POSGs by Cunha, Rafael F, Castellini, Jacopo, Peralez, Johan, Dibangoye, Jilles S

Published 15-11-2023
“…Multi-agent planning and reinforcement learning can be challenging when agents cannot see the state of the world or communicate with each other due to…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
8
Analysing Factorizations of Action-Value Networks for Cooperative Multi-Agent Reinforcement Learning by Castellini, Jacopo, Oliehoek, Frans A, Savani, Rahul, Whiteson, Shimon

Published 09-11-2023
“…Auton Agent Multi-Agent Syst 35, 25 (2021) Recent years have seen the application of deep reinforcement learning techniques to cooperative multi-agent systems,…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
9
Difference Rewards Policy Gradients by Castellini, Jacopo, Devlin, Sam, Oliehoek, Frans A, Savani, Rahul

Published 09-11-2023
“…Neural Comput & Applic (2022) Policy gradient methods have become one of the most popular classes of algorithms for multi-agent reinforcement learning. A key…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
10
Optimally Solving Simultaneous-Move Dec-POMDPs: The Sequential Central Planning Approach by Peralez, Johan, Delage, Aurélien, Castellini, Jacopo, Cunha, Rafael F, Dibangoye, Jilles S

Published 23-08-2024
“…Centralized training for decentralized execution paradigm emerged as the state-of-the-art approach to epsilon-optimally solving decentralized partially…”

Get full text

Journal Article
QR Code
Save to List

Saved in:

Search Results - "Castellini, Jacopo"

Analysing factorizations of action-value networks for cooperative multi-agent reinforcement learning by Castellini, Jacopo, Oliehoek, Frans A., Savani, Rahul, Whiteson, Shimon

Krylov iterative methods for the geometric mean of two matrices times a vector by Castellini, Jacopo

Difference rewards policy gradients by Castellini, Jacopo, Devlin, Sam, Oliehoek, Frans A., Savani, Rahul

Improved Representations for Cooperative Multi-Agent Reinforcement Learning by Castellini, Jacopo

Learning Numeracy: Binary Arithmetic with Neural Turing Machines by Castellini, Jacopo

Krylov Iterative Methods for the Geometric Mean of Two Matrices Times a Vector by Castellini, Jacopo

On Convex Optimal Value Functions For POSGs by Cunha, Rafael F, Castellini, Jacopo, Peralez, Johan, Dibangoye, Jilles S

Analysing Factorizations of Action-Value Networks for Cooperative Multi-Agent Reinforcement Learning by Castellini, Jacopo, Oliehoek, Frans A, Savani, Rahul, Whiteson, Shimon

Difference Rewards Policy Gradients by Castellini, Jacopo, Devlin, Sam, Oliehoek, Frans A, Savani, Rahul

Optimally Solving Simultaneous-Move Dec-POMDPs: The Sequential Central Planning Approach by Peralez, Johan, Delage, Aurélien, Castellini, Jacopo, Cunha, Rafael F, Dibangoye, Jilles S

Search Tools:

Refine Results

Format

Subject Area

Topic

Language

Year of Publication