Search Results - "Castellini, Jacopo"

  • Showing 1 - 10 results of 10
Refine Results
  1. 1

    Analysing factorizations of action-value networks for cooperative multi-agent reinforcement learning by Castellini, Jacopo, Oliehoek, Frans A., Savani, Rahul, Whiteson, Shimon

    “…Recent years have seen the application of deep reinforcement learning techniques to cooperative multi-agent systems, with great empirical success. However,…”
    Get full text
    Journal Article
  2. 2

    Krylov iterative methods for the geometric mean of two matrices times a vector by Castellini, Jacopo

    Published in Numerical algorithms (01-02-2017)
    “…In this work, we are presenting an efficient way to compute the geometric mean of two positive definite matrices times a vector. For this purpose, we are…”
    Get full text
    Journal Article
  3. 3

    Difference rewards policy gradients by Castellini, Jacopo, Devlin, Sam, Oliehoek, Frans A., Savani, Rahul

    Published in Neural computing & applications (11-11-2022)
    “…Abstract Policy gradient methods have become one of the most popular classes of algorithms for multi-agent reinforcement learning. A key challenge, however,…”
    Get full text
    Journal Article
  4. 4

    Improved Representations for Cooperative Multi-Agent Reinforcement Learning by Castellini, Jacopo

    Published 01-01-2022
    “…Multi-agent systems [33, 136] are an ubiquitous presence in our everyday life: our entire society could be seen as a huge multi-agent system in which each…”
    Get full text
    Dissertation
  5. 5

    Learning Numeracy: Binary Arithmetic with Neural Turing Machines by Castellini, Jacopo

    Published 04-04-2019
    “…One of the main problems encountered so far with recurrent neural networks is that they struggle to retain long-time information dependencies in their…”
    Get full text
    Journal Article
  6. 6

    Krylov Iterative Methods for the Geometric Mean of Two Matrices Times a Vector by Castellini, Jacopo

    Published 04-03-2019
    “…Numerical Algorithms 74(2), 561-571, Springer, 2017 In this work, we are presenting an efficient way to compute the geometric mean of two positive definite…”
    Get full text
    Journal Article
  7. 7

    On Convex Optimal Value Functions For POSGs by Cunha, Rafael F, Castellini, Jacopo, Peralez, Johan, Dibangoye, Jilles S

    Published 15-11-2023
    “…Multi-agent planning and reinforcement learning can be challenging when agents cannot see the state of the world or communicate with each other due to…”
    Get full text
    Journal Article
  8. 8

    Analysing Factorizations of Action-Value Networks for Cooperative Multi-Agent Reinforcement Learning by Castellini, Jacopo, Oliehoek, Frans A, Savani, Rahul, Whiteson, Shimon

    Published 09-11-2023
    “…Auton Agent Multi-Agent Syst 35, 25 (2021) Recent years have seen the application of deep reinforcement learning techniques to cooperative multi-agent systems,…”
    Get full text
    Journal Article
  9. 9

    Difference Rewards Policy Gradients by Castellini, Jacopo, Devlin, Sam, Oliehoek, Frans A, Savani, Rahul

    Published 09-11-2023
    “…Neural Comput & Applic (2022) Policy gradient methods have become one of the most popular classes of algorithms for multi-agent reinforcement learning. A key…”
    Get full text
    Journal Article
  10. 10

    Optimally Solving Simultaneous-Move Dec-POMDPs: The Sequential Central Planning Approach by Peralez, Johan, Delage, Aurélien, Castellini, Jacopo, Cunha, Rafael F, Dibangoye, Jilles S

    Published 23-08-2024
    “…Centralized training for decentralized execution paradigm emerged as the state-of-the-art approach to epsilon-optimally solving decentralized partially…”
    Get full text
    Journal Article