Search Results - "Hayes, Conor F"

  • Showing 1 - 17 results of 17
Refine Results
  1. 1
  2. 2

    Actor-critic multi-objective reinforcement learning for non-linear utility functions by Reymond, Mathieu, Hayes, Conor F., Steckelmacher, Denis, Roijers, Diederik M., Nowé, Ann

    Published in Autonomous agents and multi-agent systems (01-10-2023)
    “…We propose a novel multi-objective reinforcement learning algorithm that successfully learns the optimal policy even for non-linear utility functions…”
    Get full text
    Journal Article
  3. 3

    Scalar reward is not enough: a response to Silver, Singh, Precup and Sutton (2021) by Vamplew, Peter, Smith, Benjamin J., Källström, Johan, Ramos, Gabriel, Rădulescu, Roxana, Roijers, Diederik M., Hayes, Conor F., Heintz, Fredrik, Mannion, Patrick, Libin, Pieter J. K., Dazeley, Richard, Foale, Cameron

    Published in Autonomous agents and multi-agent systems (01-10-2022)
    “…The recent paper “Reward is Enough” by Silver, Singh, Precup and Sutton posits that the concept of reward maximisation is sufficient to underpin all…”
    Get full text
    Journal Article
  4. 4

    Monte Carlo tree search algorithms for risk-aware and multi-objective reinforcement learning by Hayes, Conor F., Reymond, Mathieu, Roijers, Diederik M., Howley, Enda, Mannion, Patrick

    Published in Autonomous agents and multi-agent systems (01-10-2023)
    “…In many risk-aware and multi-objective reinforcement learning settings, the utility of the user is derived from a single execution of a policy. In these…”
    Get full text
    Journal Article
  5. 5

    Expected scalarised returns dominance: a new solution concept for multi-objective decision making by Hayes, Conor F., Verstraeten, Timothy, Roijers, Diederik M., Howley, Enda, Mannion, Patrick

    Published in Neural computing & applications (05-07-2022)
    “…Abstract In many real-world scenarios, the utility of a user is derived from a single execution of a policy. In this case, to apply multi-objective…”
    Get full text
    Journal Article
  6. 6

    Exploring the Pareto front of multi-objective COVID-19 mitigation policies using reinforcement learning by Reymond, Mathieu, Hayes, Conor F., Willem, Lander, Rădulescu, Roxana, Abrams, Steven, Roijers, Diederik M., Howley, Enda, Mannion, Patrick, Hens, Niel, Nowé, Ann, Libin, Pieter

    Published in Expert systems with applications (01-09-2024)
    “…Infectious disease outbreaks can have a disruptive impact on public health and societal processes. As decision-making in the context of epidemic mitigation is…”
    Get full text
    Journal Article
  7. 7

    Multi-objective Reinforcement Learning: A Tool for Pluralistic Alignment by Vamplew, Peter, Hayes, Conor F, Foale, Cameron, Dazeley, Richard, Harland, Hadassah

    Published 14-10-2024
    “…Reinforcement learning (RL) is a valuable tool for the creation of AI systems. However it may be problematic to adequately align RL based on scalar rewards if…”
    Get full text
    Journal Article
  8. 8

    Distributional Multi-Objective Decision Making by Röpke, Willem, Hayes, Conor F, Mannion, Patrick, Howley, Enda, Nowé, Ann, Roijers, Diederik M

    Published 09-05-2023
    “…For effective decision support in scenarios with conflicting objectives, sets of potentially optimal solutions can be presented to the decision maker. We…”
    Get full text
    Journal Article
  9. 9

    Monte Carlo Tree Search Algorithms for Risk-Aware and Multi-Objective Reinforcement Learning by Hayes, Conor F, Reymond, Mathieu, Roijers, Diederik M, Howley, Enda, Mannion, Patrick

    Published 23-11-2022
    “…In many risk-aware and multi-objective reinforcement learning settings, the utility of the user is derived from a single execution of a policy. In these…”
    Get full text
    Journal Article
  10. 10

    From Text to Life: On the Reciprocal Relationship between Artificial Life and Large Language Models by Nisioti, Eleni, Glanois, Claire, Najarro, Elias, Dai, Andrew, Meyerson, Elliot, Pedersen, Joachim Winther, Teodorescu, Laetitia, Hayes, Conor F, Sudhakaran, Shyam, Risi, Sebastian

    Published 14-06-2024
    “…Large Language Models (LLMs) have taken the field of AI by storm, but their adoption in the field of Artificial Life (ALife) has been, so far, relatively…”
    Get full text
    Journal Article
  11. 11

    Expected Scalarised Returns Dominance: A New Solution Concept for Multi-Objective Decision Making by Hayes, Conor F, Verstraeten, Timothy, Roijers, Diederik M, Howley, Enda, Mannion, Patrick

    Published 01-07-2022
    “…In many real-world scenarios, the utility of a user is derived from the single execution of a policy. In this case, to apply multi-objective reinforcement…”
    Get full text
    Journal Article
  12. 12

    Multi-Objective Coordination Graphs for the Expected Scalarised Returns with Generative Flow Models by Hayes, Conor F, Verstraeten, Timothy, Roijers, Diederik M, Howley, Enda, Mannion, Patrick

    Published 01-07-2022
    “…Many real-world problems contain multiple objectives and agents, where a trade-off exists between objectives. Key to solving such problems is to exploit sparse…”
    Get full text
    Journal Article
  13. 13

    Utility-Based Reinforcement Learning: Unifying Single-objective and Multi-objective Reinforcement Learning by Vamplew, Peter, Foale, Cameron, Hayes, Conor F, Mannion, Patrick, Howley, Enda, Dazeley, Richard, Johnson, Scott, Källström, Johan, Ramos, Gabriel, Rădulescu, Roxana, Röpke, Willem, Roijers, Diederik M

    Published 04-02-2024
    “…Research in multi-objective reinforcement learning (MORL) has introduced the utility-based paradigm, which makes use of both environmental rewards and a…”
    Get full text
    Journal Article
  14. 14

    Risk Aware and Multi-Objective Decision Making with Distributional Monte Carlo Tree Search by Hayes, Conor F, Reymond, Mathieu, Roijers, Diederik M, Howley, Enda, Mannion, Patrick

    Published 01-02-2021
    “…In many risk-aware and multi-objective reinforcement learning settings, the utility of the user is derived from the single execution of a policy. In these…”
    Get full text
    Journal Article
  15. 15

    Exploring the Pareto front of multi-objective COVID-19 mitigation policies using reinforcement learning by Reymond, Mathieu, Hayes, Conor F, Willem, Lander, Rădulescu, Roxana, Abrams, Steven, Roijers, Diederik M, Howley, Enda, Mannion, Patrick, Hens, Niel, Nowé, Ann, Libin, Pieter

    Published 11-04-2022
    “…Infectious disease outbreaks can have a disruptive impact on public health and societal processes. As decision making in the context of epidemic mitigation is…”
    Get full text
    Journal Article
  16. 16

    Scalar reward is not enough: A response to Silver, Singh, Precup and Sutton (2021) by Vamplew, Peter, Smith, Benjamin J, Kallstrom, Johan, Ramos, Gabriel, Radulescu, Roxana, Roijers, Diederik M, Hayes, Conor F, Heintz, Fredrik, Mannion, Patrick, Libin, Pieter J. K, Dazeley, Richard, Foale, Cameron

    Published 24-11-2021
    “…The recent paper `"Reward is Enough" by Silver, Singh, Precup and Sutton posits that the concept of reward maximisation is sufficient to underpin all…”
    Get full text
    Journal Article
  17. 17