Search Results - "Borsa, Diana"

1
Detecting disease outbreaks in mass gatherings using Internet data by Yom-Tov, Elad, Borsa, Diana, Cox, Ingemar J, McKendry, Rachel A

Published in Journal of medical Internet research (01-06-2014)
“…Mass gatherings, such as music festivals and religious events, pose a health care challenge because of the risk of transmission of communicable diseases. This…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
2
Fast reinforcement learning with generalized policy updates by Barreto, André, Hou, Shaobo, Borsa, Diana, Silver, David, Precup, Doina

Published in Proceedings of the National Academy of Sciences - PNAS (01-12-2020)
“…The combination of reinforcement learning with deep learning is a promising approach to tackle important sequential decision-making problems that are currently…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
3
Reinforcement Learning in Persistent Environments : Representation Learning and Transfer by Borsa, Diana

Published 01-01-2020
“…Reinforcement learning (RL) provides a general framework for modelling and reasoning about agents capable of sequential decision making, with the goal of…”

Get full text

Dissertation
QR Code
Save to List

Saved in:
4
Automatic identification of Web-based risk markers for health events by Yom-Tov, Elad, Borsa, Diana, Hayward, Andrew C, McKendry, Rachel A, Cox, Ingemar J

Published in Journal of medical Internet research (01-01-2015)
“…The escalating cost of global health care is driving the development of new technologies to identify early indicators of an individual's risk of disease…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
5
A State Representation for Diminishing Rewards by Moskovitz, Ted, Hromadka, Samo, Touati, Ahmed, Borsa, Diana, Sahani, Maneesh

Published 07-09-2023
“…A common setting in multitask reinforcement learning (RL) demands that an agent rapidly adapt to various stationary reward functions randomly sampled from a…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
6
Selective Credit Assignment by Chelu, Veronica, Borsa, Diana, Precup, Doina, van Hasselt, Hado

Published 19-02-2022
“…Efficient credit assignment is essential for reinforcement learning algorithms in both prediction and control settings. We describe a unified view on…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
7
When should agents explore? by Pîslar, Miruna, Szepesvari, David, Ostrovski, Georg, Borsa, Diana, Schaul, Tom

Published 26-08-2021
“…Exploration remains a central challenge for reinforcement learning (RL). Virtually all existing methods share the feature of a monolithic behaviour policy that…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
8
Representations and Exploration for Deep Reinforcement Learning using Singular Value Decomposition by Chandak, Yash, Thakoor, Shantanu, Guo, Zhaohan Daniel, Tang, Yunhao, Munos, Remi, Dabney, Will, Borsa, Diana L

Published 01-05-2023
“…Representation learning and exploration are among the key challenges for any deep reinforcement learning agent. In this work, we provide a singular value…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
9
Return-based Scaling: Yet Another Normalisation Trick for Deep RL by Schaul, Tom, Ostrovski, Georg, Kemaev, Iurii, Borsa, Diana

Published 11-05-2021
“…Scaling issues are mundane yet irritating for practitioners of reinforcement learning. Error scales vary across domains, tasks, and stages of learning;…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
10
A Unifying Framework for Action-Conditional Self-Predictive Reinforcement Learning by Khetarpal, Khimya, Guo, Zhaohan Daniel, Pires, Bernardo Avila, Tang, Yunhao, Lyle, Clare, Rowland, Mark, Heess, Nicolas, Borsa, Diana, Guez, Arthur, Dabney, Will

Published 04-06-2024
“…Learning a good representation is a crucial challenge for Reinforcement Learning (RL) agents. Self-predictive learning provides means to jointly learn a latent…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
11
Generalised Policy Improvement with Geometric Policy Composition by Thakoor, Shantanu, Rowland, Mark, Borsa, Diana, Dabney, Will, Munos, Rémi, Barreto, André

Published 17-06-2022
“…We introduce a method for policy improvement that interpolates between the greedy approach of value-based reinforcement learning (RL) and the full planning…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
12
Model-Value Inconsistency as a Signal for Epistemic Uncertainty by Filos, Angelos, Vértes, Eszter, Marinho, Zita, Farquhar, Gregory, Borsa, Diana, Friesen, Abram, Behbahani, Feryal, Schaul, Tom, Barreto, André, Osindero, Simon

Published 08-12-2021
“…Using a model of the environment and a value function, an agent can construct many estimates of a state's value, by unrolling the model for different lengths…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
13
Ray Interference: a Source of Plateaus in Deep Reinforcement Learning by Schaul, Tom, Borsa, Diana, Modayil, Joseph, Pascanu, Razvan

Published 25-04-2019
“…Rather than proposing a new method, this paper investigates an issue present in existing learning algorithms. We study the learning dynamics of reinforcement…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
14
Expected Eligibility Traces by van Hasselt, Hado, Madjiheurem, Sephora, Hessel, Matteo, Silver, David, Barreto, André, Borsa, Diana

Published 03-07-2020
“…The question of how to determine which states and actions are responsible for a certain outcome is known as the credit assignment problem and remains a central…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
15
Adapting Behaviour for Learning Progress by Schaul, Tom, Borsa, Diana, Ding, David, Szepesvari, David, Ostrovski, Georg, Dabney, Will, Osindero, Simon

Published 14-12-2019
“…Determining what experience to generate to best facilitate learning (i.e. exploration) is one of the distinguishing features and open challenges in…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
16
Conditional Importance Sampling for Off-Policy Learning by Rowland, Mark, Harutyunyan, Anna, van Hasselt, Hado, Borsa, Diana, Schaul, Tom, Munos, Rémi, Dabney, Will

Published 16-10-2019
“…The principal contribution of this paper is a conceptual framework for off-policy reinforcement learning, based on conditional expectations of importance…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
17
The Option Keyboard: Combining Skills in Reinforcement Learning by Barreto, André, Borsa, Diana, Hou, Shaobo, Comanici, Gheorghe, Aygün, Eser, Hamel, Philippe, Toyama, Daniel, Hunt, Jonathan, Mourad, Shibl, Silver, David, Precup, Doina

Published 24-06-2021
“…The ability to combine known skills to create new ones may be crucial in the solution of complex reinforcement learning problems that unfold over extended…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
18
Learning Shared Representations in Multi-task Reinforcement Learning by Borsa, Diana, Graepel, Thore, Shawe-Taylor, John

Published 07-03-2016
“…We investigate a paradigm in multi-task reinforcement learning (MT-RL) in which an agent is placed in an environment and needs to learn to perform a series of…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
19
General non-linear Bellman equations by van Hasselt, Hado, Quan, John, Hessel, Matteo, Xu, Zhongwen, Borsa, Diana, Barreto, Andre

Published 08-07-2019
“…We consider a general class of non-linear Bellman equations. These open up a design space of algorithms that have interesting properties, which has two…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
20
Observational Learning by Reinforcement Learning by Borsa, Diana, Piot, Bilal, Munos, Rémi, Pietquin, Olivier

Published 20-06-2017
“…Observational learning is a type of learning that occurs as a function of observing, retaining and possibly replicating or imitating the behaviour of another…”

Get full text

Journal Article
QR Code
Save to List

Saved in:

Search Results - "Borsa, Diana"

Detecting disease outbreaks in mass gatherings using Internet data by Yom-Tov, Elad, Borsa, Diana, Cox, Ingemar J, McKendry, Rachel A

Fast reinforcement learning with generalized policy updates by Barreto, André, Hou, Shaobo, Borsa, Diana, Silver, David, Precup, Doina

Reinforcement Learning in Persistent Environments : Representation Learning and Transfer by Borsa, Diana

Automatic identification of Web-based risk markers for health events by Yom-Tov, Elad, Borsa, Diana, Hayward, Andrew C, McKendry, Rachel A, Cox, Ingemar J

A State Representation for Diminishing Rewards by Moskovitz, Ted, Hromadka, Samo, Touati, Ahmed, Borsa, Diana, Sahani, Maneesh

Selective Credit Assignment by Chelu, Veronica, Borsa, Diana, Precup, Doina, van Hasselt, Hado

When should agents explore? by Pîslar, Miruna, Szepesvari, David, Ostrovski, Georg, Borsa, Diana, Schaul, Tom

Representations and Exploration for Deep Reinforcement Learning using Singular Value Decomposition by Chandak, Yash, Thakoor, Shantanu, Guo, Zhaohan Daniel, Tang, Yunhao, Munos, Remi, Dabney, Will, Borsa, Diana L

Return-based Scaling: Yet Another Normalisation Trick for Deep RL by Schaul, Tom, Ostrovski, Georg, Kemaev, Iurii, Borsa, Diana

A Unifying Framework for Action-Conditional Self-Predictive Reinforcement Learning by Khetarpal, Khimya, Guo, Zhaohan Daniel, Pires, Bernardo Avila, Tang, Yunhao, Lyle, Clare, Rowland, Mark, Heess, Nicolas, Borsa, Diana, Guez, Arthur, Dabney, Will

Generalised Policy Improvement with Geometric Policy Composition by Thakoor, Shantanu, Rowland, Mark, Borsa, Diana, Dabney, Will, Munos, Rémi, Barreto, André

Model-Value Inconsistency as a Signal for Epistemic Uncertainty by Filos, Angelos, Vértes, Eszter, Marinho, Zita, Farquhar, Gregory, Borsa, Diana, Friesen, Abram, Behbahani, Feryal, Schaul, Tom, Barreto, André, Osindero, Simon

Ray Interference: a Source of Plateaus in Deep Reinforcement Learning by Schaul, Tom, Borsa, Diana, Modayil, Joseph, Pascanu, Razvan

Expected Eligibility Traces by van Hasselt, Hado, Madjiheurem, Sephora, Hessel, Matteo, Silver, David, Barreto, André, Borsa, Diana

Adapting Behaviour for Learning Progress by Schaul, Tom, Borsa, Diana, Ding, David, Szepesvari, David, Ostrovski, Georg, Dabney, Will, Osindero, Simon

Conditional Importance Sampling for Off-Policy Learning by Rowland, Mark, Harutyunyan, Anna, van Hasselt, Hado, Borsa, Diana, Schaul, Tom, Munos, Rémi, Dabney, Will

The Option Keyboard: Combining Skills in Reinforcement Learning by Barreto, André, Borsa, Diana, Hou, Shaobo, Comanici, Gheorghe, Aygün, Eser, Hamel, Philippe, Toyama, Daniel, Hunt, Jonathan, Mourad, Shibl, Silver, David, Precup, Doina

Learning Shared Representations in Multi-task Reinforcement Learning by Borsa, Diana, Graepel, Thore, Shawe-Taylor, John

General non-linear Bellman equations by van Hasselt, Hado, Quan, John, Hessel, Matteo, Xu, Zhongwen, Borsa, Diana, Barreto, Andre

Observational Learning by Reinforcement Learning by Borsa, Diana, Piot, Bilal, Munos, Rémi, Pietquin, Olivier

Search Tools:

Refine Results

Format

Subject Area

Topic

Language

Year of Publication