Search Results - "Yarats, Denis"
1
Learning Navigation Skills for Legged Robots with Learned Robot Embeddings
Published in 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (27-09-2021) “…Recent work has shown results on learning navigation policies for idealized cylinder agents in simulation and transferring them to real wheeled robots…”
Conference Proceeding
2
On the adequacy of untuned warmup for adaptive optimization
Published 09-10-2019 “…Adaptive optimization algorithms such as Adam are widely used in deep learning. The stability of such algorithms is often improved with a warmup schedule for…”
Journal Article
3
The Differentiable Cross-Entropy Method
Published 27-09-2019 “…We study the cross-entropy method (CEM) for the non-convex optimization of a continuous and parameterized objective function and introduce a differentiable…”
Journal Article
4
Quasi-hyperbolic momentum and Adam for deep learning
Published 15-10-2018 “…Momentum-based acceleration of stochastic gradient descent (SGD) is widely used in deep learning. We propose the quasi-hyperbolic momentum algorithm (QHM) as…”
Journal Article
5
Hierarchical Text Generation and Planning for Strategic Dialogue
Published 15-12-2017 “…End-to-end models for goal-orientated dialogue are challenging to train, because linguistic and strategic aspects are entangled in latent state vectors. We…”
Journal Article
6
Watch and Match: Supercharging Imitation with Regularized Optimal Transport
Published 30-06-2022 “…Imitation learning holds tremendous promise in learning policies efficiently for complex decision making problems. Current state-of-the-art algorithms often…”
Journal Article
7
Image Augmentation Is All You Need: Regularizing Deep Reinforcement Learning from Pixels
Published 28-04-2020 “…We propose a simple data augmentation technique that can be applied to standard model-free reinforcement learning algorithms, enabling robust learning directly…”
Journal Article
8
Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement Learning
Published 20-07-2021 “…We present DrQ-v2, a model-free reinforcement learning (RL) algorithm for visual continuous control. DrQ-v2 builds on DrQ, an off-policy actor-critic approach…”
Journal Article
9
Reinforcement Learning with Prototypical Representations
Published 22-02-2021 “…ICML 2021. Learning effective representations in image-based environments is crucial for sample efficient Reinforcement Learning (RL). Unfortunately, in RL,…”
Journal Article
10
On the model-based stochastic value gradient for continuous reinforcement learning
Published 28-08-2020 “…For over a decade, model-based reinforcement learning has been seen as a way to leverage control-based domain knowledge to improve the sample-efficiency of…”
Journal Article
11
Automatic Data Augmentation for Generalization in Deep Reinforcement Learning
Published 23-06-2020 “…Deep reinforcement learning (RL) agents often fail to generalize to unseen scenarios, even when they are trained on many instances of semantically similar…”
Journal Article
12
CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery
Published 31-01-2022 “…We introduce Contrastive Intrinsic Control (CIC), an algorithm for unsupervised skill discovery that maximizes the mutual information between state-transitions…”
Journal Article
13
Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning
Published 31-01-2022 “…Recent progress in deep learning has relied on access to large and diverse datasets. Such data-driven progress has been less evident in offline reinforcement…”
Journal Article
14
Hierarchical Decision Making by Generating and Following Natural Language Instructions
Published 03-06-2019 “…We explore using latent natural language instructions as an expressive and compositional representation of complex actions for hierarchical decision making…”
Journal Article
15
URLB: Unsupervised Reinforcement Learning Benchmark
Published 28-10-2021 “…Deep Reinforcement Learning (RL) has emerged as a powerful paradigm to solve a range of complex yet specific control tasks. Yet training generalist agents that…”
Journal Article
16
Learning Navigation Skills for Legged Robots with Learned Robot Embeddings
Published 24-11-2020 “…Recent work has shown results on learning navigation policies for idealized cylinder agents in simulation and transferring them to real wheeled robots…”
Journal Article
17
Improving Sample Efficiency in Model-Free Reinforcement Learning from Images
Published 02-10-2019 “…Training an agent to solve control tasks directly from high-dimensional images with model-free reinforcement learning (RL) has proven difficult. A promising…”
Journal Article
18
Generalized Inner Loop Meta-Learning
Published 03-10-2019 “…Many (but not all) approaches self-qualifying as "meta-learning" in deep learning and reinforcement learning fit a common pattern of approximating the solution…”
Journal Article
19
Deal or No Deal? End-to-End Learning for Negotiation Dialogues
Published 15-06-2017 “…Much of human dialogue occurs in semi-cooperative settings, where agents with different goals attempt to agree on common decisions. Negotiations require…”
Journal Article
20
Convolutional Sequence to Sequence Learning
Published 08-05-2017 “…The prevalent approach to sequence to sequence learning maps an input sequence to a variable length output sequence via recurrent neural networks. We introduce…”
Journal Article