Search Results - "Yarats, Denis"

Refine Results
  1. 1

    Learning Navigation Skills for Legged Robots with Learned Robot Embeddings by Truong, Joanne, Yarats, Denis, Li, Tianyu, Meier, Franziska, Chernova, Sonia, Batra, Dhruv, Rai, Akshara

    “…Recent work has shown results on learning navigation policies for idealized cylinder agents in simulation and transferring them to real wheeled robots…”
    Get full text
    Conference Proceeding
  2. 2

    On the adequacy of untuned warmup for adaptive optimization by Ma, Jerry, Yarats, Denis

    Published 09-10-2019
    “…Adaptive optimization algorithms such as Adam are widely used in deep learning. The stability of such algorithms is often improved with a warmup schedule for…”
    Get full text
    Journal Article
  3. 3

    The Differentiable Cross-Entropy Method by Amos, Brandon, Yarats, Denis

    Published 27-09-2019
    “…We study the cross-entropy method (CEM) for the non-convex optimization of a continuous and parameterized objective function and introduce a differentiable…”
    Get full text
    Journal Article
  4. 4

    Quasi-hyperbolic momentum and Adam for deep learning by Ma, Jerry, Yarats, Denis

    Published 15-10-2018
    “…Momentum-based acceleration of stochastic gradient descent (SGD) is widely used in deep learning. We propose the quasi-hyperbolic momentum algorithm (QHM) as…”
    Get full text
    Journal Article
  5. 5

    Hierarchical Text Generation and Planning for Strategic Dialogue by Yarats, Denis, Lewis, Mike

    Published 15-12-2017
    “…End-to-end models for goal-orientated dialogue are challenging to train, because linguistic and strategic aspects are entangled in latent state vectors. We…”
    Get full text
    Journal Article
  6. 6

    Watch and Match: Supercharging Imitation with Regularized Optimal Transport by Haldar, Siddhant, Mathur, Vaibhav, Yarats, Denis, Pinto, Lerrel

    Published 30-06-2022
    “…Imitation learning holds tremendous promise in learning policies efficiently for complex decision making problems. Current state-of-the-art algorithms often…”
    Get full text
    Journal Article
  7. 7

    Image Augmentation Is All You Need: Regularizing Deep Reinforcement Learning from Pixels by Kostrikov, Ilya, Yarats, Denis, Fergus, Rob

    Published 28-04-2020
    “…We propose a simple data augmentation technique that can be applied to standard model-free reinforcement learning algorithms, enabling robust learning directly…”
    Get full text
    Journal Article
  8. 8

    Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement Learning by Yarats, Denis, Fergus, Rob, Lazaric, Alessandro, Pinto, Lerrel

    Published 20-07-2021
    “…We present DrQ-v2, a model-free reinforcement learning (RL) algorithm for visual continuous control. DrQ-v2 builds on DrQ, an off-policy actor-critic approach…”
    Get full text
    Journal Article
  9. 9

    Reinforcement Learning with Prototypical Representations by Yarats, Denis, Fergus, Rob, Lazaric, Alessandro, Pinto, Lerrel

    Published 22-02-2021
    “…ICML 2021 Learning effective representations in image-based environments is crucial for sample efficient Reinforcement Learning (RL). Unfortunately, in RL,…”
    Get full text
    Journal Article
  10. 10

    On the model-based stochastic value gradient for continuous reinforcement learning by Amos, Brandon, Stanton, Samuel, Yarats, Denis, Wilson, Andrew Gordon

    Published 28-08-2020
    “…For over a decade, model-based reinforcement learning has been seen as a way to leverage control-based domain knowledge to improve the sample-efficiency of…”
    Get full text
    Journal Article
  11. 11

    Automatic Data Augmentation for Generalization in Deep Reinforcement Learning by Raileanu, Roberta, Goldstein, Max, Yarats, Denis, Kostrikov, Ilya, Fergus, Rob

    Published 23-06-2020
    “…Deep reinforcement learning (RL) agents often fail to generalize to unseen scenarios, even when they are trained on many instances of semantically similar…”
    Get full text
    Journal Article
  12. 12

    CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery by Laskin, Michael, Liu, Hao, Peng, Xue Bin, Yarats, Denis, Rajeswaran, Aravind, Abbeel, Pieter

    Published 31-01-2022
    “…We introduce Contrastive Intrinsic Control (CIC), an algorithm for unsupervised skill discovery that maximizes the mutual information between state-transitions…”
    Get full text
    Journal Article
  13. 13

    Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning by Yarats, Denis, Brandfonbrener, David, Liu, Hao, Laskin, Michael, Abbeel, Pieter, Lazaric, Alessandro, Pinto, Lerrel

    Published 31-01-2022
    “…Recent progress in deep learning has relied on access to large and diverse datasets. Such data-driven progress has been less evident in offline reinforcement…”
    Get full text
    Journal Article
  14. 14

    Hierarchical Decision Making by Generating and Following Natural Language Instructions by Hu, Hengyuan, Yarats, Denis, Gong, Qucheng, Tian, Yuandong, Lewis, Mike

    Published 03-06-2019
    “…We explore using latent natural language instructions as an expressive and compositional representation of complex actions for hierarchical decision making…”
    Get full text
    Journal Article
  15. 15

    URLB: Unsupervised Reinforcement Learning Benchmark by Laskin, Michael, Yarats, Denis, Liu, Hao, Lee, Kimin, Zhan, Albert, Lu, Kevin, Cang, Catherine, Pinto, Lerrel, Abbeel, Pieter

    Published 28-10-2021
    “…Deep Reinforcement Learning (RL) has emerged as a powerful paradigm to solve a range of complex yet specific control tasks. Yet training generalist agents that…”
    Get full text
    Journal Article
  16. 16

    Learning Navigation Skills for Legged Robots with Learned Robot Embeddings by Truong, Joanne, Yarats, Denis, Li, Tianyu, Meier, Franziska, Chernova, Sonia, Batra, Dhruv, Rai, Akshara

    Published 24-11-2020
    “…Recent work has shown results on learning navigation policies for idealized cylinder agents in simulation and transferring them to real wheeled robots…”
    Get full text
    Journal Article
  17. 17

    Improving Sample Efficiency in Model-Free Reinforcement Learning from Images by Yarats, Denis, Zhang, Amy, Kostrikov, Ilya, Amos, Brandon, Pineau, Joelle, Fergus, Rob

    Published 02-10-2019
    “…Training an agent to solve control tasks directly from high-dimensional images with model-free reinforcement learning (RL) has proven difficult. A promising…”
    Get full text
    Journal Article
  18. 18

    Generalized Inner Loop Meta-Learning by Grefenstette, Edward, Amos, Brandon, Yarats, Denis, Htut, Phu Mon, Molchanov, Artem, Meier, Franziska, Kiela, Douwe, Cho, Kyunghyun, Chintala, Soumith

    Published 03-10-2019
    “…Many (but not all) approaches self-qualifying as "meta-learning" in deep learning and reinforcement learning fit a common pattern of approximating the solution…”
    Get full text
    Journal Article
  19. 19

    Deal or No Deal? End-to-End Learning for Negotiation Dialogues by Lewis, Mike, Yarats, Denis, Dauphin, Yann N, Parikh, Devi, Batra, Dhruv

    Published 15-06-2017
    “…Much of human dialogue occurs in semi-cooperative settings, where agents with different goals attempt to agree on common decisions. Negotiations require…”
    Get full text
    Journal Article
  20. 20

    Convolutional Sequence to Sequence Learning by Gehring, Jonas, Auli, Michael, Grangier, David, Yarats, Denis, Dauphin, Yann N

    Published 08-05-2017
    “…The prevalent approach to sequence to sequence learning maps an input sequence to a variable length output sequence via recurrent neural networks. We introduce…”
    Get full text
    Journal Article