Search Results - "Danihelka, Ivo"

  • Showing 1 - 20 results of 20
Refine Results
  1. 1
  2. 2
  3. 3
  4. 4

    Planning and Policy Improvement by Danihelka, Ivo

    Published 01-01-2023
    “…MuZero is currently the most successful general reinforcement learning algorithm, achieving the state of the art on Go, chess, shogi, and Atari. We want to…”
    Get full text
    Dissertation
  5. 5

    Muesli: Combining Improvements in Policy Optimization by Hessel, Matteo, Danihelka, Ivo, Viola, Fabio, Guez, Arthur, Schmitt, Simon, Sifre, Laurent, Weber, Theophane, Silver, David, van Hasselt, Hado

    Published 13-04-2021
    “…We propose a novel policy update that combines regularized policy optimization with model learning as an auxiliary loss. The update (henceforth Muesli) matches…”
    Get full text
    Journal Article
  6. 6

    Optimistic Simulated Exploration as an Incentive for Real Exploration by Danihelka, Ivo

    Published 17-03-2009
    “…POSTER 2009 Many reinforcement learning exploration techniques are overly optimistic and try to explore every state. Such exploration is impossible in…”
    Get full text
    Journal Article
  7. 7

    Comparison of Maximum Likelihood and GAN-based training of Real NVPs by Danihelka, Ivo, Lakshminarayanan, Balaji, Uria, Benigno, Wierstra, Daan, Dayan, Peter

    Published 15-05-2017
    “…We train a generator by maximum likelihood and we also train the same generator architecture by Wasserstein GAN. We then compare the generated samples, exact…”
    Get full text
    Journal Article
  8. 8

    Grid Long Short-Term Memory by Kalchbrenner, Nal, Danihelka, Ivo, Graves, Alex

    Published 06-07-2015
    “…This paper introduces Grid Long Short-Term Memory, a network of LSTM cells arranged in a multidimensional grid that can be applied to vectors, sequences or…”
    Get full text
    Journal Article
  9. 9

    Neural Turing Machines by Graves, Alex, Wayne, Greg, Danihelka, Ivo

    Published 20-10-2014
    “…We extend the capabilities of neural networks by coupling them to external memory resources, which they can interact with by attentional processes. The…”
    Get full text
    Journal Article
  10. 10

    Memory-Efficient Backpropagation Through Time by Gruslys, Audrūnas, Munos, Remi, Danihelka, Ivo, Lanctot, Marc, Graves, Alex

    Published 10-06-2016
    “…We propose a novel approach to reduce memory consumption of the backpropagation through time (BPTT) algorithm when training recurrent neural networks (RNNs)…”
    Get full text
    Journal Article
  11. 11

    Associative Long Short-Term Memory by Danihelka, Ivo, Wayne, Greg, Uria, Benigno, Kalchbrenner, Nal, Graves, Alex

    Published 09-02-2016
    “…We investigate a new method to augment recurrent neural networks with extra memory without increasing the number of network parameters. The system has an…”
    Get full text
    Journal Article
  12. 12

    The Cramer Distance as a Solution to Biased Wasserstein Gradients by Bellemare, Marc G, Danihelka, Ivo, Dabney, Will, Mohamed, Shakir, Lakshminarayanan, Balaji, Hoyer, Stephan, Munos, Rémi

    Published 30-05-2017
    “…The Wasserstein probability metric has received much attention from the machine learning community. Unlike the Kullback-Leibler divergence, which strictly…”
    Get full text
    Journal Article
  13. 13

    Causally Correct Partial Models for Reinforcement Learning by Rezende, Danilo J, Danihelka, Ivo, Papamakarios, George, Ke, Nan Rosemary, Jiang, Ray, Weber, Theophane, Gregor, Karol, Merzic, Hamza, Viola, Fabio, Wang, Jane, Mitrovic, Jovana, Besse, Frederic, Antonoglou, Ioannis, Buesing, Lars

    Published 07-02-2020
    “…In reinforcement learning, we can learn a model of future observations and rewards, and use it to plan the agent's next actions. However, jointly modeling…”
    Get full text
    Journal Article
  14. 14

    Video Pixel Networks by Kalchbrenner, Nal, Oord, Aaron van den, Simonyan, Karen, Danihelka, Ivo, Vinyals, Oriol, Graves, Alex, Kavukcuoglu, Koray

    Published 03-10-2016
    “…We propose a probabilistic video model, the Video Pixel Network (VPN), that estimates the discrete joint distribution of the raw pixel values in a video. The…”
    Get full text
    Journal Article
  15. 15

    Towards Conceptual Compression by Gregor, Karol, Besse, Frederic, Rezende, Danilo Jimenez, Danihelka, Ivo, Wierstra, Daan

    Published 29-04-2016
    “…We introduce a simple recurrent variational auto-encoder architecture that significantly improves image modeling. The system represents the state-of-the-art in…”
    Get full text
    Journal Article
  16. 16

    One-Shot Generalization in Deep Generative Models by Rezende, Danilo Jimenez, Mohamed, Shakir, Danihelka, Ivo, Gregor, Karol, Wierstra, Daan

    Published 16-03-2016
    “…Humans have an impressive ability to reason about new concepts and experiences from just a single example. In particular, humans have an ability for one-shot…”
    Get full text
    Journal Article
  17. 17
  18. 18

    Scaling Memory-Augmented Neural Networks with Sparse Reads and Writes by Rae, Jack W, Hunt, Jonathan J, Harley, Tim, Danihelka, Ivo, Senior, Andrew, Wayne, Greg, Graves, Alex, Lillicrap, Timothy P

    Published 27-10-2016
    “…Neural networks augmented with external memory have the ability to learn algorithmic solutions to complex tasks. These models appear promising for applications…”
    Get full text
    Journal Article
  19. 19

    Deep AutoRegressive Networks by Gregor, Karol, Danihelka, Ivo, Mnih, Andriy, Blundell, Charles, Wierstra, Daan

    Published 31-10-2013
    “…Karol Gregor, Ivo Danihelka, Andriy Mnih, Charles Blundell, Daan Wierstra. Deep AutoRegressive Networks. In Proceedings of the 31st International Conference on…”
    Get full text
    Journal Article
  20. 20

    DRAW: A Recurrent Neural Network For Image Generation by Gregor, Karol, Danihelka, Ivo, Graves, Alex, Rezende, Danilo Jimenez, Wierstra, Daan

    Published 16-02-2015
    “…This paper introduces the Deep Recurrent Attentive Writer (DRAW) neural network architecture for image generation. DRAW networks combine a novel spatial…”
    Get full text
    Journal Article