Search Results - "Sifre, Laurent"
-
1
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play
Published in Science (American Association for the Advancement of Science) (07-12-2018)“…The game of chess is the longest-studied domain in the history of artificial intelligence. The strongest programs are based on a combination of sophisticated…”
Get full text
Journal Article -
2
Rotation, Scaling and Deformation Invariant Scattering for Texture Discrimination
Published in 2013 IEEE Conference on Computer Vision and Pattern Recognition (01-06-2013)“…An affine invariant representation is constructed with a cascade of invariants, which preserves information for classification. A joint translation and…”
Get full text
Conference Proceeding -
3
Mastering the game of Go without human knowledge
Published in Nature (London) (19-10-2017)“…A long-standing goal of artificial intelligence is an algorithm that learns, tabula rasa , superhuman proficiency in challenging domains. Recently, AlphaGo…”
Get full text
Journal Article -
4
Protein structure prediction using multiple deep neural networks in the 13th Critical Assessment of Protein Structure Prediction (CASP13)
Published in Proteins, structure, function, and bioinformatics (01-12-2019)“…We describe AlphaFold, the protein structure prediction system that was entered by the group A7D in CASP13. Submissions were made by three free‐modeling (FM)…”
Get full text
Journal Article -
5
Mastering Atari, Go, chess and shogi by planning with a learned model
Published in Nature (London) (24-12-2020)“…Constructing agents with planning capabilities has long been one of the main challenges in the pursuit of artificial intelligence. Tree-based planning methods…”
Get full text
Journal Article -
6
Improved protein structure prediction using potentials from deep learning
Published in Nature (London) (30-01-2020)“…Protein structure prediction can be used to determine the three-dimensional shape of a protein from its amino acid sequence 1 . This problem is of fundamental…”
Get full text
Journal Article -
7
Mastering the game of Go with deep neural networks and tree search
Published in Nature (London) (28-01-2016)“…The game of Go has long been viewed as the most challenging of classic games for artificial intelligence owing to its enormous search space and the difficulty…”
Get full text
Journal Article -
8
Grandmaster level in StarCraft II using multi-agent reinforcement learning
Published in Nature (London) (01-11-2019)“…Many real-world applications require artificial agents to compete and coordinate with other agents in complex environments. As a stepping stone to this goal,…”
Get full text
Journal Article -
9
Mastering the game of Stratego with model-free multiagent reinforcement learning
Published in Science (American Association for the Advancement of Science) (02-12-2022)“…We introduce DeepNash, an autonomous agent that plays the imperfect information game Stratego at a human expert level. Stratego is one of the few iconic board…”
Get full text
Journal Article -
10
Accelerating Large Language Model Decoding with Speculative Sampling
Published 02-02-2023“…We present speculative sampling, an algorithm for accelerating transformer decoding by enabling the generation of multiple tokens from each transformer call…”
Get full text
Journal Article -
11
Large-Scale Retrieval for Reinforcement Learning
Published 10-06-2022“…Effective decision making involves flexibly relating past experiences and relevant contextual information to a novel situation. In deep reinforcement learning…”
Get full text
Journal Article -
12
Self-conditioned Embedding Diffusion for Text Generation
Published 08-11-2022“…Can continuous diffusion models bring the same performance breakthrough on natural language they did for image generation? To circumvent the discrete nature of…”
Get full text
Journal Article -
13
Rigid-Motion Scattering for Texture Classification
Published 07-03-2014“…A rigid-motion scattering computes adaptive invariants along translations and rotations, with a deep convolutional network. Convolutions are calculated on the…”
Get full text
Journal Article -
14
Muesli: Combining Improvements in Policy Optimization
Published 13-04-2021“…We propose a novel policy update that combines regularized policy optimization with model learning as an auxiliary loss. The update (henceforth Muesli) matches…”
Get full text
Journal Article -
15
Machine Translation Decoding beyond Beam Search
Published 12-04-2021“…Beam search is the go-to method for decoding auto-regressive machine translation models. While it yields consistent improvements in terms of BLEU, it is only…”
Get full text
Journal Article -
16
Retrieval-Augmented Reinforcement Learning
Published 16-02-2022“…Most deep reinforcement learning (RL) algorithms distill experience into parametric behavior policies or value functions via gradient updates. While effective,…”
Get full text
Journal Article -
17
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models
Published 11-04-2024“…We introduce RecurrentGemma, a family of open language models which uses Google's novel Griffin architecture. Griffin combines linear recurrences with local…”
Get full text
Journal Article -
18
Training Compute-Optimal Large Language Models
Published 29-03-2022“…We investigate the optimal model size and number of tokens for training a transformer language model under a given compute budget. We find that current large…”
Get full text
Journal Article -
19
Unified Scaling Laws for Routed Language Models
Published 02-02-2022“…The performance of a language model has been shown to be effectively modeled as a power-law in its parameter count. Here we study the scaling behaviors of…”
Get full text
Journal Article -
20
Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning
Published 30-06-2022“…We introduce DeepNash, an autonomous agent capable of learning to play the imperfect information game Stratego from scratch, up to a human expert level…”
Get full text
Journal Article