Search Results - "Dalibard, Valentin"
-
1
Grandmaster level in StarCraft II using multi-agent reinforcement learning
Published in Nature (London) (01-11-2019)“…Many real-world applications require artificial agents to compete and coordinate with other agents in complex environments. As a stepping stone to this goal,…”
Get full text
Journal Article -
2
Faster Improvement Rate Population Based Training
Published 28-09-2021“…The successful training of neural networks typically involves careful and time consuming hyperparameter tuning. Population Based Training (PBT) has recently…”
Get full text
Journal Article -
3
Discovering Attention-Based Genetic Algorithms via Meta-Black-Box Optimization
Published 08-04-2023“…Genetic algorithms constitute a family of black-box optimization algorithms, which take inspiration from the principles of biological evolution. While they…”
Get full text
Journal Article -
4
DemoStart: Demonstration-led auto-curriculum applied to sim-to-real with multi-fingered robots
Published 10-09-2024“…We present DemoStart, a novel auto-curriculum reinforcement learning method capable of learning complex manipulation behaviors on an arm equipped with a…”
Get full text
Journal Article -
5
Rapid training of deep neural networks without skip connections or normalization layers using Deep Kernel Shaping
Published 04-10-2021“…Using an extended and formalized version of the Q/C map analysis of Poole et al. (2016), along with Neural Tangent Kernel theory, we identify the main…”
Get full text
Journal Article -
6
Perception-Prediction-Reaction Agents for Deep Reinforcement Learning
Published 26-06-2020“…We introduce a new recurrent agent architecture and associated auxiliary losses which improve reinforcement learning in partially observable tasks requiring…”
Get full text
Journal Article -
7
Tuning the Scheduling of Distributed Stochastic Gradient Descent with Bayesian Optimization
Published 01-12-2016“…We present an optimizer which uses Bayesian optimization to tune the system parameters of distributed stochastic gradient descent (SGD). Given a specific…”
Get full text
Journal Article -
8
RoboCat: A Self-Improving Generalist Agent for Robotic Manipulation
Published 20-06-2023“…The ability to leverage heterogeneous robotic experience from different robots and tasks to quickly master novel skills and embodiments has the potential to…”
Get full text
Journal Article -
9
Open-Ended Learning Leads to Generally Capable Agents
Published 27-07-2021“…In this work we create agents that can perform well beyond a single, individual task, that exhibit much wider generalisation of behaviour to a massive, rich…”
Get full text
Journal Article -
10
Learning Runtime Parameters in Computer Systems with Delayed Experience Injection
Published 31-10-2016“…Learning effective configurations in computer systems without hand-crafting models for every parameter is a long-standing problem. This paper investigates the…”
Get full text
Journal Article -
11
A Generalized Framework for Population Based Training
Published 05-02-2019“…Population Based Training (PBT) is a recent approach that jointly optimizes neural network weights and hyperparameters which periodically copies weights of the…”
Get full text
Journal Article -
12
Population Based Training of Neural Networks
Published 27-11-2017“…Neural networks dominate the modern machine learning landscape, but their training and success still suffer from sensitivity to empirical choices of…”
Get full text
Journal Article