Search Results - "Dalibard, Valentin" :: Katalog Arama

1
Grandmaster level in StarCraft II using multi-agent reinforcement learning by Vinyals, Oriol, Babuschkin, Igor, Czarnecki, Wojciech M., Mathieu, Michaël, Dudzik, Andrew, Chung, Junyoung, Choi, David H., Powell, Richard, Ewalds, Timo, Georgiev, Petko, Oh, Junhyuk, Horgan, Dan, Kroiss, Manuel, Danihelka, Ivo, Huang, Aja, Sifre, Laurent, Cai, Trevor, Agapiou, John P., Jaderberg, Max, Vezhnevets, Alexander S., Leblond, Rémi, Pohlen, Tobias, Dalibard, Valentin, Budden, David, Sulsky, Yury, Molloy, James, Paine, Tom L., Gulcehre, Caglar, Wang, Ziyu, Pfaff, Tobias, Wu, Yuhuai, Ring, Roman, Yogatama, Dani, Wünsch, Dario, McKinney, Katrina, Smith, Oliver, Schaul, Tom, Lillicrap, Timothy, Kavukcuoglu, Koray, Hassabis, Demis, Apps, Chris, Silver, David

Published in Nature (London) (01-11-2019)
“…Many real-world applications require artificial agents to compete and coordinate with other agents in complex environments. As a stepping stone to this goal,…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
2
Faster Improvement Rate Population Based Training by Dalibard, Valentin, Jaderberg, Max

Published 28-09-2021
“…The successful training of neural networks typically involves careful and time consuming hyperparameter tuning. Population Based Training (PBT) has recently…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
3
Discovering Attention-Based Genetic Algorithms via Meta-Black-Box Optimization by Lange, Robert Tjarko, Schaul, Tom, Chen, Yutian, Lu, Chris, Zahavy, Tom, Dalibard, Valentin, Flennerhag, Sebastian

Published 08-04-2023
“…Genetic algorithms constitute a family of black-box optimization algorithms, which take inspiration from the principles of biological evolution. While they…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
4
DemoStart: Demonstration-led auto-curriculum applied to sim-to-real with multi-fingered robots by Bauza, Maria, Chen, Jose Enrique, Dalibard, Valentin, Gileadi, Nimrod, Hafner, Roland, Martins, Murilo F, Moore, Joss, Pevceviciute, Rugile, Laurens, Antoine, Rao, Dushyant, Zambelli, Martina, Riedmiller, Martin, Scholz, Jon, Bousmalis, Konstantinos, Nori, Francesco, Heess, Nicolas

Published 10-09-2024
“…We present DemoStart, a novel auto-curriculum reinforcement learning method capable of learning complex manipulation behaviors on an arm equipped with a…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
5
Rapid training of deep neural networks without skip connections or normalization layers using Deep Kernel Shaping by Martens, James, Ballard, Andy, Desjardins, Guillaume, Swirszcz, Grzegorz, Dalibard, Valentin, Sohl-Dickstein, Jascha, Schoenholz, Samuel S

Published 04-10-2021
“…Using an extended and formalized version of the Q/C map analysis of Poole et al. (2016), along with Neural Tangent Kernel theory, we identify the main…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
6
Perception-Prediction-Reaction Agents for Deep Reinforcement Learning by Stooke, Adam, Dalibard, Valentin, Jayakumar, Siddhant M, Czarnecki, Wojciech M, Jaderberg, Max

Published 26-06-2020
“…We introduce a new recurrent agent architecture and associated auxiliary losses which improve reinforcement learning in partially observable tasks requiring…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
7
Tuning the Scheduling of Distributed Stochastic Gradient Descent with Bayesian Optimization by Dalibard, Valentin, Schaarschmidt, Michael, Yoneki, Eiko

Published 01-12-2016
“…We present an optimizer which uses Bayesian optimization to tune the system parameters of distributed stochastic gradient descent (SGD). Given a specific…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
8
RoboCat: A Self-Improving Generalist Agent for Robotic Manipulation by Bousmalis, Konstantinos, Vezzani, Giulia, Rao, Dushyant, Devin, Coline, Lee, Alex X, Bauza, Maria, Davchev, Todor, Zhou, Yuxiang, Gupta, Agrim, Raju, Akhil, Laurens, Antoine, Fantacci, Claudio, Dalibard, Valentin, Zambelli, Martina, Martins, Murilo, Pevceviciute, Rugile, Blokzijl, Michiel, Denil, Misha, Batchelor, Nathan, Lampe, Thomas, Parisotto, Emilio, Żołna, Konrad, Reed, Scott, Colmenarejo, Sergio Gómez, Scholz, Jon, Abdolmaleki, Abbas, Groth, Oliver, Regli, Jean-Baptiste, Sushkov, Oleg, Rothörl, Tom, Chen, José Enrique, Aytar, Yusuf, Barker, Dave, Ortiz, Joy, Riedmiller, Martin, Springenberg, Jost Tobias, Hadsell, Raia, Nori, Francesco, Heess, Nicolas

Published 20-06-2023
“…The ability to leverage heterogeneous robotic experience from different robots and tasks to quickly master novel skills and embodiments has the potential to…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
9
Open-Ended Learning Leads to Generally Capable Agents by Open Ended Learning Team, Stooke, Adam, Mahajan, Anuj, Barros, Catarina, Deck, Charlie, Bauer, Jakob, Sygnowski, Jakub, Trebacz, Maja, Jaderberg, Max, Mathieu, Michael, McAleese, Nat, Bradley-Schmieg, Nathalie, Wong, Nathaniel, Porcel, Nicolas, Raileanu, Roberta, Hughes-Fitt, Steph, Dalibard, Valentin, Czarnecki, Wojciech Marian

Published 27-07-2021
“…In this work we create agents that can perform well beyond a single, individual task, that exhibit much wider generalisation of behaviour to a massive, rich…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
10
Learning Runtime Parameters in Computer Systems with Delayed Experience Injection by Schaarschmidt, Michael, Gessert, Felix, Dalibard, Valentin, Yoneki, Eiko

Published 31-10-2016
“…Learning effective configurations in computer systems without hand-crafting models for every parameter is a long-standing problem. This paper investigates the…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
11
A Generalized Framework for Population Based Training by Li, Ang, Spyra, Aleksandra, Perel, Sagi, Dalibard, Valentin, Jaderberg, Max, Gu, Chenjie, Budden, David, Harley, Tim, Gupta, Pramod

Published 05-02-2019
“…Population Based Training (PBT) is a recent approach that jointly optimizes neural network weights and hyperparameters which periodically copies weights of the…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
12
Population Based Training of Neural Networks by Jaderberg, Max, Dalibard, Valentin, Osindero, Simon, Czarnecki, Wojciech M, Donahue, Jeff, Razavi, Ali, Vinyals, Oriol, Green, Tim, Dunning, Iain, Simonyan, Karen, Fernando, Chrisantha, Kavukcuoglu, Koray

Published 27-11-2017
“…Neural networks dominate the modern machine learning landscape, but their training and success still suffer from sensitivity to empirical choices of…”

Get full text

Journal Article
QR Code
Save to List

Saved in: