Search Results - "Kurin, Vitaly"

  • Showing 1 - 16 results of 16
Refine Results
  1. 1

    Towards a Principled Integration of Multi-camera Re-identification and Tracking Through Optimal Bayes Filters by Beyer, Lucas, Breuers, Stefan, Kurin, Vitaly, Leibe, Bastian

    “…With the rise of end-to-end learning through deep learning, person detectors and re-identification (ReID) models have recently become very strong. Multi-target…”
    Get full text
    Conference Proceeding
  2. 2

    Learning From Demonstration in the Wild by Behbahani, Feryal, Shiarlis, Kyriacos, Chen, Xi, Kurin, Vitaly, Kasewa, Sudhanshu, Stirbu, Ciprian, Gomes, Joao, Paul, Supratik, Oliehoek, Frans A., Messias, Joao, Whiteson, Shimon

    “…Learning from demonstration (LfD) is useful in settings where hand-coding behaviour or a reward function is impractical. It has succeeded in a wide range of…”
    Get full text
    Conference Proceeding
  3. 3

    Deep Coordination Graphs by Böhmer, Wendelin, Kurin, Vitaly, Whiteson, Shimon

    Published 27-09-2019
    “…This paper introduces the deep coordination graph (DCG) for collaborative multi-agent reinforcement learning. DCG strikes a flexible trade-off between…”
    Get full text
    Journal Article
  4. 4

    Snowflake: Scaling GNNs to High-Dimensional Continuous Control via Parameter Freezing by Blake, Charlie, Kurin, Vitaly, Igl, Maximilian, Whiteson, Shimon

    Published 01-03-2021
    “…Recent research has shown that graph neural networks (GNNs) can learn policies for locomotion control that are as effective as a typical multi-layer perceptron…”
    Get full text
    Journal Article
  5. 5

    Fast Efficient Hyperparameter Tuning for Policy Gradients by Paul, Supratik, Kurin, Vitaly, Whiteson, Shimon

    Published 18-02-2019
    “…The performance of policy gradient methods is sensitive to hyperparameter settings that must be tuned for any new application. Widely used grid search methods…”
    Get full text
    Journal Article
  6. 6

    My Body is a Cage: the Role of Morphology in Graph-Based Incompatible Control by Kurin, Vitaly, Igl, Maximilian, Rocktäschel, Tim, Boehmer, Wendelin, Whiteson, Shimon

    Published 05-10-2020
    “…Multitask Reinforcement Learning is a promising way to obtain models with better performance, generalisation, data efficiency, and robustness. Most existing…”
    Get full text
    Journal Article
  7. 7

    You May Not Need Ratio Clipping in PPO by Sun, Mingfei, Kurin, Vitaly, Liu, Guoqing, Devlin, Sam, Qin, Tao, Hofmann, Katja, Whiteson, Shimon

    Published 31-01-2022
    “…Proximal Policy Optimization (PPO) methods learn a policy by iteratively performing multiple mini-batch optimization epochs of a surrogate objective with one…”
    Get full text
    Journal Article
  8. 8

    In Defense of the Unitary Scalarization for Deep Multi-Task Learning by Kurin, Vitaly, De Palma, Alessandro, Kostrikov, Ilya, Whiteson, Shimon, Kumar, M. Pawan

    Published 11-01-2022
    “…Recent multi-task learning research argues against unitary scalarization, where training simply minimizes the sum of the task losses. Several ad-hoc multi-task…”
    Get full text
    Journal Article
  9. 9

    Can $Q$-Learning with Graph Networks Learn a Generalizable Branching Heuristic for a SAT Solver? by Kurin, Vitaly, Godil, Saad, Whiteson, Shimon, Catanzaro, Bryan

    Published 25-09-2019
    “…We present Graph-$Q$-SAT, a branching heuristic for a Boolean SAT solver trained with value-based reinforcement learning (RL) using Graph Neural Networks for…”
    Get full text
    Journal Article
  10. 10

    A Generalist Neural Algorithmic Learner by Ibarz, Borja, Kurin, Vitaly, Papamakarios, George, Nikiforou, Kyriacos, Bennani, Mehdi, Csordás, Róbert, Dudzik, Andrew, Bošnjak, Matko, Vitvitskyi, Alex, Rubanova, Yulia, Deac, Andreea, Bevilacqua, Beatrice, Ganin, Yaroslav, Blundell, Charles, Veličković, Petar

    Published 22-09-2022
    “…The cornerstone of neural algorithmic reasoning is the ability to solve algorithmic tasks, especially in a way that generalises out of distribution. While…”
    Get full text
    Journal Article
  11. 11

    MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research by Samvelyan, Mikayel, Kirk, Robert, Kurin, Vitaly, Parker-Holder, Jack, Jiang, Minqi, Hambro, Eric, Petroni, Fabio, Küttler, Heinrich, Grefenstette, Edward, Rocktäschel, Tim

    Published 27-09-2021
    “…Progress in deep reinforcement learning (RL) is heavily driven by the availability of challenging benchmarks used for training agents. However, benchmarks that…”
    Get full text
    Journal Article
  12. 12

    The Atari Grand Challenge Dataset by Kurin, Vitaly, Nowozin, Sebastian, Hofmann, Katja, Beyer, Lucas, Leibe, Bastian

    Published 31-05-2017
    “…Recent progress in Reinforcement Learning (RL), fueled by its combination, with Deep Learning has enabled impressive results in learning to interact with…”
    Get full text
    Journal Article
  13. 13

    Towards a Principled Integration of Multi-Camera Re-Identification and Tracking through Optimal Bayes Filters by Beyer, Lucas, Breuers, Stefan, Kurin, Vitaly, Leibe, Bastian

    Published 12-05-2017
    “…With the rise of end-to-end learning through deep learning, person detectors and re-identification (ReID) models have recently become very strong. Multi-camera…”
    Get full text
    Journal Article
  14. 14

    Fast Context Adaptation via Meta-Learning by Zintgraf, Luisa M, Shiarlis, Kyriacos, Kurin, Vitaly, Hofmann, Katja, Whiteson, Shimon

    Published 08-10-2018
    “…We propose CAVIA for meta-learning, a simple extension to MAML that is less prone to meta-overfitting, easier to parallelise, and more interpretable. CAVIA…”
    Get full text
    Journal Article
  15. 15
  16. 16

    Learning from Demonstration in the Wild by Behbahani, Feryal, Shiarlis, Kyriacos, Chen, Xi, Kurin, Vitaly, Kasewa, Sudhanshu, Stirbu, Ciprian, Gomes, João, Paul, Supratik, Oliehoek, Frans A, Messias, João, Whiteson, Shimon

    Published 08-11-2018
    “…Learning from demonstration (LfD) is useful in settings where hand-coding behaviour or a reward function is impractical. It has succeeded in a wide range of…”
    Get full text
    Journal Article