Search Results - "Kurin, Vitaly"
1
Towards a Principled Integration of Multi-camera Re-identification and Tracking Through Optimal Bayes Filters
Published in 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) (01-07-2017) “…With the rise of end-to-end learning through deep learning, person detectors and re-identification (ReID) models have recently become very strong. Multi-target…”
Conference Proceeding -
2
Learning From Demonstration in the Wild
Published in 2019 International Conference on Robotics and Automation (ICRA) (01-05-2019) “…Learning from demonstration (LfD) is useful in settings where hand-coding behaviour or a reward function is impractical. It has succeeded in a wide range of…”
Conference Proceeding -
3
Deep Coordination Graphs
Published 27-09-2019 “…This paper introduces the deep coordination graph (DCG) for collaborative multi-agent reinforcement learning. DCG strikes a flexible trade-off between…”
Journal Article -
4
Snowflake: Scaling GNNs to High-Dimensional Continuous Control via Parameter Freezing
Published 01-03-2021 “…Recent research has shown that graph neural networks (GNNs) can learn policies for locomotion control that are as effective as a typical multi-layer perceptron…”
Journal Article -
5
Fast Efficient Hyperparameter Tuning for Policy Gradients
Published 18-02-2019 “…The performance of policy gradient methods is sensitive to hyperparameter settings that must be tuned for any new application. Widely used grid search methods…”
Journal Article -
6
My Body is a Cage: the Role of Morphology in Graph-Based Incompatible Control
Published 05-10-2020 “…Multitask Reinforcement Learning is a promising way to obtain models with better performance, generalisation, data efficiency, and robustness. Most existing…”
Journal Article -
7
You May Not Need Ratio Clipping in PPO
Published 31-01-2022 “…Proximal Policy Optimization (PPO) methods learn a policy by iteratively performing multiple mini-batch optimization epochs of a surrogate objective with one…”
Journal Article -
8
In Defense of the Unitary Scalarization for Deep Multi-Task Learning
Published 11-01-2022 “…Recent multi-task learning research argues against unitary scalarization, where training simply minimizes the sum of the task losses. Several ad-hoc multi-task…”
Journal Article -
9
Can $Q$-Learning with Graph Networks Learn a Generalizable Branching Heuristic for a SAT Solver?
Published 25-09-2019 “…We present Graph-$Q$-SAT, a branching heuristic for a Boolean SAT solver trained with value-based reinforcement learning (RL) using Graph Neural Networks for…”
Journal Article -
10
A Generalist Neural Algorithmic Learner
Published 22-09-2022 “…The cornerstone of neural algorithmic reasoning is the ability to solve algorithmic tasks, especially in a way that generalises out of distribution. While…”
Journal Article -
11
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Published 27-09-2021 “…Progress in deep reinforcement learning (RL) is heavily driven by the availability of challenging benchmarks used for training agents. However, benchmarks that…”
Journal Article -
12
The Atari Grand Challenge Dataset
Published 31-05-2017 “…Recent progress in Reinforcement Learning (RL), fueled by its combination, with Deep Learning has enabled impressive results in learning to interact with…”
Journal Article -
13
Towards a Principled Integration of Multi-Camera Re-Identification and Tracking through Optimal Bayes Filters
Published 12-05-2017 “…With the rise of end-to-end learning through deep learning, person detectors and re-identification (ReID) models have recently become very strong. Multi-camera…”
Journal Article -
14
Fast Context Adaptation via Meta-Learning
Published 08-10-2018 “…We propose CAVIA for meta-learning, a simple extension to MAML that is less prone to meta-overfitting, easier to parallelise, and more interpretable. CAVIA…”
Journal Article -
15
Insights From the NeurIPS 2021 NetHack Challenge
Published 22-03-2022 “…In this report, we summarize the takeaways from the first NeurIPS 2021 NetHack Challenge. Participants were tasked with developing a program or agent that can…”
Journal Article -
16
Learning from Demonstration in the Wild
Published 08-11-2018 “…Learning from demonstration (LfD) is useful in settings where hand-coding behaviour or a reward function is impractical. It has succeeded in a wide range of…”
Journal Article