Search Results - "Song, Xingyou"
1
Automated Reinforcement Learning (AutoRL): A Survey and Open Problems
Published in The Journal of artificial intelligence research (2022)
“…The combination of Reinforcement Learning (RL) with deep learning has led to a series of impressive feats, with many believing (deep) RL provides a path…”
Journal Article
2
3
Rapidly Adaptable Legged Robots via Evolutionary Meta-Learning
Published in 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (24-10-2020)
“…Learning adaptable policies is crucial for robots to operate autonomously in our complex and quickly changing world. In this work, we present a new…”
Conference Proceeding
4
Robotic Table Tennis with Model-Free Reinforcement Learning
Published in 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (24-10-2020)
“…We propose a model-free algorithm for learning efficient policies capable of returning table tennis balls by controlling robot joints at a rate of 100Hz. We…”
Conference Proceeding
5
Discovering Adaptable Symbolic Algorithms from Scratch
Published in 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (01-10-2023)
“…Autonomous robots deployed in the real world will need control policies that rapidly adapt to environmental changes. To this end, we propose AutoRobotics-Zero…”
Conference Proceeding
6
Quantum Cellular Automata Models for General Dirac Equation
Published 10-10-2016
“…The goal of this study is to provide an exact unitary quantum cellular automata that, under discrete time steps, converges towards the Generalized Dirac…”
Journal Article
7
The Principle of Unchanged Optimality in Reinforcement Learning Generalization
Published 01-06-2019
“…Several recent papers have examined generalization in reinforcement learning (RL), by proposing new environments or ways to add noise to existing environments,…”
Journal Article
8
Hardness of Low Rank Approximation of Entrywise Transformed Matrix Products
Published 03-11-2023
“…Inspired by fast algorithms in natural language processing, we study low rank approximation in the entrywise transformed setting where we want to find a good…”
Journal Article
9
Position: Leverage Foundational Models for Black-Box Optimization
Published 06-05-2024
“…Undeniably, Large Language Models (LLMs) have stirred an extraordinary wave of innovation in the machine learning research domain, resulting in substantial…”
Journal Article
10
Open Source Vizier: Distributed Infrastructure and API for Reliable and Flexible Blackbox Optimization
Published 27-07-2022
“…Vizier is the de-facto blackbox and hyperparameter optimization service across Google, having optimized some of Google's largest products and research efforts…”
Journal Article
11
OmniPred: Language Models as Universal Regressors
Published 22-02-2024
“…Over the broad landscape of experimental design, regression has been a powerful tool to accurately predict the outcome metrics of a system or model given a set…”
Journal Article
12
Predicting from Strings: Language Model Embeddings for Bayesian Optimization
Published 14-10-2024
“…Bayesian Optimization is ubiquitous in the field of experimental design and blackbox optimization for improving search efficiency, but has been traditionally…”
Journal Article
13
Debiasing a First-order Heuristic for Approximate Bi-level Optimization
Published 04-06-2021
“…Approximate bi-level optimization (ABLO) consists of (outer-level) optimization problems, involving numerical (inner-level) optimization loops. While ABLO has…”
Journal Article
14
The Vizier Gaussian Process Bandit Algorithm
Published 21-08-2024
“…Google Vizier has performed millions of optimizations and accelerated numerous research and production systems at Google, demonstrating the success of Bayesian…”
Journal Article
15
An Empirical Study on Hyperparameters and their Interdependence for RL Generalization
Published 02-06-2019
“…Recent results in Reinforcement Learning (RL) have shown that agents with limited training environments are susceptible to a large amount of overfitting across…”
Journal Article
16
Sub-Linear Memory: How to Make Performers SLiM
Published 21-12-2020
“…The Transformer architecture has revolutionized deep learning on sequential data, becoming ubiquitous in state-of-the-art solutions for a wide variety of…”
Journal Article
17
UFO-BLO: Unbiased First-Order Bilevel Optimization
Published 05-06-2020
“…Bilevel optimization (BLO) is a popular approach with many applications including hyperparameter optimization, neural architecture search, adversarial…”
Journal Article
18
Discovering Adaptable Symbolic Algorithms from Scratch
Published 31-07-2023
“…Autonomous robots deployed in the real world will need control policies that rapidly adapt to environmental changes. To this end, we propose AutoRobotics-Zero…”
Journal Article
19
Observational Overfitting in Reinforcement Learning
Published 05-12-2019
“…A major component of overfitting in model-free reinforcement learning (RL) involves the case where the agent may mistakenly correlate reward with certain…”
Journal Article
20
Differentiable Architecture Search for Reinforcement Learning
Published 03-06-2021
“…In this paper, we investigate the fundamental question: To what extent are gradient-based neural architecture search (NAS) techniques applicable to RL? Using…”
Journal Article