Search Results - "Song, Xingyou"

  1.

    Automated Reinforcement Learning (AutoRL): A Survey and Open Problems by Parker-Holder, Jack, Rajan, Raghu, Song, Xingyou, Biedenkapp, André, Miao, Yingjie, Eimer, Theresa, Zhang, Baohe, Nguyen, Vu, Calandra, Roberto, Faust, Aleksandra, Hutter, Frank, Lindauer, Marius

    “…The combination of Reinforcement Learning (RL) with deep learning has led to a series of impressive feats, with many believing (deep) RL provides a path…”
    Journal Article
  2.
  3.

    Rapidly Adaptable Legged Robots via Evolutionary Meta-Learning by Song, Xingyou, Yang, Yuxiang, Choromanski, Krzysztof, Caluwaerts, Ken, Gao, Wenbo, Finn, Chelsea, Tan, Jie

    “…Learning adaptable policies is crucial for robots to operate autonomously in our complex and quickly changing world. In this work, we present a new…”
    Conference Proceeding
  4.

    Robotic Table Tennis with Model-Free Reinforcement Learning by Gao, Wenbo, Graesser, Laura, Choromanski, Krzysztof, Song, Xingyou, Lazic, Nevena, Sanketi, Pannag, Sindhwani, Vikas, Jaitly, Navdeep

    “…We propose a model-free algorithm for learning efficient policies capable of returning table tennis balls by controlling robot joints at a rate of 100Hz. We…”
    Conference Proceeding
  5.

    Discovering Adaptable Symbolic Algorithms from Scratch by Kelly, Stephen, Park, Daniel S., Song, Xingyou, McIntire, Mitchell, Nashikkar, Pranav, Guha, Ritam, Banzhaf, Wolfgang, Deb, Kalyanmoy, Boddeti, Vishnu Naresh, Tan, Jie, Real, Esteban

    “…Autonomous robots deployed in the real world will need control policies that rapidly adapt to environmental changes. To this end, we propose AutoRobotics-Zero…”
    Conference Proceeding
  6.

    Quantum Cellular Automata Models for General Dirac Equation by Song, Xingyou

    Published 10-10-2016
    “…The goal of this study is to provide an exact unitary quantum cellular automata that, under discrete time steps, converges towards the Generalized Dirac…”
    Journal Article
  7.

    The Principle of Unchanged Optimality in Reinforcement Learning Generalization by Irpan, Alex, Song, Xingyou

    Published 01-06-2019
    “…Several recent papers have examined generalization in reinforcement learning (RL), by proposing new environments or ways to add noise to existing environments,…”
    Journal Article
  8.

    Hardness of Low Rank Approximation of Entrywise Transformed Matrix Products by Sarlos, Tamas, Song, Xingyou, Woodruff, David, Zhang, Qiuyi

    Published 03-11-2023
    “…Inspired by fast algorithms in natural language processing, we study low rank approximation in the entrywise transformed setting where we want to find a good…”
    Journal Article
  9.

    Position: Leverage Foundational Models for Black-Box Optimization by Song, Xingyou, Tian, Yingtao, Lange, Robert Tjarko, Lee, Chansoo, Tang, Yujin, Chen, Yutian

    Published 06-05-2024
    “…Undeniably, Large Language Models (LLMs) have stirred an extraordinary wave of innovation in the machine learning research domain, resulting in substantial…”
    Journal Article
  10.

    Open Source Vizier: Distributed Infrastructure and API for Reliable and Flexible Blackbox Optimization by Song, Xingyou, Perel, Sagi, Lee, Chansoo, Kochanski, Greg, Golovin, Daniel

    Published 27-07-2022
    “…Vizier is the de-facto blackbox and hyperparameter optimization service across Google, having optimized some of Google's largest products and research efforts…”
    Journal Article
  11.

    OmniPred: Language Models as Universal Regressors by Song, Xingyou, Li, Oscar, Lee, Chansoo, Yang, Bangding, Peng, Daiyi, Perel, Sagi, Chen, Yutian

    Published 22-02-2024
    “…Over the broad landscape of experimental design, regression has been a powerful tool to accurately predict the outcome metrics of a system or model given a set…”
    Journal Article
  12.

    Predicting from Strings: Language Model Embeddings for Bayesian Optimization by Nguyen, Tung, Zhang, Qiuyi, Yang, Bangding, Lee, Chansoo, Bornschein, Jorg, Miao, Yingjie, Perel, Sagi, Chen, Yutian, Song, Xingyou

    Published 14-10-2024
    “…Bayesian Optimization is ubiquitous in the field of experimental design and blackbox optimization for improving search efficiency, but has been traditionally…”
    Journal Article
  13.

    Debiasing a First-order Heuristic for Approximate Bi-level Optimization by Likhosherstov, Valerii, Song, Xingyou, Choromanski, Krzysztof, Davis, Jared, Weller, Adrian

    Published 04-06-2021
    “…Approximate bi-level optimization (ABLO) consists of (outer-level) optimization problems, involving numerical (inner-level) optimization loops. While ABLO has…”
    Journal Article
  14.

    The Vizier Gaussian Process Bandit Algorithm by Song, Xingyou, Zhang, Qiuyi, Lee, Chansoo, Fertig, Emily, Huang, Tzu-Kuo, Belenki, Lior, Kochanski, Greg, Ariafar, Setareh, Vasudevan, Srinivas, Perel, Sagi, Golovin, Daniel

    Published 21-08-2024
    “…Google Vizier has performed millions of optimizations and accelerated numerous research and production systems at Google, demonstrating the success of Bayesian…”
    Journal Article
  15.

    An Empirical Study on Hyperparameters and their Interdependence for RL Generalization by Song, Xingyou, Du, Yilun, Jackson, Jacob

    Published 02-06-2019
    “…Recent results in Reinforcement Learning (RL) have shown that agents with limited training environments are susceptible to a large amount of overfitting across…”
    Journal Article
  16.

    Sub-Linear Memory: How to Make Performers SLiM by Likhosherstov, Valerii, Choromanski, Krzysztof, Davis, Jared, Song, Xingyou, Weller, Adrian

    Published 21-12-2020
    “…The Transformer architecture has revolutionized deep learning on sequential data, becoming ubiquitous in state-of-the-art solutions for a wide variety of…”
    Journal Article
  17.

    UFO-BLO: Unbiased First-Order Bilevel Optimization by Likhosherstov, Valerii, Song, Xingyou, Choromanski, Krzysztof, Davis, Jared, Weller, Adrian

    Published 05-06-2020
    “…Bilevel optimization (BLO) is a popular approach with many applications including hyperparameter optimization, neural architecture search, adversarial…”
    Journal Article
  18.

    Discovering Adaptable Symbolic Algorithms from Scratch by Kelly, Stephen, Park, Daniel S, Song, Xingyou, McIntire, Mitchell, Nashikkar, Pranav, Guha, Ritam, Banzhaf, Wolfgang, Deb, Kalyanmoy, Boddeti, Vishnu Naresh, Tan, Jie, Real, Esteban

    Published 31-07-2023
    “…Autonomous robots deployed in the real world will need control policies that rapidly adapt to environmental changes. To this end, we propose AutoRobotics-Zero…”
    Journal Article
  19.

    Observational Overfitting in Reinforcement Learning by Song, Xingyou, Jiang, Yiding, Tu, Stephen, Du, Yilun, Neyshabur, Behnam

    Published 05-12-2019
    “…A major component of overfitting in model-free reinforcement learning (RL) involves the case where the agent may mistakenly correlate reward with certain…”
    Journal Article
  20.

    Differentiable Architecture Search for Reinforcement Learning by Miao, Yingjie, Song, Xingyou, Co-Reyes, John D, Peng, Daiyi, Yue, Summer, Brevdo, Eugene, Faust, Aleksandra

    Published 03-06-2021
    “…In this paper, we investigate the fundamental question: To what extent are gradient-based neural architecture search (NAS) techniques applicable to RL? Using…”
    Journal Article