Search Results - "Toyama, Daniel"
-
1
Not All LLM Reasoners Are Created Equal
Published 02-10-2024“…We study the depth of grade-school math (GSM) problem-solving capabilities of LLMs. To this end, we evaluate their performance on pairs of existing math word…”
Get full text
Journal Article -
2
Learning how to Interact with a Complex Interface using Hierarchical Reinforcement Learning
Published 21-04-2022“…Hierarchical Reinforcement Learning (HRL) allows interactive agents to decompose complex problems into a hierarchy of sub-tasks. Higher-level tasks can invoke…”
Get full text
Journal Article -
3
AndroidWorld: A Dynamic Benchmarking Environment for Autonomous Agents
Published 23-05-2024“…Autonomous agents that execute human tasks by controlling computers can enhance human productivity and application accessibility. However, progress in this…”
Get full text
Journal Article -
4
Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search
Published 06-11-2023“…This work studies a central extremal graph theory problem inspired by a 1975 conjecture of Erd\H{o}s, which aims to find graphs with a given size (number of…”
Get full text
Journal Article -
5
AndroidEnv: A Reinforcement Learning Platform for Android
Published 27-05-2021“…We introduce AndroidEnv, an open-source platform for Reinforcement Learning (RL) research built on top of the Android ecosystem. AndroidEnv allows RL agents to…”
Get full text
Journal Article -
6
RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning
Published 04-11-2021“…We introduce RLDS (Reinforcement Learning Datasets), an ecosystem for recording, replaying, manipulating, annotating and sharing data in the context of…”
Get full text
Journal Article -
7
AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning
Published 07-08-2023“…StarCraft II is one of the most challenging simulated reinforcement learning environments; it is partially observable, stochastic, multi-agent, and mastering…”
Get full text
Journal Article -
8
The Option Keyboard: Combining Skills in Reinforcement Learning
Published 24-06-2021“…The ability to combine known skills to create new ones may be crucial in the solution of complex reinforcement learning problems that unfold over extended…”
Get full text
Journal Article -
9
Scaling Language Models: Methods, Analysis & Insights from Training Gopher
Published 08-12-2021“…Language modelling provides a step towards intelligent communication systems by harnessing large repositories of written human knowledge to better predict and…”
Get full text
Journal Article