Search Results - "Suhr, Alane"

Refine Results
  1. 1

    Continual Learning for Grounded Instruction Generation by Observing Human Following Behavior by Kojima, Noriyuki, Suhr, Alane, Artzi, Yoav

    “…We study continual learning for natural language instruction generation, by observing human users’ instruction execution. We focus on a collaborative scenario,…”
    Get full text
    Journal Article
  2. 2

    TOUCHDOWN: Natural Language Navigation and Spatial Reasoning in Visual Street Environments by Chen, Howard, Suhr, Alane, Misra, Dipendra, Snavely, Noah, Artzi, Yoav

    “…We study the problem of jointly reasoning about language and vision through a navigation and spatial reasoning task. We introduce the Touchdown task and…”
    Get full text
    Conference Proceeding
  3. 3

    Reasoning and Learning in Interactive Natural Language Systems by Suhr, Alane Laughlin

    Published 01-01-2022
    “…Systems that support expressive, situated natural language interactions are essential for expanding access to complex computing systems, such as robots and…”
    Get full text
    Dissertation
  4. 4

    Continual Learning for Instruction Following from Realtime Feedback by Suhr, Alane, Artzi, Yoav

    Published 19-12-2022
    “…We propose and deploy an approach to continually train an instruction-following agent from feedback provided by users during collaborative interactions. During…”
    Get full text
    Journal Article
  5. 5

    Grounding Language in Multi-Perspective Referential Communication by Tang, Zineng, Mao, Lingjun, Suhr, Alane

    Published 04-10-2024
    “…We introduce a task and dataset for referring expression generation and comprehension in multi-agent embodied environments. In this task, two agents in a…”
    Get full text
    Journal Article
  6. 6

    Using Language Models to Disambiguate Lexical Choices in Translation by Barua, Josh, Subramanian, Sanjay, Yin, Kayo, Suhr, Alane

    Published 08-11-2024
    “…In translation, a concept represented by a single word in a source language can have multiple variations in a target language. The task of lexical selection…”
    Get full text
    Journal Article
  7. 7

    NLVR2 Visual Bias Analysis by Suhr, Alane, Artzi, Yoav

    Published 23-09-2019
    “…NLVR2 (Suhr et al., 2019) was designed to be robust for language bias through a data collection process that resulted in each natural language sentence…”
    Get full text
    Journal Article
  8. 8

    Quantifying Language Models' Sensitivity to Spurious Features in Prompt Design or: How I learned to start worrying about prompt formatting by Sclar, Melanie, Choi, Yejin, Tsvetkov, Yulia, Suhr, Alane

    Published 17-10-2023
    “…As large language models (LLMs) are adopted as a fundamental component of language technologies, it is crucial to accurately characterize their performance…”
    Get full text
    Journal Article
  9. 9

    Continual Learning for Grounded Instruction Generation by Observing Human Following Behavior by Kojima, Noriyuki, Suhr, Alane, Artzi, Yoav

    Published 10-08-2021
    “…We study continual learning for natural language instruction generation, by observing human users' instruction execution. We focus on a collaborative scenario,…”
    Get full text
    Journal Article
  10. 10

    Situated Mapping of Sequential Instructions to Actions with Single-step Reward Observation by Suhr, Alane, Artzi, Yoav

    Published 25-05-2018
    “…We propose a learning approach for mapping context-dependent sequential instructions to actions. We address the problem of discourse and state dependencies…”
    Get full text
    Journal Article
  11. 11

    DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning by Bai, Hao, Zhou, Yifei, Cemri, Mert, Pan, Jiayi, Suhr, Alane, Levine, Sergey, Kumar, Aviral

    Published 14-06-2024
    “…Training corpuses for vision language models (VLMs) typically lack sufficient amounts of decision-centric data. This renders off-the-shelf VLMs sub-optimal for…”
    Get full text
    Journal Article
  12. 12

    Autonomous Evaluation and Refinement of Digital Agents by Pan, Jiayi, Zhang, Yichi, Tomlin, Nicholas, Zhou, Yifei, Levine, Sergey, Suhr, Alane

    Published 09-04-2024
    “…We show that domain-general automatic evaluators can significantly improve the performance of agents for web navigation and device control. We experiment with…”
    Get full text
    Journal Article
  13. 13

    Minding Language Models' (Lack of) Theory of Mind: A Plug-and-Play Multi-Character Belief Tracker by Sclar, Melanie, Kumar, Sachin, West, Peter, Suhr, Alane, Choi, Yejin, Tsvetkov, Yulia

    Published 01-06-2023
    “…ACL 2023 Theory of Mind (ToM)$\unicode{x2014}$the ability to reason about the mental states of other people$\unicode{x2014}$is a key element of our social…”
    Get full text
    Journal Article
  14. 14

    Analysis of Language Change in Collaborative Instruction Following by Effenberger, Anna, Yan, Eva, Singh, Rhia, Suhr, Alane, Artzi, Yoav

    Published 09-09-2021
    “…We analyze language change over time in a collaborative, goal-oriented instructional task, where utility-maximizing participants form conventions and increase…”
    Get full text
    Journal Article
  15. 15

    Do Embodied Agents Dream of Pixelated Sheep: Embodied Decision Making using Language Guided World Modelling by Nottingham, Kolby, Ammanabrolu, Prithviraj, Suhr, Alane, Choi, Yejin, Hajishirzi, Hannaneh, Singh, Sameer, Fox, Roy

    Published 27-01-2023
    “…Reinforcement learning (RL) agents typically learn tabula rasa, without prior knowledge of the world. However, if initialized with knowledge of high-level…”
    Get full text
    Journal Article
  16. 16

    Abstract Visual Reasoning with Tangram Shapes by Ji, Anya, Kojima, Noriyuki, Rush, Noah, Suhr, Alane, Vong, Wai Keen, Hawkins, Robert D, Artzi, Yoav

    Published 29-11-2022
    “…We introduce KiloGram, a resource for studying abstract visual reasoning in humans and machines. Drawing on the history of tangram puzzles as stimuli in…”
    Get full text
    Journal Article
  17. 17

    Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning by Zhai, Yuexiang, Bai, Hao, Lin, Zipeng, Pan, Jiayi, Tong, Shengbang, Zhou, Yifei, Suhr, Alane, Xie, Saining, LeCun, Yann, Ma, Yi, Levine, Sergey

    Published 16-05-2024
    “…Large vision-language models (VLMs) fine-tuned on specialized visual instruction-following data have exhibited impressive language reasoning capabilities…”
    Get full text
    Journal Article
  18. 18

    Fine-Grained Human Feedback Gives Better Rewards for Language Model Training by Wu, Zeqiu, Hu, Yushi, Shi, Weijia, Dziri, Nouha, Suhr, Alane, Ammanabrolu, Prithviraj, Smith, Noah A, Ostendorf, Mari, Hajishirzi, Hannaneh

    Published 02-06-2023
    “…Language models (LMs) often exhibit undesirable text generation behaviors, including generating false, toxic, or irrelevant outputs. Reinforcement learning…”
    Get full text
    Journal Article
  19. 19

    We're Afraid Language Models Aren't Modeling Ambiguity by Liu, Alisa, Wu, Zhaofeng, Michael, Julian, Suhr, Alane, West, Peter, Koller, Alexander, Swayamdipta, Swabha, Smith, Noah A, Choi, Yejin

    Published 27-04-2023
    “…Ambiguity is an intrinsic feature of natural language. Managing ambiguity is a key part of human language understanding, allowing us to anticipate…”
    Get full text
    Journal Article
  20. 20

    UNcommonsense Reasoning: Abductive Reasoning about Uncommon Situations by Zhao, Wenting, Chiu, Justin T, Hwang, Jena D, Brahman, Faeze, Hessel, Jack, Choudhury, Sanjiban, Choi, Yejin, Li, Xiang Lorraine, Suhr, Alane

    Published 14-11-2023
    “…Language technologies that accurately model the dynamics of events must perform commonsense reasoning. Existing work evaluating commonsense reasoning focuses…”
    Get full text
    Journal Article