Search Results - "Foote, Davis"

  • Showing 1 - 9 results of 9
Refine Results
  1. 1
  2. 2
  3. 3
  4. 4
  5. 5

    AI Alignment with Changing and Influenceable Reward Functions by Carroll, Micah, Foote, Davis, Siththaranjan, Anand, Russell, Stuart, Dragan, Anca

    Published 27-05-2024
    “…Existing AI alignment approaches assume that preferences are static, which is unrealistic: our preferences change, and may even be influenced by our…”
    Get full text
    Journal Article
  6. 6
  7. 7

    When Your AIs Deceive You: Challenges of Partial Observability in Reinforcement Learning from Human Feedback by Lang, Leon, Foote, Davis, Russell, Stuart, Dragan, Anca, Jenner, Erik, Emmons, Scott

    Published 27-02-2024
    “…Past analyses of reinforcement learning from human feedback (RLHF) assume that the human evaluators fully observe the environment. What happens when human…”
    Get full text
    Journal Article
  8. 8

    Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning by Tang, Haoran, Houthooft, Rein, Foote, Davis, Stooke, Adam, Chen, Xi, Duan, Yan, Schulman, John, De Turck, Filip, Abbeel, Pieter

    Published 15-11-2016
    “…Count-based exploration algorithms are known to perform near-optimally when used in conjunction with tabular reinforcement learning (RL) methods for solving…”
    Get full text
    Journal Article
  9. 9