Search Results - "Voelcker, Claas"

  • Showing 1 - 9 results of 9
  1.

    Can we hop in general? A discussion of benchmark selection and design using the Hopper environment by Voelcker, Claas A, Hussing, Marcel, Eaton, Eric

    Published 11-10-2024
    “…Empirical, benchmark-driven testing is a fundamental paradigm in the current RL community. While using off-the-shelf benchmarks in reinforcement learning (RL)…”
    Journal Article
  2.

    When does Self-Prediction help? Understanding Auxiliary Tasks in Reinforcement Learning by Voelcker, Claas, Kastner, Tyler, Gilitschenski, Igor, Farahmand, Amir-massoud

    Published 25-06-2024
    “…We investigate the impact of auxiliary learning tasks such as observation reconstruction and latent self-prediction on the representation learning problem in…”
    Journal Article
  3.

    Temporal-Difference Learning Using Distributed Error Signals by Guan, Jonas, Verch, Shon Eduard, Voelcker, Claas, Jackson, Ethan C, Papernot, Nicolas, Cunningham, William A

    Published 05-11-2024
    “…A computational problem in biological reward-based learning is how credit assignment is performed in the nucleus accumbens (NAc). Much research suggests that…”
    Journal Article
  4.

    MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL by Voelcker, Claas A, Hussing, Marcel, Eaton, Eric, Farahmand, Amir-massoud, Gilitschenski, Igor

    Published 11-10-2024
    “…Building deep reinforcement learning (RL) agents that find a good policy with few samples has proven notoriously challenging. To achieve sample efficiency,…”
    Journal Article
  5.

    Dissecting Deep RL with High Update Ratios: Combatting Value Divergence by Hussing, Marcel, Voelcker, Claas, Gilitschenski, Igor, Farahmand, Amir-massoud, Eaton, Eric

    Published 09-03-2024
    “…We show that deep reinforcement learning algorithms can retain their ability to learn without resetting network parameters in settings where the number of…”
    Journal Article
  6.

    Value Gradient weighted Model-Based Reinforcement Learning by Voelcker, Claas, Liao, Victor, Garg, Animesh, Farahmand, Amir-massoud

    Published 04-04-2022
    “…Model-based reinforcement learning (MBRL) is a sample efficient technique to obtain control policies, yet unavoidable modeling errors often lead performance…”
    Journal Article
  7.

    $\lambda$-models: Effective Decision-Aware Reinforcement Learning with Latent Models by Voelcker, Claas A, Ahmadian, Arash, Abachi, Romina, Gilitschenski, Igor, Farahmand, Amir-massoud

    Published 29-06-2023
    “…The idea of decision-aware model learning, that models should be accurate where it matters for decision-making, has gained prominence in model-based…”
    Journal Article
  8.

    Structured Object-Aware Physics Prediction for Video Modeling and Planning by Kossen, Jannik, Stelzner, Karl, Hussing, Marcel, Voelcker, Claas, Kersting, Kristian

    Published 06-10-2019
    “…When humans observe a physical system, they can easily locate objects, understand their interactions, and anticipate future behavior, even in settings with…”
    Journal Article
  9.