Search Results - "Voelcker, Claas"
1
Can we hop in general? A discussion of benchmark selection and design using the Hopper environment
Published 11-10-2024 “…Empirical, benchmark-driven testing is a fundamental paradigm in the current RL community. While using off-the-shelf benchmarks in reinforcement learning (RL)…”
Journal Article
2
When does Self-Prediction help? Understanding Auxiliary Tasks in Reinforcement Learning
Published 25-06-2024 “…We investigate the impact of auxiliary learning tasks such as observation reconstruction and latent self-prediction on the representation learning problem in…”
Journal Article
3
Temporal-Difference Learning Using Distributed Error Signals
Published 05-11-2024 “…A computational problem in biological reward-based learning is how credit assignment is performed in the nucleus accumbens (NAc). Much research suggests that…”
Journal Article
4
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL
Published 11-10-2024 “…Building deep reinforcement learning (RL) agents that find a good policy with few samples has proven notoriously challenging. To achieve sample efficiency,…”
Journal Article
5
Dissecting Deep RL with High Update Ratios: Combatting Value Divergence
Published 09-03-2024 “…We show that deep reinforcement learning algorithms can retain their ability to learn without resetting network parameters in settings where the number of…”
Journal Article
6
Value Gradient weighted Model-Based Reinforcement Learning
Published 04-04-2022 “…Model-based reinforcement learning (MBRL) is a sample efficient technique to obtain control policies, yet unavoidable modeling errors often lead performance…”
Journal Article
7
$\lambda$-models: Effective Decision-Aware Reinforcement Learning with Latent Models
Published 29-06-2023 “…The idea of decision-aware model learning, that models should be accurate where it matters for decision-making, has gained prominence in model-based…”
Journal Article
8
Structured Object-Aware Physics Prediction for Video Modeling and Planning
Published 06-10-2019 “…When humans observe a physical system, they can easily locate objects, understand their interactions, and anticipate future behavior, even in settings with…”
Journal Article
9
Queer In AI: A Case Study in Community-Led Participatory AI
Published 08-06-2023 “…2023 ACM Conference on Fairness, Accountability, and Transparency. We present Queer in AI as a case study for community-led participatory design in AI. We…”
Journal Article