Search Results - "Voelcker, Claas"

1
Can we hop in general? A discussion of benchmark selection and design using the Hopper environment by Voelcker, Claas A, Hussing, Marcel, Eaton, Eric

Published 11-10-2024
“…Empirical, benchmark-driven testing is a fundamental paradigm in the current RL community. While using off-the-shelf benchmarks in reinforcement learning (RL)…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
2
When does Self-Prediction help? Understanding Auxiliary Tasks in Reinforcement Learning by Voelcker, Claas, Kastner, Tyler, Gilitschenski, Igor, Farahmand, Amir-massoud

Published 25-06-2024
“…We investigate the impact of auxiliary learning tasks such as observation reconstruction and latent self-prediction on the representation learning problem in…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
3
Temporal-Difference Learning Using Distributed Error Signals by Guan, Jonas, Verch, Shon Eduard, Voelcker, Claas, Jackson, Ethan C, Papernot, Nicolas, Cunningham, William A

Published 05-11-2024
“…A computational problem in biological reward-based learning is how credit assignment is performed in the nucleus accumbens (NAc). Much research suggests that…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
4
MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL by Voelcker, Claas A, Hussing, Marcel, Eaton, Eric, Farahmand, Amir-massoud, Gilitschenski, Igor

Published 11-10-2024
“…Building deep reinforcement learning (RL) agents that find a good policy with few samples has proven notoriously challenging. To achieve sample efficiency,…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
5
Dissecting Deep RL with High Update Ratios: Combatting Value Divergence by Hussing, Marcel, Voelcker, Claas, Gilitschenski, Igor, Farahmand, Amir-massoud, Eaton, Eric

Published 09-03-2024
“…We show that deep reinforcement learning algorithms can retain their ability to learn without resetting network parameters in settings where the number of…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
6
Value Gradient weighted Model-Based Reinforcement Learning by Voelcker, Claas, Liao, Victor, Garg, Animesh, Farahmand, Amir-massoud

Published 04-04-2022
“…Model-based reinforcement learning (MBRL) is a sample efficient technique to obtain control policies, yet unavoidable modeling errors often lead performance…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
7
lambda$-models: Effective Decision-Aware Reinforcement Learning with Latent Models by Voelcker, Claas A, Ahmadian, Arash, Abachi, Romina, Gilitschenski, Igor, Farahmand, Amir-massoud

Published 29-06-2023
“…The idea of decision-aware model learning, that models should be accurate where it matters for decision-making, has gained prominence in model-based…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
8
Structured Object-Aware Physics Prediction for Video Modeling and Planning by Kossen, Jannik, Stelzner, Karl, Hussing, Marcel, Voelcker, Claas, Kersting, Kristian

Published 06-10-2019
“…When humans observe a physical system, they can easily locate objects, understand their interactions, and anticipate future behavior, even in settings with…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
9
Queer In AI: A Case Study in Community-Led Participatory AI by QueerInAI, Organizers Of, :, Ovalle, Anaelia, Subramonian, Arjun, Singh, Ashwin, Voelcker, Claas, Sutherland, Danica J, Locatelli, Davide, Breznik, Eva, Klubička, Filip, Yuan, Hang, J, Hetvi, Zhang, Huan, Shriram, Jaidev, Lehman, Kruno, Soldaini, Luca, Sap, Maarten, Deisenroth, Marc Peter, Pacheco, Maria Leonor, Ryskina, Maria, Mundt, Martin, Agarwal, Milind, McLean, Nyx, Xu, Pan, Pranav, A, Korpan, Raj, Ray, Ruchira, Mathew, Sarah, Arora, Sarthak, John, ST, Anand, Tanvi, Agrawal, Vishakha, Agnew, William, Long, Yanan, Wang, Zijie J, Talat, Zeerak, Ghosh, Avijit, Dennler, Nathaniel, Noseworthy, Michael, Jha, Sharvani, Baylor, Emi, Joshi, Aditya, Bilenko, Natalia Y, McNamara, Andrew, Gontijo-Lopes, Raphael, Markham, Alex, Dǒng, Evyn, Kay, Jackie, Saraswat, Manu, Vytla, Nikhil, Stark, Luke

Published 08-06-2023
“…2023 ACM Conference on Fairness, Accountability, and Transparency We present Queer in AI as a case study for community-led participatory design in AI. We…”

Get full text

Journal Article
QR Code
Save to List

Saved in:

Search Results - "Voelcker, Claas"

Can we hop in general? A discussion of benchmark selection and design using the Hopper environment by Voelcker, Claas A, Hussing, Marcel, Eaton, Eric

When does Self-Prediction help? Understanding Auxiliary Tasks in Reinforcement Learning by Voelcker, Claas, Kastner, Tyler, Gilitschenski, Igor, Farahmand, Amir-massoud

Temporal-Difference Learning Using Distributed Error Signals by Guan, Jonas, Verch, Shon Eduard, Voelcker, Claas, Jackson, Ethan C, Papernot, Nicolas, Cunningham, William A

MAD-TD: Model-Augmented Data stabilizes High Update Ratio RL by Voelcker, Claas A, Hussing, Marcel, Eaton, Eric, Farahmand, Amir-massoud, Gilitschenski, Igor

Dissecting Deep RL with High Update Ratios: Combatting Value Divergence by Hussing, Marcel, Voelcker, Claas, Gilitschenski, Igor, Farahmand, Amir-massoud, Eaton, Eric

Value Gradient weighted Model-Based Reinforcement Learning by Voelcker, Claas, Liao, Victor, Garg, Animesh, Farahmand, Amir-massoud

lambda$-models: Effective Decision-Aware Reinforcement Learning with Latent Models by Voelcker, Claas A, Ahmadian, Arash, Abachi, Romina, Gilitschenski, Igor, Farahmand, Amir-massoud

Structured Object-Aware Physics Prediction for Video Modeling and Planning by Kossen, Jannik, Stelzner, Karl, Hussing, Marcel, Voelcker, Claas, Kersting, Kristian

Search Tools:

Refine Results

Format

Subject Area

Topic

Language

Year of Publication