Search Results - "Bai, Chenjia"
-
1
Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain
Published in IEEE transaction on neural networks and learning systems (01-07-2024)“…Deep reinforcement learning (DRL) and deep multiagent reinforcement learning (MARL) have achieved significant success across a wide range of domains, including…”
Get full text
Journal Article -
2
Diverse randomized value functions: A provably pessimistic approach for offline reinforcement learning
Published in Information sciences (01-10-2024)“…Offline Reinforcement Learning (RL) faces challenges such as distributional shift and unreliable value estimation, especially for out-of-distribution (OOD)…”
Get full text
Journal Article -
3
Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning
Published in IEEE transaction on neural networks and learning systems (01-08-2023)“…Efficient exploration remains a challenging problem in reinforcement learning, especially for tasks where extrinsic rewards from environments are sparse or…”
Get full text
Journal Article -
4
False Correlation Reduction for Offline Reinforcement Learning
Published in IEEE transactions on pattern analysis and machine intelligence (01-02-2024)“…Offline reinforcement learning (RL) harnesses the power of massive datasets for resolving sequential decision problems. Most existing papers only discuss…”
Get full text
Journal Article -
5
Pessimistic value iteration for multi-task data sharing in Offline Reinforcement Learning
Published in Artificial intelligence (01-01-2024)Get full text
Journal Article -
6
Skill matters: Dynamic skill learning for multi-agent cooperative reinforcement learning
Published in Neural networks (01-01-2025)“…With the popularization of intelligence, the necessity of cooperation between intelligent machines makes the research of collaborative multi-agent…”
Get full text
Journal Article -
7
Monotonic Quantile Network for Worst-Case Offline Reinforcement Learning
Published in IEEE transaction on neural networks and learning systems (01-07-2024)“…A key challenge in offline reinforcement learning (RL) is how to ensure the learned offline policy is safe, especially in safety-critical domains. In this…”
Get full text
Journal Article -
8
Ensemble successor representations for task generalization in offline-to-online reinforcement learning
Published in Science China. Information sciences (01-07-2024)“…In reinforcement learning (RL), training a policy from scratch with online experiences can be inefficient because of the difficulties in exploration. Recently,…”
Get full text
Journal Article -
9
Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness
Published in The Journal of artificial intelligence research (13-11-2024)“…To obtain a near-optimal policy with fewer interactions in Reinforcement Learning (RL), a promising approach involves the combination of offline RL, which…”
Get full text
Journal Article -
10
Guided goal generation for hindsight multi-goal reinforcement learning
Published in Neurocomputing (Amsterdam) (24-09-2019)“…Typical reinforcement learning (RL) can only perform a single task and thus cannot scale to problems for which an agent needs to perform multiple tasks, such…”
Get full text
Journal Article -
11
Elucidation of the using condition and working mechanism of tertiary lactose in dry powder formulations for inhalation
Published in Powder technology (01-09-2023)“…Tertiary lactose has been shown to improve aerosolization performance of carrier-based dry powder inhalation formulations. The working mechanism of the…”
Get full text
Journal Article -
12
Addressing Hindsight Bias in Multigoal Reinforcement Learning
Published in IEEE transactions on cybernetics (01-01-2023)“…Multigoal reinforcement learning (RL) extends the typical RL with goal-conditional value functions and policies. One efficient multigoal RL algorithm is the…”
Get full text
Journal Article -
13
Obtaining accurate estimated action values in categorical distributional reinforcement learning
Published in Knowledge-based systems (22-04-2020)“…Categorical Distributional Reinforcement Learning (CDRL) uses a categorical distribution with evenly spaced outcomes to model the entire distribution of…”
Get full text
Journal Article -
14
Generating attentive goals for prioritized hindsight reinforcement learning
Published in Knowledge-based systems (05-09-2020)“…Typical reinforcement learning (RL) performs a single task and does not scale to problems in which an agent must perform multiple tasks, such as moving a robot…”
Get full text
Journal Article -
15
Self-Supervised Imitation for Offline Reinforcement Learning With Hindsight Relabeling
Published in IEEE transactions on systems, man, and cybernetics. Systems (01-12-2023)“…Reinforcement learning (RL) requires a lot of interactions with the environment, which is usually expensive or dangerous in real-world tasks. To address this…”
Get full text
Journal Article -
16
Forward KL Regularized Preference Optimization for Aligning Diffusion Policies
Published 09-09-2024“…Diffusion models have achieved remarkable success in sequential decision-making by leveraging the highly expressive model capabilities in policy learning. A…”
Get full text
Journal Article -
17
Cross-Domain Policy Adaptation by Capturing Representation Mismatch
Published 24-05-2024“…It is vital to learn effective policies that can be transferred to different domains with dynamics discrepancies in reinforcement learning (RL). In this paper,…”
Get full text
Journal Article -
18
Ensemble Successor Representations for Task Generalization in Offline-to-Online Reinforcement Learning
Published 12-05-2024“…In Reinforcement Learning (RL), training a policy from scratch with online experiences can be inefficient because of the difficulties in exploration. Recently,…”
Get full text
Journal Article -
19
Provably Efficient Information-Directed Sampling Algorithms for Multi-Agent Reinforcement Learning
Published 30-04-2024“…This work designs and analyzes a novel set of algorithms for multi-agent reinforcement learning (MARL) based on the principle of information-directed sampling…”
Get full text
Journal Article -
20
Diverse Randomized Value Functions: A Provably Pessimistic Approach for Offline Reinforcement Learning
Published 09-04-2024“…Offline Reinforcement Learning (RL) faces distributional shift and unreliable value estimation, especially for out-of-distribution (OOD) actions. To address…”
Get full text
Journal Article