Search Results - "Bai, Chenjia"

Refine Results
  1. 1

    Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain by Hao, Jianye, Yang, Tianpei, Tang, Hongyao, Bai, Chenjia, Liu, Jinyi, Meng, Zhaopeng, Liu, Peng, Wang, Zhen

    “…Deep reinforcement learning (DRL) and deep multiagent reinforcement learning (MARL) have achieved significant success across a wide range of domains, including…”
    Get full text
    Journal Article
  2. 2

    Diverse randomized value functions: A provably pessimistic approach for offline reinforcement learning by Yu, Xudong, Bai, Chenjia, Guo, Hongyi, Wang, Changhong, Wang, Zhen

    Published in Information sciences (01-10-2024)
    “…Offline Reinforcement Learning (RL) faces challenges such as distributional shift and unreliable value estimation, especially for out-of-distribution (OOD)…”
    Get full text
    Journal Article
  3. 3

    Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning by Bai, Chenjia, Liu, Peng, Liu, Kaiyu, Wang, Lingxiao, Zhao, Yingnan, Han, Lei, Wang, Zhaoran

    “…Efficient exploration remains a challenging problem in reinforcement learning, especially for tasks where extrinsic rewards from environments are sparse or…”
    Get full text
    Journal Article
  4. 4

    False Correlation Reduction for Offline Reinforcement Learning by Deng, Zhihong, Fu, Zuyue, Wang, Lingxiao, Yang, Zhuoran, Bai, Chenjia, Zhou, Tianyi, Wang, Zhaoran, Jiang, Jing

    “…Offline reinforcement learning (RL) harnesses the power of massive datasets for resolving sequential decision problems. Most existing papers only discuss…”
    Get full text
    Journal Article
  5. 5
  6. 6

    Skill matters: Dynamic skill learning for multi-agent cooperative reinforcement learning by Li, Tong, Bai, Chenjia, Xu, Kang, Chu, Chen, Zhu, Peican, Wang, Zhen

    Published in Neural networks (01-01-2025)
    “…With the popularization of intelligence, the necessity of cooperation between intelligent machines makes the research of collaborative multi-agent…”
    Get full text
    Journal Article
  7. 7

    Monotonic Quantile Network for Worst-Case Offline Reinforcement Learning by Bai, Chenjia, Xiao, Ting, Zhu, Zhoufan, Wang, Lingxiao, Zhou, Fan, Garg, Animesh, He, Bin, Liu, Peng, Wang, Zhaoran

    “…A key challenge in offline reinforcement learning (RL) is how to ensure the learned offline policy is safe, especially in safety-critical domains. In this…”
    Get full text
    Journal Article
  8. 8

    Ensemble successor representations for task generalization in offline-to-online reinforcement learning by Wang, Changhong, Yu, Xudong, Bai, Chenjia, Zhang, Qiaosheng, Wang, Zhen

    Published in Science China. Information sciences (01-07-2024)
    “…In reinforcement learning (RL), training a policy from scratch with online experiences can be inefficient because of the difficulties in exploration. Recently,…”
    Get full text
    Journal Article
  9. 9

    Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness by Wen, Xiaoyu, Yu, Xudong, Yang, Rui, Chen, Haoyuan, Bai, Chenjia, Wang, Zhen

    “…To obtain a near-optimal policy with fewer interactions in Reinforcement Learning (RL), a promising approach involves the combination of offline RL, which…”
    Get full text
    Journal Article
  10. 10

    Guided goal generation for hindsight multi-goal reinforcement learning by Bai, Chenjia, Liu, Peng, Zhao, Wei, Tang, Xianglong

    Published in Neurocomputing (Amsterdam) (24-09-2019)
    “…Typical reinforcement learning (RL) can only perform a single task and thus cannot scale to problems for which an agent needs to perform multiple tasks, such…”
    Get full text
    Journal Article
  11. 11

    Elucidation of the using condition and working mechanism of tertiary lactose in dry powder formulations for inhalation by Ye, Yuqing, Shi, Tingting, Fan, Ziyi, Bai, Chenjia, Ma, Ying, Zhu, Jesse

    Published in Powder technology (01-09-2023)
    “…Tertiary lactose has been shown to improve aerosolization performance of carrier-based dry powder inhalation formulations. The working mechanism of the…”
    Get full text
    Journal Article
  12. 12

    Addressing Hindsight Bias in Multigoal Reinforcement Learning by Bai, Chenjia, Wang, Lingxiao, Wang, Yixin, Wang, Zhaoran, Zhao, Rui, Bai, Chenyao, Liu, Peng

    Published in IEEE transactions on cybernetics (01-01-2023)
    “…Multigoal reinforcement learning (RL) extends the typical RL with goal-conditional value functions and policies. One efficient multigoal RL algorithm is the…”
    Get full text
    Journal Article
  13. 13

    Obtaining accurate estimated action values in categorical distributional reinforcement learning by Zhao, Yingnan, Liu, Peng, Bai, Chenjia, Zhao, Wei, Tang, Xianglong

    Published in Knowledge-based systems (22-04-2020)
    “…Categorical Distributional Reinforcement Learning (CDRL) uses a categorical distribution with evenly spaced outcomes to model the entire distribution of…”
    Get full text
    Journal Article
  14. 14

    Generating attentive goals for prioritized hindsight reinforcement learning by Liu, Peng, Bai, Chenjia, Zhao, Yingnan, Bai, Chenyao, Zhao, Wei, Tang, Xianglong

    Published in Knowledge-based systems (05-09-2020)
    “…Typical reinforcement learning (RL) performs a single task and does not scale to problems in which an agent must perform multiple tasks, such as moving a robot…”
    Get full text
    Journal Article
  15. 15

    Self-Supervised Imitation for Offline Reinforcement Learning With Hindsight Relabeling by Yu, Xudong, Bai, Chenjia, Wang, Changhong, Yu, Dengxiu, Chen, C. L. Philip, Wang, Zhen

    “…Reinforcement learning (RL) requires a lot of interactions with the environment, which is usually expensive or dangerous in real-world tasks. To address this…”
    Get full text
    Journal Article
  16. 16

    Forward KL Regularized Preference Optimization for Aligning Diffusion Policies by Shan, Zhao, Fan, Chenyou, Qiu, Shuang, Shi, Jiyuan, Bai, Chenjia

    Published 09-09-2024
    “…Diffusion models have achieved remarkable success in sequential decision-making by leveraging the highly expressive model capabilities in policy learning. A…”
    Get full text
    Journal Article
  17. 17

    Cross-Domain Policy Adaptation by Capturing Representation Mismatch by Lyu, Jiafei, Bai, Chenjia, Yang, Jingwen, Lu, Zongqing, Li, Xiu

    Published 24-05-2024
    “…It is vital to learn effective policies that can be transferred to different domains with dynamics discrepancies in reinforcement learning (RL). In this paper,…”
    Get full text
    Journal Article
  18. 18

    Ensemble Successor Representations for Task Generalization in Offline-to-Online Reinforcement Learning by Wang, Changhong, Yu, Xudong, Bai, Chenjia, Zhang, Qiaosheng, Wang, Zhen

    Published 12-05-2024
    “…In Reinforcement Learning (RL), training a policy from scratch with online experiences can be inefficient because of the difficulties in exploration. Recently,…”
    Get full text
    Journal Article
  19. 19

    Provably Efficient Information-Directed Sampling Algorithms for Multi-Agent Reinforcement Learning by Zhang, Qiaosheng, Bai, Chenjia, Hu, Shuyue, Wang, Zhen, Li, Xuelong

    Published 30-04-2024
    “…This work designs and analyzes a novel set of algorithms for multi-agent reinforcement learning (MARL) based on the principle of information-directed sampling…”
    Get full text
    Journal Article
  20. 20

    Diverse Randomized Value Functions: A Provably Pessimistic Approach for Offline Reinforcement Learning by Yu, Xudong, Bai, Chenjia, Guo, Hongyi, Wang, Changhong, Wang, Zhen

    Published 09-04-2024
    “…Offline Reinforcement Learning (RL) faces distributional shift and unreliable value estimation, especially for out-of-distribution (OOD) actions. To address…”
    Get full text
    Journal Article