Search Results - "Yang, Tianpei"

Refine Results
  1. 1

    Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain by Hao, Jianye, Yang, Tianpei, Tang, Hongyao, Bai, Chenjia, Liu, Jinyi, Meng, Zhaopeng, Liu, Peng, Wang, Zhen

    “…Deep reinforcement learning (DRL) and deep multiagent reinforcement learning (MARL) have achieved significant success across a wide range of domains, including…”
    Get full text
    Journal Article
  2. 2

    Efficient policy detecting and reusing for non-stationarity in Markov games by Zheng, Yan, Hao, Jianye, Zhang, Zongzhang, Meng, Zhaopeng, Yang, Tianpei, Li, Yanran, Fan, Changjie

    Published in Autonomous agents and multi-agent systems (01-01-2021)
    “…One challenging problem in multiagent systems is to cooperate or compete with non-stationary agents that change behavior from time to time. An agent in such a…”
    Get full text
    Journal Article
  3. 3
  4. 4

    ASN: action semantics network for multiagent reinforcement learning by Yang, Tianpei, Wang, Weixun, Hao, Jianye, Taylor, Matthew E., Liu, Yong, Hao, Xiaotian, Hu, Yujing, Chen, Yingfeng, Fan, Changjie, Ren, Chunxu, Huang, Ye, Zhu, Jiangcheng, Gao, Yang

    Published in Autonomous agents and multi-agent systems (01-12-2023)
    “…In multiagent systems (MASs), each agent makes individual decisions but all contribute globally to the system’s evolution. Learning in MASs is difficult since…”
    Get full text
    Journal Article
  5. 5

    Coupling Glycerol Conversion with Hydrogen Production Using Alloyed Electrocatalysts by Yang, Tianpei, Shen, Yi

    Published in Langmuir (12-09-2023)
    “…Herein, uniform precious alloys including PtAg, PdAg, and PtPdAg nanoparticles were synthesized as electrocatalysts for glycerol oxidation reaction (GOR). The…”
    Get full text
    Journal Article
  6. 6
  7. 7

    A survey on interpretable reinforcement learning by Glanois, Claire, Weng, Paul, Zimmer, Matthieu, Li, Dong, Yang, Tianpei, Hao, Jianye, Liu, Wulong

    Published in Machine learning (01-08-2024)
    “…Although deep reinforcement learning has become a promising machine learning approach for sequential decision-making problems, it is still not mature enough…”
    Get full text
    Journal Article
  8. 8

    Neighborhood Cooperative Multiagent Reinforcement Learning for Adaptive Traffic Signal Control in Epidemic Regions by Zhang, Chengwei, Tian, Yu, Zhang, Zhibin, Xue, Wanli, Xie, Xiaofei, Yang, Tianpei, Ge, Xin, Chen, Rong

    “…Nowadays, multiagent reinforcement learning (MARL) have shared significant advances in the adaptive traffic signal control (ATSC) problems. For most of the…”
    Get full text
    Journal Article
  9. 9

    Human-in-the-Loop Reinforcement Learning: A Survey and Position on Requirements, Challenges, and Opportunities by Retzlaff, Carl Orge, Das, Srijita, Wayllace, Christabel, Mousavi, Payam, Afshari, Mohammad, Yang, Tianpei, Saranti, Anna, Angerschmid, Alessa, Taylor, Matthew E., Holzinger, Andreas

    “…Artificial intelligence (AI) and especially reinforcement learning (RL) have the potential to enable agents to learn and perform tasks autonomously with…”
    Get full text
    Journal Article
  10. 10

    Advertising Impression Resource Allocation Strategy with Multi-Level Budget Constraint DQN in Real-Time Bidding by Zhang, Chengwei, Zheng, Kangjie, Tian, Yu, Xue, Wanli, Yang, Tianpei, An, Dou, Pi, Yongqi, Chen, Rong

    Published in Neurocomputing (Amsterdam) (01-06-2022)
    “…How to allocate advertising impressions under budget constraint is one of the leading research issues in Real-Time Bidding (RTB). Traditional methods are…”
    Get full text
    Journal Article
  11. 11

    Towards quantitative and intuitive percutaneous tumor puncture via augmented virtual reality by Li, Ruotong, Tong, Yuqi, Yang, Tianpei, Guo, Jianxi, Si, Weixin, Zhang, Yanfang, Klein, Reinhard, Heng, Pheng-Ann

    Published in Computerized medical imaging and graphics (01-06-2021)
    “…•Optimal needle puncture path planning for RFA via constraint-based method to fulfill the requirements in specialist consensus.•Precisely overlying the…”
    Get full text
    Journal Article
  12. 12

    T3S: Improving Multi-Task Reinforcement Learning with Task-Specific Feature Selector and Scheduler by Yu, Yuanqiang, Yang, Tianpei, Lv, Yongliang, Zheng, Yan, Hao, Jianye

    “…Multi-task reinforcement learning (MTRL) is a technique to train multiple tasks simultaneously, where previous works usually train a single model to solve…”
    Get full text
    Conference Proceeding
  13. 13

    FPGA Divide-and-Conquer Placement using Deep Reinforcement Learning by Wang, Shang, Mamillapalli, Deepak Ranganatha Sastry, Yang, Tianpei, Taylor, Matthew E.

    “…This paper introduces the problem of learning to place logic blocks in Field-Programmable Gate Arrays (FPGAs) and a preliminary learning-based method. In…”
    Get full text
    Conference Proceeding
  14. 14

    FPGA Divide-and-Conquer Placement using Deep Reinforcement Learning by Wang, Shang, Mamillapalli, Deepak Ranganatha Sastry, Yang, Tianpei, Taylor, Matthew E

    Published 11-04-2024
    “…This paper introduces the problem of learning to place logic blocks in Field-Programmable Gate Arrays (FPGAs) and a learning-based method. In contrast to…”
    Get full text
    Journal Article
  15. 15

    LaFFi: Leveraging Hybrid Natural Language Feedback for Fine-tuning Language Models by Li, Qianxi, Cao, Yingyue, Kang, Jikun, Yang, Tianpei, Chen, Xi, Jin, Jun, Taylor, Matthew E

    Published 31-12-2023
    “…Fine-tuning Large Language Models (LLMs) adapts a trained model to specific downstream tasks, significantly improving task-specific performance. Supervised…”
    Get full text
    Journal Article
  16. 16

    Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain by Hao, Jianye, Yang, Tianpei, Tang, Hongyao, Bai, Chenjia, Liu, Jinyi, Meng, Zhaopeng, Liu, Peng, Wang, Zhen

    Published 02-02-2023
    “…Deep Reinforcement Learning (DRL) and Deep Multi-agent Reinforcement Learning (MARL) have achieved significant successes across a wide range of domains,…”
    Get full text
    Journal Article
  17. 17

    GALOIS: Boosting Deep Reinforcement Learning via Generalizable Logic Synthesis by Cao, Yushi, Li, Zhiming, Yang, Tianpei, Zhang, Hao, Zheng, Yan, Li, Yi, Hao, Jianye, Liu, Yang

    Published 26-05-2022
    “…Despite achieving superior performance in human-level control problems, unlike humans, deep reinforcement learning (DRL) lacks high-order intelligence (e.g.,…”
    Get full text
    Journal Article
  18. 18

    A Survey on Interpretable Reinforcement Learning by Glanois, Claire, Weng, Paul, Zimmer, Matthieu, Li, Dong, Yang, Tianpei, Hao, Jianye, Liu, Wulong

    Published 24-12-2021
    “…Although deep reinforcement learning has become a promising machine learning approach for sequential decision-making problems, it is still not mature enough…”
    Get full text
    Journal Article
  19. 19

    Taming Multi-Agent Reinforcement Learning with Estimator Variance Reduction by Jafferjee, Taher, Ziomek, Juliusz, Yang, Tianpei, Dai, Zipeng, Wang, Jianhong, Taylor, Matthew, Shao, Kun, Wang, Jun, Mguni, David

    Published 02-09-2022
    “…Centralised training with decentralised execution (CT-DE) serves as the foundation of many leading multi-agent reinforcement learning (MARL) algorithms…”
    Get full text
    Journal Article
  20. 20

    Learning Shaping Strategies in Human-in-the-loop Interactive Reinforcement Learning by Yu, Chao, Yang, Tianpei, Zhu, Wenxuan, wang, Dongxu, Li, Guangliang

    Published 10-11-2018
    “…Providing reinforcement learning agents with informationally rich human knowledge can dramatically improve various aspects of learning. Prior work has…”
    Get full text
    Journal Article