Search Results - "Huang, ShengYi"

1
An Empirical Investigation of Early Stopping Optimizations in Proximal Policy Optimization by Dossa, Rousslan Fernand Julien, Huang, Shengyi, Ontanon, Santiago, Matsubara, Takashi

Published in IEEE access (2021)
“…Code-level optimizations, which are low-level optimization techniques used in the implementation of algorithms, have generally been considered as tangential…”

Journal Article

2
Prognostic impact of preoperative prognostic nutritional index in resected advanced gastric cancer: A multicenter propensity score analysis by Luo, Zeyu, Zhou, Lin, Balde, Alpha I., Li, Zhou, He, Linyun, ZhenWei, Cai, Zou, ZeNan, Huang, ShengYi, Han, Shuai, Wei Zhou, Min, Zhang, Gang Qing, Cai, Zhai

Published in European journal of surgical oncology (01-03-2019)
“…Advanced gastric cancer (AGC) causes debilitating malnutrition and leads to deterioration of the immune response. However, the concept of the prognostic…”

Journal Article

3
Comparison of the outcomes of cytoreductive surgery versus surgery plus hyperthermic intraperitoneal chemotherapy for peritoneal carcinomatosis: a propensity score matching analysis by Li, Zhou, Redondo Ntutumu, Juan de Dios, Huang, Shengyi, Cai, Zhai, Han, Shuai, Balde, A. I., Luo, Zeyu, Fang, Suzhen

Published in Surgical endoscopy (01-06-2021)
“…Background Cytoreductive surgery (CRS) and hyperthermic intraperitoneal chemotherapy (HIPEC) are effective treatment options for selected patients with…”

Journal Article

4
Variations in plant–microbe–soil C:N:P stoichiometry along a 900-year age gradient in Torreya grandis ‘Merrillii’ plantations in Southeast China by He, Sijia, Huang, Juying, Huang, Shengyi, Fang, Zhao, Zhang, Shuoxin, Zhou, Zhichun, Wang, Bin

Published in Frontiers in sustainable food systems (30-07-2024)
“…Researches on the ecological stoichiometry of forest vegetation at different growth stages under long-term human management activities and its driving factors…”

Journal Article

5
A conceptual study on the formulation of a permeable reactive pavement with activated carbon additives for controlling the fate of non-point source environmental organic contaminants by Huang, Shengyi, Liang, Chenju

Published in Chemosphere (Oxford) (01-02-2018)
“…To take advantage of the road pavement network where non-point source (NPS) pollution such as benzene, toluene, ethyl-benzene, and xylene (BTEX) from vehicle…”

Journal Article

6
A column study of persulfate chemical oxidative regeneration of toluene gas saturated activated carbon by Jatta, Simon, Huang, Shengyi, Liang, Chenju

Published in Chemical engineering journal (Lausanne, Switzerland : 1996) (01-11-2019)
“…[Display omitted] •Thermal activated persulfate (TAP) for regenerating toluene gas saturated AC was studied.•The TAP regenerated ACs retained >90% of…”

Journal Article

7
Reproducible and Efficient Deep Reinforcement Learning by Huang, Shengyi

Published 01-01-2023
“…Deep reinforcement learning (DRL), a paradigm by which agents learn how to do tasks through trial and error, has achieved great success in many domains…”

Dissertation

8
Factors driving the assembly of prokaryotic communities in bulk soil and rhizosphere of Torreya grandis along a 900-year age gradient by Wang, Bin, Huang, Shengyi, Li, Zhengcai, Zhou, Zhichun, Huang, Juying, Yu, Hailong, Peng, Tong, Song, Yanfang, Na, Xiaofan

Published in The Science of the total environment (01-09-2022)
“…Excessive nutrient inputs imperil the stability of forest ecosystems via modifying the interactions among soil properties, microbes, and plants, particularly…”

Journal Article

9
Persulfate Chemical Functionalization of Carbon Nanotubes and Associated Adsorption Behavior in Aqueous Phase by Huang, Shengyi, Liang, Chenju, Chen, Yan-Jyun

Published in Industrial & engineering chemistry research (01-06-2016)
“…The chemical functionalization of carbon nanotubes (CNTs) using sodium persulfate (SPS) oxidation was designed to improve their dispersion stability in water…”

Journal Article

10
Differences in the dielectric properties of various benign and malignant thyroid nodules by Huang, Shengyi, Cai, Weizhen, Han, Shuai, Lin, Yu, Wang, Yu, Chen, Fei, Shao, Guoli, Liu, Yonghong, Yu, Xuefei, Cai, Zhai, Zou, Zenan, Yao, Shun, Wang, Qiaohui, Li, Zhou

Published in Medical physics (Lancaster) (01-02-2021)
“…Purpose This experiment was conducted to investigate the dielectric properties of different types of thyroid nodules. Our goal was to find a simple and fast…”

Journal Article

11
B/Al Codoped/Coated Ultra-High Nickel Cobalt-Free Material with Excellent High Voltage/Rate Cycle Stability by Zhang, Liang, Huang, Jinfu, Tang, Hongyu, Huang, Shengyi, Tang, Yang, Ma, Jianyao, Yang, Jianwen, Huang, Bin, Li, Yanwei, Xiao, Shunhua

Published in ACS sustainable chemistry & engineering (17-06-2024)
“…Ultrahigh nickel cobalt-free cathode materials have high energy density and are very promising materials for application in lithium-ion batteries. However,…”

Journal Article

12
Enhanced stratospheric intrusion at Lulin Mountain, Taiwan inferred from beryllium-7 activity by Huang, Shengyi, Huang, Pin-Ru, Newman, Sally, Li, King-Fai, Lin, Yu-Chi, Huh, Chih-An, Lin, Neng-Huei, Hsu, Shih-Chieh, Liang, Mao-Chang

Published in Atmospheric environment (1994) (01-01-2022)
“…Beryllium-7 (7Be), produced by the interaction of cosmic radiation with atoms and molecules primarily in the upper troposphere and lower stratosphere, provides…”

Journal Article

13
A Closer Look at Invalid Action Masking in Policy Gradient Algorithms by Huang, Shengyi, Ontañón, Santiago

Published 31-05-2022
“…FLAIRS. Vol. 35 (2022) In recent years, Deep Reinforcement Learning (DRL) algorithms have achieved state-of-the-art performance in many challenging strategy…”

Journal Article

14
Action Guidance: Getting the Best of Sparse Rewards and Shaped Rewards for Real-time Strategy Games by Huang, Shengyi, Ontañón, Santiago

Published 04-10-2020
“…Training agents using Reinforcement Learning in games with sparse rewards is a challenging problem, since large amounts of exploration are required to retrieve…”

Journal Article

15
Comparing Observation and Action Representations for Deep Reinforcement Learning in $\mu$RTS by Huang, Shengyi, Ontañón, Santiago

Published 26-10-2019
“…This paper presents a preliminary study comparing different observation and action space representations for Deep Reinforcement Learning (DRL) in the context…”

Journal Article

16
Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models by Noukhovitch, Michael, Huang, Shengyi, Xhonneux, Sophie, Hosseini, Arian, Agarwal, Rishabh, Courville, Aaron

Published 23-10-2024
“…The dominant paradigm for RLHF is online and on-policy RL: synchronously generating from the large language model (LLM) policy, labelling with a reward model,…”

Journal Article

17
The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization by Huang, Shengyi, Noukhovitch, Michael, Hosseini, Arian, Rasul, Kashif, Wang, Weixun, Tunstall, Lewis

Published 23-03-2024
“…This work is the first to openly reproduce the Reinforcement Learning from Human Feedback (RLHF) scaling behaviors reported in OpenAI's seminal TL;DR…”

Journal Article

18
Griddly: A platform for AI research in games by Bamford, Chris, Huang, Shengyi, Lucas, Simon

Published 12-11-2020
“…In recent years, there have been immense breakthroughs in Game AI research, particularly with Reinforcement Learning (RL). Despite their success, the…”

Journal Article

19
Reward Scale Robustness for Proximal Policy Optimization via DreamerV3 Tricks by Sullivan, Ryan, Kumar, Akarsh, Huang, Shengyi, Dickerson, John P, Suarez, Joseph

Published 26-10-2023
“…Most reinforcement learning methods rely heavily on dense, well-normalized environment rewards. DreamerV3 recently introduced a model-based method with a…”

Journal Article

20
Cleanba: A Reproducible and Efficient Distributed Reinforcement Learning Platform by Huang, Shengyi, Weng, Jiayi, Charakorn, Rujikorn, Lin, Min, Xu, Zhongwen, Ontañón, Santiago

Published 29-09-2023
“…Distributed Deep Reinforcement Learning (DRL) aims to leverage more computational resources to train autonomous agents with less training time. Despite recent…”

Journal Article
