Search Results - "Huang, ShengYi"
-
1
An Empirical Investigation of Early Stopping Optimizations in Proximal Policy Optimization
Published in IEEE access (2021)“…Code-level optimizations, which are low-level optimization techniques used in the implementation of algorithms, have generally been considered as tangential…”
Get full text
Journal Article -
2
Prognostic impact of preoperative prognostic nutritional index in resected advanced gastric cancer: A multicenter propensity score analysis
Published in European journal of surgical oncology (01-03-2019)“…Advanced gastric cancer (AGC) causes debilitating malnutrition and leads to deterioration of the immune response. However, the concept of the prognostic…”
Get full text
Journal Article -
3
Comparison of the outcomes of cytoreductive surgery versus surgery plus hyperthermic intraperitoneal chemotherapy for peritoneal carcinomatosis: a propensity score matching analysis
Published in Surgical endoscopy (01-06-2021)“…Background Cytoreductive surgery (CRS) and hyperthermic intraperitoneal chemotherapy (HIPEC) are effective treatment options for selected patients with…”
Get full text
Journal Article -
4
Variations in plant–microbe–soil C:N:P stoichiometry along a 900-year age gradient in Torreya grandis ‘Merrillii’ plantations in Southeast China
Published in Frontiers in sustainable food systems (30-07-2024)“…Researches on the ecological stoichiometry of forest vegetation at different growth stages under long-term human management activities and its driving factors…”
Get full text
Journal Article -
5
A conceptual study on the formulation of a permeable reactive pavement with activated carbon additives for controlling the fate of non-point source environmental organic contaminants
Published in Chemosphere (Oxford) (01-02-2018)“…To take advantage of the road pavement network where non-point source (NPS) pollution such as benzene, toluene, ethyl-benzene, and xylene (BTEX) from vehicle…”
Get full text
Journal Article -
6
A column study of persulfate chemical oxidative regeneration of toluene gas saturated activated carbon
Published in Chemical engineering journal (Lausanne, Switzerland : 1996) (01-11-2019)“…[Display omitted] •Thermal activated persulfate (TAP) for regenerating toluene gas saturated AC was studied.•The TAP regenerated ACs retained >90% of…”
Get full text
Journal Article -
7
Reproducible and Efficient Deep Reinforcement Learning
Published 01-01-2023“…Deep reinforcement learning (DRL), a paradigm by which agents learn how to do tasks through trial and error, has achieved great success in many domains…”
Get full text
Dissertation -
8
Factors driving the assembly of prokaryotic communities in bulk soil and rhizosphere of Torreya grandis along a 900-year age gradient
Published in The Science of the total environment (01-09-2022)“…Excessive nutrient inputs imperil the stability of forest ecosystems via modifying the interactions among soil properties, microbes, and plants, particularly…”
Get full text
Journal Article -
9
Persulfate Chemical Functionalization of Carbon Nanotubes and Associated Adsorption Behavior in Aqueous Phase
Published in Industrial & engineering chemistry research (01-06-2016)“…The chemical functionalization of carbon nanotubes (CNTs) using sodium persulfate (SPS) oxidation was designed to improve their dispersion stability in water…”
Get full text
Journal Article -
10
Differences in the dielectric properties of various benign and malignant thyroid nodules
Published in Medical physics (Lancaster) (01-02-2021)“…Purpose This experiment was conducted to investigate the dielectric properties of different types of thyroid nodules. Our goal was to find a simple and fast…”
Get full text
Journal Article -
11
B/Al Codoped/Coated Ultra-High Nickel Cobalt-Free Material with Excellent High Voltage/Rate Cycle Stability
Published in ACS sustainable chemistry & engineering (17-06-2024)“…Ultrahigh nickel cobalt-free cathode materials have high energy density and are very promising materials for application in lithium-ion batteries. However,…”
Get full text
Journal Article -
12
Enhanced stratospheric intrusion at Lulin Mountain, Taiwan inferred from beryllium-7 activity
Published in Atmospheric environment (1994) (01-01-2022)“…Beryllium-7 (7Be), produced by the interaction of cosmic radiation with atoms and molecules primarily in the upper troposphere and lower stratosphere, provides…”
Get full text
Journal Article -
13
A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
Published 31-05-2022“…FLAIRS. Vol. 35 (2022) In recent years, Deep Reinforcement Learning (DRL) algorithms have achieved state-of-the-art performance in many challenging strategy…”
Get full text
Journal Article -
14
Action Guidance: Getting the Best of Sparse Rewards and Shaped Rewards for Real-time Strategy Games
Published 04-10-2020“…Training agents using Reinforcement Learning in games with sparse rewards is a challenging problem, since large amounts of exploration are required to retrieve…”
Get full text
Journal Article -
15
Comparing Observation and Action Representations for Deep Reinforcement Learning in $\mu$RTS
Published 26-10-2019“…This paper presents a preliminary study comparing different observation and action space representations for Deep Reinforcement Learning (DRL) in the context…”
Get full text
Journal Article -
16
Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models
Published 23-10-2024“…The dominant paradigm for RLHF is online and on-policy RL: synchronously generating from the large language model (LLM) policy, labelling with a reward model,…”
Get full text
Journal Article -
17
The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization
Published 23-03-2024“…This work is the first to openly reproduce the Reinforcement Learning from Human Feedback (RLHF) scaling behaviors reported in OpenAI's seminal TL;DR…”
Get full text
Journal Article -
18
Griddly: A platform for AI research in games
Published 12-11-2020“…In recent years, there have been immense breakthroughs in Game AI research, particularly with Reinforcement Learning (RL). Despite their success, the…”
Get full text
Journal Article -
19
Reward Scale Robustness for Proximal Policy Optimization via DreamerV3 Tricks
Published 26-10-2023“…Most reinforcement learning methods rely heavily on dense, well-normalized environment rewards. DreamerV3 recently introduced a model-based method with a…”
Get full text
Journal Article -
20
Cleanba: A Reproducible and Efficient Distributed Reinforcement Learning Platform
Published 29-09-2023“…Distributed Deep Reinforcement Learning (DRL) aims to leverage more computational resources to train autonomous agents with less training time. Despite recent…”
Get full text
Journal Article