Search Results - "Cui, Weihao"
-
1
High-Performance Planar Broadband Hot-Electron Photodetection through Platinum-Dielectric Triple Junctions
Published in Nanomaterials (Basel, Switzerland) (25-09-2024)“…Recently, planar and broadband hot-electron photodetectors (HE PDs) were established but exhibited degraded performances due to the adoptions of the…”
Get full text
Journal Article -
2
ISPA: Exploiting Intra-SM Parallelism in GPUs via Fine-grained Resource Management
Published in IEEE transactions on computers (01-05-2023)“…Emerging GPUs have multiple Streaming Multiprocessors (SM), while each SM is comprised of CUDA Cores and Tensor Cores. While CUDA Cores do the general…”
Get full text
Journal Article -
3
Accelerating Sparse DNNs Based on Tiled GEMM
Published in IEEE transactions on computers (01-05-2024)“…Network pruning can reduce the computation cost of deep neural network (DNN) models. However, sparse models often produce randomly-distributed weights to…”
Get full text
Journal Article -
4
Planar hot-electron photodetection with polarity-switchable photocurrents controlled by the working wavelength
Published in Optics express (17-07-2023)“…Hot-electron photodetection is attracting increasing interests. Based on internal photoemission mechanism, hot-electron photodetectors (HE PDs) convert…”
Get full text
Journal Article -
5
Adaptive Kernel Fusion for Improving the GPU Utilization while Ensuring QoS
Published in IEEE transactions on computers (09-10-2024)“…The prosperity of machine learning applications has promoted the rapid development of GPU architecture. It continues to integrate more CUDA Cores, larger L2…”
Get full text
Journal Article -
6
Toward QoS-Awareness and Improved Utilization of Spatial Multitasking GPUs
Published in IEEE transactions on computers (01-04-2022)“…Datacenters use GPUs to provide the significant computing throughput required by emerging user-facing services. The diurnal user access pattern of user-facing…”
Get full text
Journal Article -
7
Natural Products for Drug Discovery: Discovery of Gramines as Novel Agents against a Plant Virus
Published in Journal of agricultural and food chemistry (27-02-2019)“…Plant viral diseases seriously affect crop yield and quality. The natural product gramine (1) and its simple structural analogues 2–35 were synthesized from…”
Get full text
Journal Article -
8
E2bird: Enhanced Elastic Batch for Improving Responsiveness and Throughput of Deep Learning Services
Published in IEEE transactions on parallel and distributed systems (01-06-2021)“…We aim to tackle existing problems about deep learning serving on GPUs in the view of the system. GPUs have been widely adopted to serve online deep…”
Get full text
Journal Article -
9
Improving Cluster Utilization Through Adaptive Resource Management for Deep Neural Network and CPU Jobs Colocation
Published in IEEE transactions on computers (01-12-2023)“…While deep neural network (DNN) models are mainly trained using GPUs, many companies and research institutions build shared GPU clusters. These clusters host…”
Get full text
Journal Article -
10
Improving Cluster Utilization through Adaptive Resource Management for DNN and CPU Jobs Co-location
Published in IEEE transactions on computers (09-08-2023)“…While deep neural network (DNN) models are mainly trained using GPUs, many companies and research institutions build shared GPU clusters. These clusters host…”
Get full text
Journal Article -
11
Superamphiphobic surfaces constructed by cross-linked hollow SiO 2 spheres
Published in Applied surface science (01-04-2017)Get full text
Journal Article -
12
Superamphiphobic surfaces constructed by cross-linked hollow SiO2 spheres
Published in Applied surface science (01-04-2017)“…[Display omitted] •A series of hierarchically fluorinated 3D cross-linked C@SiO2 spheres and hollow SiO2 spheres coating were fabricated.•Fluorinated hollow…”
Get full text
Journal Article -
13
E 2 bird: E nhanced E lastic B atch for I mproving R esponsiveness and Throughput of D eep Learning Services
Published in IEEE transactions on parallel and distributed systems (01-06-2021)Get full text
Journal Article -
14
E2bird: E nhanced E lastic B atch for I mproving R esponsiveness and Throughput of D eep Learning Services
Published in IEEE transactions on parallel and distributed systems (01-01-2021)“…We aim to tackle existing problems about deep learning serving on GPUs in the view of the system. GPUs have been widely adopted to serve online deep…”
Get full text
Journal Article -
15
Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS
Published in 2022 IEEE International Symposium on High-Performance Computer Architecture (HPCA) (01-04-2022)“…The proliferation of machine learning applications has promoted both CUDA Cores and Tensor Cores' integration to meet their acceleration demands. While studies…”
Get full text
Conference Proceeding -
16
Ebird: Elastic Batch for Improving Responsiveness and Throughput of Deep Learning Services
Published in 2019 IEEE 37th International Conference on Computer Design (ICCD) (01-11-2019)“…GPUs have been widely adopted to serve online deep learning-based services that have stringent QoS requirements. However, emerging deep learning serving…”
Get full text
Conference Proceeding -
17
One-pot fabrication of single-crystalline octahedral Pd-Pt nanocrystals with enhanced electrocatalytic activity for methanol oxidation
Published in Journal of solid state electrochemistry (01-02-2017)“…This study reports the synthesis of octahedral Pd-Pt bimetallic alloy nanocrystals through a facile, one-pot, templateless, and seedless hydrothermal method in…”
Get full text
Journal Article -
18
Microless: Cost-Efficient Hybrid Deployment of Microservices on IaaS VMs and Serverless
Published in 2023 IEEE 29th International Conference on Parallel and Distributed Systems (ICPADS) (17-12-2023)“…Microservices have gained popularity as an architectural approach for developing scalable and modular applications. Traditionally, microservice deployment…”
Get full text
Conference Proceeding -
19
Exploiting Intra-SM Parallelism in GPUs via Persistent and Elastic Blocks
Published in 2021 IEEE 39th International Conference on Computer Design (ICCD) (01-10-2021)“…Emerging GPUs have multiple Streaming Multiprocessors (SM), while each SM is comprised of CUDA Cores and Tensor Cores. While CUDA Cores do the general…”
Get full text
Conference Proceeding -
20
CODA: Improving Resource Utilization by Slimming and Co-locating DNN and CPU Jobs
Published in 2020 IEEE 40th International Conference on Distributed Computing Systems (ICDCS) (01-11-2020)“…While deep neural network (DNN) models are often trained on GPUs, many companies and research institutes build GPU clusters that are shared by different…”
Get full text
Conference Proceeding