Search Results - "Cui, Weihao"

1
High-Performance Planar Broadband Hot-Electron Photodetection through Platinum-Dielectric Triple Junctions by Yang, Xiaoyan, Wang, Yongmei, Li, Yaoyao, Cui, Weihao, Hu, Junhui, Zhou, Qingjia, Shao, Weijia

Published in Nanomaterials (Basel, Switzerland) (25-09-2024)
“…Recently, planar and broadband hot-electron photodetectors (HE PDs) were established but exhibited degraded performances due to the adoptions of the…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
2
ISPA: Exploiting Intra-SM Parallelism in GPUs via Fine-grained Resource Management by Zhao, Han, Cui, Weihao, Chen, Quan, Guo, Minyi

Published in IEEE transactions on computers (01-05-2023)
“…Emerging GPUs have multiple Streaming Multiprocessors (SM), while each SM is comprised of CUDA Cores and Tensor Cores. While CUDA Cores do the general…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
3
Accelerating Sparse DNNs Based on Tiled GEMM by Guo, Cong, Xue, Fengchen, Leng, Jingwen, Qiu, Yuxian, Guan, Yue, Cui, Weihao, Chen, Quan, Guo, Minyi

Published in IEEE transactions on computers (01-05-2024)
“…Network pruning can reduce the computation cost of deep neural network (DNN) models. However, sparse models often produce randomly-distributed weights to…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
4
Planar hot-electron photodetection with polarity-switchable photocurrents controlled by the working wavelength by Shao, Weijia, Cui, Weihao, Hu, Junhui, Wang, Yongmei, Tang, Jian, Li, Xiaofeng

Published in Optics express (17-07-2023)
“…Hot-electron photodetection is attracting increasing interests. Based on internal photoemission mechanism, hot-electron photodetectors (HE PDs) convert…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
5
Adaptive Kernel Fusion for Improving the GPU Utilization while Ensuring QoS by Zhao, Han, Deng, Junxiao, Cui, Weihao, Chen, Quan, Zhang, Youtao, Zeng, Deze, Guo, Minyi

Published in IEEE transactions on computers (09-10-2024)
“…The prosperity of machine learning applications has promoted the rapid development of GPU architecture. It continues to integrate more CUDA Cores, larger L2…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
6
Toward QoS-Awareness and Improved Utilization of Spatial Multitasking GPUs by Zhang, Wei, Chen, Quan, Zheng, Ningxin, Cui, Weihao, Fu, Kaihua, Guo, Minyi

Published in IEEE transactions on computers (01-04-2022)
“…Datacenters use GPUs to provide the significant computing throughput required by emerging user-facing services. The diurnal user access pattern of user-facing…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
7
Natural Products for Drug Discovery: Discovery of Gramines as Novel Agents against a Plant Virus by Lu, Aidang, Wang, Tienan, Hui, Hao, Wei, Xiaoye, Cui, Weihao, Zhou, Chunlv, Li, Hongyan, Wang, Ziwen, Guo, Jincheng, Ma, Dejun, Wang, Qingmin

Published in Journal of agricultural and food chemistry (27-02-2019)
“…Plant viral diseases seriously affect crop yield and quality. The natural product gramine (1) and its simple structural analogues 2–35 were synthesized from…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
8
E2bird: Enhanced Elastic Batch for Improving Responsiveness and Throughput of Deep Learning Services by Cui, Weihao, Chen, Quan, Zhao, Han, Wei, Mengze, Tang, Xiaoxin, Guo, Minyi

Published in IEEE transactions on parallel and distributed systems (01-06-2021)
“…We aim to tackle existing problems about deep learning serving on GPUs in the view of the system. GPUs have been widely adopted to serve online deep…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
9
Improving Cluster Utilization Through Adaptive Resource Management for Deep Neural Network and CPU Jobs Colocation by Zhao, Han, Cui, Weihao, Chen, Quan, Leng, Jingwen, Zeng, Deze, Guo, Minyi

Published in IEEE transactions on computers (01-12-2023)
“…While deep neural network (DNN) models are mainly trained using GPUs, many companies and research institutions build shared GPU clusters. These clusters host…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
10
Improving Cluster Utilization through Adaptive Resource Management for DNN and CPU Jobs Co-location by Zhao, Han, Cui, Weihao, Chen, Quan, Leng, Jingwen, Zeng, Deze, Guo, Minyi

Published in IEEE transactions on computers (09-08-2023)
“…While deep neural network (DNN) models are mainly trained using GPUs, many companies and research institutions build shared GPU clusters. These clusters host…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
11
Superamphiphobic surfaces constructed by cross-linked hollow SiO 2 spheres by Cui, Weihao, Wang, Tao, Yan, Aili, Wang, Sheng

Published in Applied surface science (01-04-2017)

Get full text

Journal Article
QR Code
Save to List

Saved in:
12
Superamphiphobic surfaces constructed by cross-linked hollow SiO2 spheres by Cui, Weihao, Wang, Tao, Yan, Aili, Wang, Sheng

Published in Applied surface science (01-04-2017)
“…[Display omitted] •A series of hierarchically fluorinated 3D cross-linked C@SiO2 spheres and hollow SiO2 spheres coating were fabricated.•Fluorinated hollow…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
13
E 2 bird: E nhanced E lastic B atch for I mproving R esponsiveness and Throughput of D eep Learning Services by Cui, Weihao, Chen, Quan, Zhao, Han, Wei, Mengze, Tang, Xiaoxin, Guo, Minyi

Published in IEEE transactions on parallel and distributed systems (01-06-2021)

Get full text

Journal Article
QR Code
Save to List

Saved in:
14
E2bird: E nhanced E lastic B atch for I mproving R esponsiveness and Throughput of D eep Learning Services by Cui, Weihao, Chen, Quan, Zhao, Han, Wei, Mengze, Tang, Xiaoxin, Guo, Minyi

Published in IEEE transactions on parallel and distributed systems (01-01-2021)
“…We aim to tackle existing problems about deep learning serving on GPUs in the view of the system. GPUs have been widely adopted to serve online deep…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
15
Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS by Zhao, Han, Cui, Weihao, Chen, Quan, Zhang, Youtao, Lu, Yanchao, Li, Chao, Leng, Jingwen, Guo, Minyi

Published in 2022 IEEE International Symposium on High-Performance Computer Architecture (HPCA) (01-04-2022)
“…The proliferation of machine learning applications has promoted both CUDA Cores and Tensor Cores' integration to meet their acceleration demands. While studies…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
16
Ebird: Elastic Batch for Improving Responsiveness and Throughput of Deep Learning Services by Cui, Weihao, Wei, Mengze, Chen, Quan, Tang, Xiaoxin, Leng, Jingwen, Li, Li, Guo, Mingyi

Published in 2019 IEEE 37th International Conference on Computer Design (ICCD) (01-11-2019)
“…GPUs have been widely adopted to serve online deep learning-based services that have stringent QoS requirements. However, emerging deep learning serving…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
17
One-pot fabrication of single-crystalline octahedral Pd-Pt nanocrystals with enhanced electrocatalytic activity for methanol oxidation by Peng, Meiling, Xu, Wei, Cui, Weihao, Wang, Tao, Wang, Sheng

Published in Journal of solid state electrochemistry (01-02-2017)
“…This study reports the synthesis of octahedral Pd-Pt bimetallic alloy nanocrystals through a facile, one-pot, templateless, and seedless hydrothermal method in…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
18
Microless: Cost-Efficient Hybrid Deployment of Microservices on IaaS VMs and Serverless by Cheng, Jiagan, Zhao, Yilong, Li, Zijun, Chen, Quan, Cui, Weihao, Guo, Minyi

Published in 2023 IEEE 29th International Conference on Parallel and Distributed Systems (ICPADS) (17-12-2023)
“…Microservices have gained popularity as an architectural approach for developing scalable and modular applications. Traditionally, microservice deployment…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
19
Exploiting Intra-SM Parallelism in GPUs via Persistent and Elastic Blocks by Zhao, Han, Cui, Weihao, Chen, Quan, Zhao, Jieru, Leng, Jingwen, Guo, Minyi

Published in 2021 IEEE 39th International Conference on Computer Design (ICCD) (01-10-2021)
“…Emerging GPUs have multiple Streaming Multiprocessors (SM), while each SM is comprised of CUDA Cores and Tensor Cores. While CUDA Cores do the general…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
20
CODA: Improving Resource Utilization by Slimming and Co-locating DNN and CPU Jobs by Zhao, Han, Cui, Weihao, Chen, Quan, Leng, Jingwen, Yu, Kai, Zeng, Deze, Li, Chao, Guo, Minyi

Published in 2020 IEEE 40th International Conference on Distributed Computing Systems (ICDCS) (01-11-2020)
“…While deep neural network (DNN) models are often trained on GPUs, many companies and research institutes build GPU clusters that are shared by different…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:

Search Results - "Cui, Weihao"

High-Performance Planar Broadband Hot-Electron Photodetection through Platinum-Dielectric Triple Junctions by Yang, Xiaoyan, Wang, Yongmei, Li, Yaoyao, Cui, Weihao, Hu, Junhui, Zhou, Qingjia, Shao, Weijia

ISPA: Exploiting Intra-SM Parallelism in GPUs via Fine-grained Resource Management by Zhao, Han, Cui, Weihao, Chen, Quan, Guo, Minyi

Accelerating Sparse DNNs Based on Tiled GEMM by Guo, Cong, Xue, Fengchen, Leng, Jingwen, Qiu, Yuxian, Guan, Yue, Cui, Weihao, Chen, Quan, Guo, Minyi

Planar hot-electron photodetection with polarity-switchable photocurrents controlled by the working wavelength by Shao, Weijia, Cui, Weihao, Hu, Junhui, Wang, Yongmei, Tang, Jian, Li, Xiaofeng

Adaptive Kernel Fusion for Improving the GPU Utilization while Ensuring QoS by Zhao, Han, Deng, Junxiao, Cui, Weihao, Chen, Quan, Zhang, Youtao, Zeng, Deze, Guo, Minyi

Toward QoS-Awareness and Improved Utilization of Spatial Multitasking GPUs by Zhang, Wei, Chen, Quan, Zheng, Ningxin, Cui, Weihao, Fu, Kaihua, Guo, Minyi

Natural Products for Drug Discovery: Discovery of Gramines as Novel Agents against a Plant Virus by Lu, Aidang, Wang, Tienan, Hui, Hao, Wei, Xiaoye, Cui, Weihao, Zhou, Chunlv, Li, Hongyan, Wang, Ziwen, Guo, Jincheng, Ma, Dejun, Wang, Qingmin

E2bird: Enhanced Elastic Batch for Improving Responsiveness and Throughput of Deep Learning Services by Cui, Weihao, Chen, Quan, Zhao, Han, Wei, Mengze, Tang, Xiaoxin, Guo, Minyi

Improving Cluster Utilization Through Adaptive Resource Management for Deep Neural Network and CPU Jobs Colocation by Zhao, Han, Cui, Weihao, Chen, Quan, Leng, Jingwen, Zeng, Deze, Guo, Minyi

Improving Cluster Utilization through Adaptive Resource Management for DNN and CPU Jobs Co-location by Zhao, Han, Cui, Weihao, Chen, Quan, Leng, Jingwen, Zeng, Deze, Guo, Minyi

Superamphiphobic surfaces constructed by cross-linked hollow SiO 2 spheres by Cui, Weihao, Wang, Tao, Yan, Aili, Wang, Sheng

Superamphiphobic surfaces constructed by cross-linked hollow SiO2 spheres by Cui, Weihao, Wang, Tao, Yan, Aili, Wang, Sheng

E 2 bird: E nhanced E lastic B atch for I mproving R esponsiveness and Throughput of D eep Learning Services by Cui, Weihao, Chen, Quan, Zhao, Han, Wei, Mengze, Tang, Xiaoxin, Guo, Minyi

E2bird: E nhanced E lastic B atch for I mproving R esponsiveness and Throughput of D eep Learning Services by Cui, Weihao, Chen, Quan, Zhao, Han, Wei, Mengze, Tang, Xiaoxin, Guo, Minyi

Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS by Zhao, Han, Cui, Weihao, Chen, Quan, Zhang, Youtao, Lu, Yanchao, Li, Chao, Leng, Jingwen, Guo, Minyi

Ebird: Elastic Batch for Improving Responsiveness and Throughput of Deep Learning Services by Cui, Weihao, Wei, Mengze, Chen, Quan, Tang, Xiaoxin, Leng, Jingwen, Li, Li, Guo, Mingyi

One-pot fabrication of single-crystalline octahedral Pd-Pt nanocrystals with enhanced electrocatalytic activity for methanol oxidation by Peng, Meiling, Xu, Wei, Cui, Weihao, Wang, Tao, Wang, Sheng

Microless: Cost-Efficient Hybrid Deployment of Microservices on IaaS VMs and Serverless by Cheng, Jiagan, Zhao, Yilong, Li, Zijun, Chen, Quan, Cui, Weihao, Guo, Minyi

Exploiting Intra-SM Parallelism in GPUs via Persistent and Elastic Blocks by Zhao, Han, Cui, Weihao, Chen, Quan, Zhao, Jieru, Leng, Jingwen, Guo, Minyi

CODA: Improving Resource Utilization by Slimming and Co-locating DNN and CPU Jobs by Zhao, Han, Cui, Weihao, Chen, Quan, Leng, Jingwen, Yu, Kai, Zeng, Deze, Li, Chao, Guo, Minyi

Search Tools:

Refine Results

Format

Subject Area

Topic

Language

Year of Publication