Search Results - "Cui, Weihao"

  1.

    High-Performance Planar Broadband Hot-Electron Photodetection through Platinum-Dielectric Triple Junctions by Yang, Xiaoyan, Wang, Yongmei, Li, Yaoyao, Cui, Weihao, Hu, Junhui, Zhou, Qingjia, Shao, Weijia

    Published in Nanomaterials (Basel, Switzerland) (25-09-2024)
    “…Recently, planar and broadband hot-electron photodetectors (HE PDs) were established but exhibited degraded performances due to the adoptions of the…”
    Journal Article
  2.

    ISPA: Exploiting Intra-SM Parallelism in GPUs via Fine-grained Resource Management by Zhao, Han, Cui, Weihao, Chen, Quan, Guo, Minyi

    Published in IEEE transactions on computers (01-05-2023)
    “…Emerging GPUs have multiple Streaming Multiprocessors (SM), while each SM is comprised of CUDA Cores and Tensor Cores. While CUDA Cores do the general…”
    Journal Article
  3.

    Accelerating Sparse DNNs Based on Tiled GEMM by Guo, Cong, Xue, Fengchen, Leng, Jingwen, Qiu, Yuxian, Guan, Yue, Cui, Weihao, Chen, Quan, Guo, Minyi

    Published in IEEE transactions on computers (01-05-2024)
    “…Network pruning can reduce the computation cost of deep neural network (DNN) models. However, sparse models often produce randomly-distributed weights to…”
    Journal Article
  4.

    Planar hot-electron photodetection with polarity-switchable photocurrents controlled by the working wavelength by Shao, Weijia, Cui, Weihao, Hu, Junhui, Wang, Yongmei, Tang, Jian, Li, Xiaofeng

    Published in Optics express (17-07-2023)
    “…Hot-electron photodetection is attracting increasing interests. Based on internal photoemission mechanism, hot-electron photodetectors (HE PDs) convert…”
    Journal Article
  5.

    Adaptive Kernel Fusion for Improving the GPU Utilization while Ensuring QoS by Zhao, Han, Deng, Junxiao, Cui, Weihao, Chen, Quan, Zhang, Youtao, Zeng, Deze, Guo, Minyi

    Published in IEEE transactions on computers (09-10-2024)
    “…The prosperity of machine learning applications has promoted the rapid development of GPU architecture. It continues to integrate more CUDA Cores, larger L2…”
    Journal Article
  6.

    Toward QoS-Awareness and Improved Utilization of Spatial Multitasking GPUs by Zhang, Wei, Chen, Quan, Zheng, Ningxin, Cui, Weihao, Fu, Kaihua, Guo, Minyi

    Published in IEEE transactions on computers (01-04-2022)
    “…Datacenters use GPUs to provide the significant computing throughput required by emerging user-facing services. The diurnal user access pattern of user-facing…”
    Journal Article
  7.

    Natural Products for Drug Discovery: Discovery of Gramines as Novel Agents against a Plant Virus by Lu, Aidang, Wang, Tienan, Hui, Hao, Wei, Xiaoye, Cui, Weihao, Zhou, Chunlv, Li, Hongyan, Wang, Ziwen, Guo, Jincheng, Ma, Dejun, Wang, Qingmin

    Published in Journal of agricultural and food chemistry (27-02-2019)
    “…Plant viral diseases seriously affect crop yield and quality. The natural product gramine (1) and its simple structural analogues 2–35 were synthesized from…”
    Journal Article
  8.

    E2bird: Enhanced Elastic Batch for Improving Responsiveness and Throughput of Deep Learning Services by Cui, Weihao, Chen, Quan, Zhao, Han, Wei, Mengze, Tang, Xiaoxin, Guo, Minyi

    “…We aim to tackle existing problems about deep learning serving on GPUs in the view of the system. GPUs have been widely adopted to serve online deep…”
    Journal Article
  9.

    Improving Cluster Utilization Through Adaptive Resource Management for Deep Neural Network and CPU Jobs Colocation by Zhao, Han, Cui, Weihao, Chen, Quan, Leng, Jingwen, Zeng, Deze, Guo, Minyi

    Published in IEEE transactions on computers (01-12-2023)
    “…While deep neural network (DNN) models are mainly trained using GPUs, many companies and research institutions build shared GPU clusters. These clusters host…”
    Journal Article
  10.

    Improving Cluster Utilization through Adaptive Resource Management for DNN and CPU Jobs Co-location by Zhao, Han, Cui, Weihao, Chen, Quan, Leng, Jingwen, Zeng, Deze, Guo, Minyi

    Published in IEEE transactions on computers (09-08-2023)
    “…While deep neural network (DNN) models are mainly trained using GPUs, many companies and research institutions build shared GPU clusters. These clusters host…”
    Journal Article
  12.

    Superamphiphobic surfaces constructed by cross-linked hollow SiO2 spheres by Cui, Weihao, Wang, Tao, Yan, Aili, Wang, Sheng

    Published in Applied surface science (01-04-2017)
    “…[Display omitted] •A series of hierarchically fluorinated 3D cross-linked C@SiO2 spheres and hollow SiO2 spheres coating were fabricated.•Fluorinated hollow…”
    Journal Article
  14.

    E2bird: Enhanced Elastic Batch for Improving Responsiveness and Throughput of Deep Learning Services by Cui, Weihao, Chen, Quan, Zhao, Han, Wei, Mengze, Tang, Xiaoxin, Guo, Minyi

    “…We aim to tackle existing problems about deep learning serving on GPUs in the view of the system. GPUs have been widely adopted to serve online deep…”
    Journal Article
  15.

    Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS by Zhao, Han, Cui, Weihao, Chen, Quan, Zhang, Youtao, Lu, Yanchao, Li, Chao, Leng, Jingwen, Guo, Minyi

    “…The proliferation of machine learning applications has promoted both CUDA Cores and Tensor Cores' integration to meet their acceleration demands. While studies…”
    Conference Proceeding
  16.

    Ebird: Elastic Batch for Improving Responsiveness and Throughput of Deep Learning Services by Cui, Weihao, Wei, Mengze, Chen, Quan, Tang, Xiaoxin, Leng, Jingwen, Li, Li, Guo, Minyi

    “…GPUs have been widely adopted to serve online deep learning-based services that have stringent QoS requirements. However, emerging deep learning serving…”
    Conference Proceeding
  17.

    One-pot fabrication of single-crystalline octahedral Pd-Pt nanocrystals with enhanced electrocatalytic activity for methanol oxidation by Peng, Meiling, Xu, Wei, Cui, Weihao, Wang, Tao, Wang, Sheng

    Published in Journal of solid state electrochemistry (01-02-2017)
    “…This study reports the synthesis of octahedral Pd-Pt bimetallic alloy nanocrystals through a facile, one-pot, templateless, and seedless hydrothermal method in…”
    Journal Article
  18.

    Microless: Cost-Efficient Hybrid Deployment of Microservices on IaaS VMs and Serverless by Cheng, Jiagan, Zhao, Yilong, Li, Zijun, Chen, Quan, Cui, Weihao, Guo, Minyi

    “…Microservices have gained popularity as an architectural approach for developing scalable and modular applications. Traditionally, microservice deployment…”
    Conference Proceeding
  19.

    Exploiting Intra-SM Parallelism in GPUs via Persistent and Elastic Blocks by Zhao, Han, Cui, Weihao, Chen, Quan, Zhao, Jieru, Leng, Jingwen, Guo, Minyi

    “…Emerging GPUs have multiple Streaming Multiprocessors (SM), while each SM is comprised of CUDA Cores and Tensor Cores. While CUDA Cores do the general…”
    Conference Proceeding
  20.

    CODA: Improving Resource Utilization by Slimming and Co-locating DNN and CPU Jobs by Zhao, Han, Cui, Weihao, Chen, Quan, Leng, Jingwen, Yu, Kai, Zeng, Deze, Li, Chao, Guo, Minyi

    “…While deep neural network (DNN) models are often trained on GPUs, many companies and research institutes build GPU clusters that are shared by different…”
    Conference Proceeding