Search Results - "Yan, Guihai"
1
SqueezeFlow: A Sparse CNN Accelerator Exploiting Concise Convolution Rules
Published in IEEE transactions on computers (01-11-2019)“…Convolutional Neural Networks (CNNs) have been widely used in machine learning tasks. While delivering state-of-the-art accuracy, CNNs are known as both…”
Journal Article
2
DPU-Direct: Unleashing Remote Accelerators via Enhanced RDMA for Disaggregated Datacenters
Published in IEEE transactions on computers (01-08-2024)“…This paper presents DPU-Direct, an accelerator disaggregation system that connects accelerator nodes (ANs) and CPU nodes (CNs) over a standard Remote Direct…”
Journal Article
3
FlexFlow: A Flexible Dataflow Accelerator Architecture for Convolutional Neural Networks
Published in 2017 IEEE International Symposium on High Performance Computer Architecture (HPCA) (01-02-2017)“…Convolutional Neural Networks (CNN) are very computation-intensive. Recently, a lot of CNN accelerators based on the CNN intrinsic parallelism are proposed…”
Conference Proceeding
4
Monocular 3D Multi-Person Pose Estimation for On-Site Joint Flexion Assessment: A Case of Extreme Knee Flexion Detection
Published in Sensors (Basel, Switzerland) (24-09-2024)“…Work-related musculoskeletal disorders (WMSDs) represent a significant health challenge for workers in construction environments, often arising from prolonged…”
Journal Article
5
Joint Design of Training and Hardware Towards Efficient and Accuracy-Scalable Neural Network Inference
Published in IEEE journal on emerging and selected topics in circuits and systems (01-12-2018)“…The intrinsic error tolerance of neural network (NN) presents opportunities for approximate computing techniques to improve the energy efficiency of NN…”
Journal Article
6
CoreRank: Redeeming "Sick Silicon" by Dynamically Quantifying Core-Level Healthy Condition
Published in IEEE transactions on computers (01-03-2016)“…In field degradation of manycore processors poses a grand challenge to core management, largely because the degradation is hard to quantify. We propose a novel…”
Journal Article
7
DOE: database offloading engine for accelerating SQL processing
Published in Distributed and parallel databases : an international journal (01-09-2023)“…The CPU-Accelerator heterogeneous systems have demonstrated performance and efficiency benefits on DBMSs. However, the CPU-Cache-DRAM architecture can not…”
Journal Article
8
Promoting the Harmony between Sparsity and Regularity: A Relaxed Synchronous Architecture for Convolutional Neural Networks
Published in IEEE transactions on computers (01-06-2019)“…There are two approaches to improve the performance of Convolutional Neural Networks (CNNs): 1) accelerating computation and 2) reducing the amount of…”
Journal Article
9
ShuttleNoC: Power-Adaptable Communication Infrastructure for Many-Core Processors
Published in IEEE transactions on computer-aided design of integrated circuits and systems (01-08-2019)“…Networks-on-chip (NoCs), as the communication infrastructure in many-core processors, has demonstrated remarkable power consumption along with the technology…”
Journal Article
10
SmartShuttle: Optimizing off-chip memory accesses for deep learning accelerators
Published in 2018 Design, Automation & Test in Europe Conference & Exhibition (DATE) (01-03-2018)“…Convolutional Neural Network (CNN) accelerators are rapidly growing in popularity as a promising solution for deep learning based applications. Though…”
Conference Proceeding
11
An Analytical Framework for Estimating Scale-Out and Scale-Up Power Efficiency of Heterogeneous Manycores
Published in IEEE transactions on computers (01-02-2016)“…Heterogeneous manycore architectures have shown to be highly promising to boost power efficiency through two independent ways: (1) enabling massive…”
Journal Article
12
EcoUp: Towards Economical Datacenter Upgrading
Published in IEEE transactions on parallel and distributed systems (01-07-2016)“…The rapid growth of cloud services dictates increasingly powerful datacenters to maintain the high quality of service (QoS). It's a common practice in…”
Journal Article
13
Automated Defect Detection on Dry-Hanging Stone Curtain Walls through Colored Point Clouds
Published in Buildings (Basel) (01-09-2024)“…Stone curtain walls are widely used in contemporary architectures; however, their regular inspection is always labor-intensive, time-consuming, and hazardous…”
Journal Article
14
Orchestrator: Guarding Against Voltage Emergencies in Multithreaded Applications
Published in IEEE transactions on very large scale integration (VLSI) systems (01-12-2014)“…Voltage emergency (VE) has become a critical challenge with decreasing feature size and increasing power capacity. Destructive core interference is one main…”
Journal Article
15
ReviveNet: A Self-Adaptive Architecture for Improving Lifetime Reliability via Localized Timing Adaptation
Published in IEEE transactions on computers (01-09-2011)“…The aggressive technology scaling poses serious challenges to lifetime reliability. A paramount challenge comes from a variety of aging mechanisms that can…”
Journal Article
16
RISO: Enforce Noninterfered Performance With Relaxed Network-on-Chip Isolation in Many-Core Cloud Processors
Published in IEEE transactions on very large scale integration (VLSI) systems (01-12-2015)“…Workload consolidation is widely used in modern cloud processors to reduce total cost of ownership. Performance isolation has to be enforced between…”
Journal Article
17
CCR: A concise convolution rule for sparse neural network accelerators
Published in 2018 Design, Automation & Test in Europe Conference & Exhibition (DATE) (01-03-2018)“…Convolutional Neural networks (CNNs) have achieved great success in a broad range of applications. As CNN-based methods are often both computation and memory…”
Conference Proceeding
18
SVFD: A Versatile Online Fault Detection Scheme via Checking of Stability Violation
Published in IEEE transactions on very large scale integration (VLSI) systems (01-09-2011)“…In ultra-deep submicrometer technology, soft errors and device aging are two of the paramount reliability concerns. Although many studies have been done to…”
Journal Article
19
Exploiting the Potential of Computation Reuse Through Approximate Computing
Published in IEEE transactions on multi-scale computing systems (01-07-2017)“…Approximate computing, which tackles tradeoff between computation quality (e.g., accuracy) and computation efforts, is becoming a promising technique to…”
Journal Article
20
PowerTrader: Enforcing Autonomous Power Management for Future Large-Scale Many-Core Processors
Published in IEEE transactions on multi-scale computing systems (01-10-2017)“…Existing power management approaches for modern many-core processors resort to "centralized" design concept, aiming to optimize chip performance under fixed…”
Journal Article