Search Results - "Niu, Dimin"
-
1
Overcoming the challenges of crossbar resistive memory architectures
Published in 2015 IEEE 21st International Symposium on High Performance Computer Architecture (HPCA) (01-02-2015)“…The scalability of DRAM faces challenges from increasing power consumption and the difficulty of building high aspect ratio capacitors. Consequently, emerging…”
Get full text
Conference Proceeding -
2
Practical Near-Data-Processing Architecture for Large-Scale Distributed Graph Neural Network
Published in IEEE access (2022)“…Graph Neural Networks have drawn tremendous attention in the past few years due to their convincing performance and high interpretability in various…”
Get full text
Journal Article -
3
OpSparse: A Highly Optimized Framework for Sparse General Matrix Multiplication on GPUs
Published in IEEE access (2022)“…Sparse general matrix multiplication (SpGEMM) is an important and expensive computation primitive in many real-world applications. Due to SpGEMM's inherent…”
Get full text
Journal Article -
4
Accelerating CPU-Based Sparse General Matrix Multiplication With Binary Row Merging
Published in IEEE access (2022)“…Sparse general matrix multiplication (SpGEMM) is a fundamental building block for many real-world applications. Since SpGEMM is a well-known memory-bounded…”
Get full text
Journal Article -
5
Impact of process variations on emerging memristor
Published in Design Automation Conference (01-06-2010)“…The memristor, known as the fourth basic two-terminal circuit element, has attracted many research interests since the first real device was developed by HP…”
Get full text
Conference Proceeding -
6
DRAF: A Low-Power DRAM-Based Reconfigurable Acceleration Fabric
Published in IEEE MICRO (2017)“…The DRAM-Based Reconfigurable Acceleration Fabric (DRAF) uses commodity DRAM technology to implement a bit-level, reconfigurable fabric that improves area…”
Get full text
Journal Article -
7
Low power multi-level-cell resistive memory design with incomplete data mapping
Published in 2013 IEEE 31st International Conference on Computer Design (ICCD) (01-10-2013)“…Phase change memory (PCM) has been widely studied as a potential DRAM alternative. The multi-level cell (MLC) can further increase the memory density and…”
Get full text
Conference Proceeding -
8
Understanding the trade-offs in multi-level cell ReRAM memory design
Published in 2013 50th ACM/EDAC/IEEE Design Automation Conference (DAC) (29-05-2013)“…Resistive Random Access Memory (ReRAM) is one of the most promising emerging memory technologies as a potential replacement for DRAM memory and/or NAND Flash…”
Get full text
Conference Proceeding -
9
Cost-effective integration of three-dimensional (3D) ICs emphasizing testing cost analysis
Published in 2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD) (01-11-2010)“…Three-dimensional (3D) ICs promise to overcome barriers in interconnect scaling by leveraging fast, dense inter-die vias, thereby offering benefits of improved…”
Get full text
Conference Proceeding -
10
DRAF: A Low-Power DRAM-Based Reconfigurable Acceleration Fabric
Published in 2016 ACM/IEEE 43rd Annual International Symposium on Computer Architecture (ISCA) (01-06-2016)“…FPGAs are a popular target for application-specific accelerators because they lead to a good balance between flexibility and energy efficiency. However, FPGA…”
Get full text
Conference Proceeding -
11
Impact of Write Pulse and Process Variation on 22 nm FinFET-Based STT-RAM Design: A Device-Architecture Co-Optimization Approach
Published in IEEE transactions on multi-scale computing systems (01-10-2015)“…Spin-transfer torque random access memory (STT-RAM) is a promising candidate for universal memory due to its speed, scalability, and non-volatility. A wide…”
Get full text
Journal Article -
12
Energy-efficient multi-level cell phase-change memory system with data encoding
Published in 2011 IEEE 29th International Conference on Computer Design (ICCD) (01-10-2011)“…Phase-change memory (PCM) is one of the most promising technologies among emerging non-volatile memories. Recently, the technology of multi-level cell (MLC)…”
Get full text
Conference Proceeding -
13
A Hybrid solid-state storage architecture for the performance, energy consumption, and lifetime improvement
Published in HPCA - 16 2010 The Sixteenth International Symposium on High-Performance Computer Architecture (01-01-2010)“…In recent years, many systems have employed NAND flash memory as storage devices because of its advantages of higher performance (compared to the traditional…”
Get full text
Conference Proceeding -
14
SCOPE: a stochastic computing engine for DRAM-based in-situ accelerator
Published in 2018 51st Annual IEEE/ACM International Symposium on Microarchitecture (MICRO) (01-10-2018)“…Memory-centric architecture, which bridges the gap between compute and memory, is considered as a promising solution to tackle the memory wall and the power…”
Get full text
Conference Proceeding -
15
SBAC: A statistics based cache bypassing method for asymmetric-access caches
Published in 2014 IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED) (01-08-2014)“…Asymmetric-access caches with emerging technologies, such as STT-RAM and RRAM, have become very competitive designs recently. Since the write operations…”
Get full text
Conference Proceeding -
16
DRISA: a DRAM-based Reconfigurable In-Situ Accelerator
Published in 2017 50th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO) (14-10-2017)“…Data movement between the processing units and the memory in traditional von Neumann architecture is creating the "memory wall" problem. To bridge the gap, two…”
Get full text
Conference Proceeding -
17
iPIM: Programmable In-Memory Image Processing Accelerator Using Near-Bank Architecture
Published in 2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture (ISCA) (01-05-2020)“…Image processing is becoming an increasingly important domain for many applications on workstations and the datacenter that require accelerators for high…”
Get full text
Conference Proceeding -
18
Design of cross-point metal-oxide ReRAM emphasizing reliability and cost
Published in 2013 IEEE/ACM International Conference on Computer-Aided Design (ICCAD) (01-11-2013)“…Metal-Oxide Resistive Random Access Memory (ReRAM) technology is gaining popularity due to its superior write bandwidth, high density, and low operating power…”
Get full text
Conference Proceeding -
19
Flatfish: A Reinforcement Learning Approach for Application-Aware Address Mapping
Published in IEEE transactions on computer-aided design of integrated circuits and systems (01-11-2022)“…The DRAM performance has become a critical bottleneck of modern computing systems. Prior studies have proposed various optimization techniques on address…”
Get full text
Journal Article -
20
DLUX: A LUT-Based Near-Bank Accelerator for Data Center Deep Learning Training Workloads
Published in IEEE transactions on computer-aided design of integrated circuits and systems (01-08-2021)“…The frequent data movement between the processor and the memory has become a severe performance bottleneck for deep neural network (DNN) training workloads in…”
Get full text
Journal Article