Search Results - "Dally, William"
-
1
The GPU Computing Era
Published in IEEE MICRO (01-03-2010)“…GPU computing is at a tipping point, becoming more widely used in demanding consumer applications and high-performance computing. This article describes the…”
Journal Article -
2
SpArch: Efficient Architecture for Sparse Matrix Multiplication
Published in 2020 IEEE International Symposium on High Performance Computer Architecture (HPCA) (01-02-2020)“…Generalized Sparse Matrix-Matrix Multiplication (SpGEMM) is a ubiquitous task in various engineering and scientific applications. However, inner product based…”
Conference Proceeding -
3
EIE: Efficient Inference Engine on Compressed Deep Neural Network
Published in 2016 ACM/IEEE 43rd Annual International Symposium on Computer Architecture (ISCA) (01-06-2016)“…State-of-the-art deep neural networks (DNNs) have hundreds of millions of connections and are both computationally and memory intensive, making them difficult…”
Conference Proceeding -
4
A Novel High-Efficiency Three-Phase Multilevel PV Inverter With Reduced DC-Link Capacitance
Published in IEEE transactions on industrial electronics (1982) (01-05-2023)“…In this article, we present a novel three-phase multilevel inverter (MLI) design for photovoltaic applications which does not require large dc-link capacitors…”
Journal Article -
5
Evolution of the Graphics Processing Unit (GPU)
Published in IEEE MICRO (01-11-2021)“…Graphics processing units (GPUs) power today’s fastest supercomputers, are the dominant platform for deep learning, and provide the intelligence for devices…”
Journal Article -
6
Accelerating Chip Design With Machine Learning
Published in IEEE MICRO (01-11-2020)“…Recent advancements in machine learning provide an opportunity to transform chip design workflows. We review recent research applying techniques such as deep…”
Journal Article -
7
A 0.297-pJ/Bit 50.4-Gb/s/Wire Inverter-Based Short-Reach Simultaneous Bi-Directional Transceiver for Die-to-Die Interface in 5-nm CMOS
Published in IEEE journal of solid-state circuits (01-04-2023)“…This article presents a clock-forwarded, inverter-based short-reach simultaneous bi-directional (ISR-SBD) physical layer (PHY) targeted for die-to-die…”
Journal Article -
8
LNS-Madam: Low-Precision Training in Logarithmic Number System Using Multiplicative Weight Update
Published in IEEE transactions on computers (01-12-2022)“…Representing deep neural networks (DNNs) in low-precision is a promising approach to enable efficient acceleration and memory reduction. Previous methods that…”
Journal Article -
9
GPUs and the Future of Parallel Computing
Published in IEEE MICRO (01-09-2011)“…This article discusses the capabilities of state-of-the-art GPU-based high-throughput computing systems and considers the challenges to scaling single-chip…”
Journal Article -
10
Darwin: A Genomics Coprocessor
Published in IEEE MICRO (01-05-2019)“…Long read sequencing is promising as it provides knowledge of a full spectrum of mutations in the human genome and generates more contiguous de novo…”
Journal Article -
11
A 0.32-128 TOPS, Scalable Multi-Chip-Module-Based Deep Neural Network Inference Accelerator With Ground-Referenced Signaling in 16 nm
Published in IEEE journal of solid-state circuits (01-04-2020)“…Custom accelerators improve the energy efficiency, area efficiency, and performance of deep neural network (DNN) inference. This article presents a scalable…”
Journal Article -
12
A 95.6-TOPS/W Deep Learning Inference Accelerator With Per-Vector Scaled 4-bit Quantization in 5 nm
Published in IEEE journal of solid-state circuits (01-04-2023)“…The energy efficiency of deep neural network (DNN) inference can be improved with custom accelerators. DNN inference accelerators often employ specialized…”
Journal Article -
13
The Longshore Transport Enigma and Analysis of a 10-Year Record of Wind-Driven Nearshore Currents
Published in Journal of coastal research (01-01-2018)“…Burnette, C. and Dally, W.R., 2018. The longshore transport enigma and analysis of a 10-year record of wind-driven nearshore currents. Previous analysis of a…”
Journal Article -
14
A 1.17-pJ/b, 25-Gb/s/pin Ground-Referenced Single-Ended Serial Link for Off- and On-Package Communication Using a Process- and Temperature-Adaptive Voltage Regulator
Published in IEEE journal of solid-state circuits (01-01-2019)“…This paper describes a short-reach serial link to connect chips mounted on the same package or on neighboring packages on a printed circuit board (PCB). The…”
Journal Article -
15
Elastic Buffer Flow Control for On-Chip Networks
Published in IEEE transactions on computers (01-02-2013)“…Networks-on-chip (NoCs) were developed to meet the communication requirements of large-scale systems. The majority of current NoCs spend considerable area and…”
Journal Article -
16
A 0.190-pJ/bit 25.2-Gb/s/wire Inverter-Based AC-Coupled Transceiver for Short-Reach Die-to-Die Interfaces in 5-nm CMOS
Published in IEEE journal of solid-state circuits (01-04-2024)“…This article presents an inverter-based short-reach ac-coupled toggle (ISR-ACT) link targeted for short-reach die-to-die communication over silicon interposer…”
Journal Article -
17
Evaluating the Impact of Beach Nourishment on Surfing: Surf City, Long Beach Island, New Jersey, U.S.A
Published in Journal of coastal research (01-07-2018)“…Dally, W.R. and Osiecki, D.A., 2018. Evaluating the impact of beach nourishment on surfing: Surf City, Long Beach Island, New Jersey, U.S.A. Utilizing the…”
Journal Article -
18
Energy Efficient On-Demand Dynamic Branch Prediction Models
Published in IEEE transactions on computers (01-03-2020)“…The branch predictor unit (BPU) is among the main energy consuming components in out-of-order (OoO) processors. For integer applications, we find 16 percent of…”
Journal Article -
19
Champagne: Automated Whole-Genome Phylogenomic Character Matrix Method Using Large Genomic Indels for Homoplasy-Free Inference
Published in Genome biology and evolution (02-03-2022)“…We present Champagne, a whole-genome method for generating character matrices for phylogenomic analysis using large genomic indel events. By…”
Journal Article -
20
A 0.54 pJ/b 20 Gb/s Ground-Referenced Single-Ended Short-Reach Serial Link in 28 nm CMOS for Advanced Packaging Applications
Published in IEEE journal of solid-state circuits (01-12-2013)“…High-speed signaling over high density interconnect on organic package substrates or silicon interposers offers an attractive solution to the off-chip…”
Journal Article Conference Proceeding