Search Results - "Wu, Ephrem"
-
1
Advancing high performance heterogeneous integration through die stacking
Published in 2012 Proceedings of the ESSCIRC (ESSCIRC) (01-09-2012)“…This paper describes the industry's first heterogeneous Stacked Silicon Interconnect (SSI) FPGA family (3D integration). Each device is housed in a…”
Get full text
Conference Proceeding -
2
MLPerf Inference Benchmark
Published in 2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture (ISCA) (01-05-2020)“…Machine-learning (ML) hardware and software system demand is burgeoning. Driven by ML applications, the number of different ML inference systems has exploded…”
Get full text
Conference Proceeding -
3
Learning Accurate Integer Transformer Machine-Translation Models
Published 03-01-2020“…We describe a method for training accurate Transformer machine-translation models to run inference using 8-bit integer (INT8) hardware matrix multipliers, as…”
Get full text
Journal Article -
4
Interconnect and package design of a heterogeneous stacked-silicon FPGA
Published in Proceedings of the IEEE 2013 Custom Integrated Circuits Conference (01-09-2013)“…This paper reviews the interconnect and package design of a heterogeneous stacked-silicon FPGA. Up to five dice from two die types are mounted on a passive…”
Get full text
Conference Proceeding -
5
Advancing high performance heterogeneous integration through die stacking
Published in 2012 Proceedings of the European Solid-State Device Research Conference (ESSDERC) (01-09-2012)“…This paper describes the industry's first heterogeneous Stacked Silicon Interconnect (SSI) FPGA family (3D integration). Each device is housed in a…”
Get full text
Conference Proceeding -
6
Optical backplanes with 3D integrated photonics?
Published in 2012 IEEE Hot Chips 24 Symposium (HCS) (01-08-2012)“…This article consists of a collection of slides from the author's conference presentation on the deployment of optical backplanes with 3D integrated photonics…”
Get full text
Conference Proceeding -
7
A Data-Center FPGA Acceleration Platform for Convolutional Neural Networks
Published in 2019 29th International Conference on Field Programmable Logic and Applications (FPL) (01-09-2019)“…Intensive computation is entering data centers with multiple workloads of deep learning. To balance the compute efficiency, performance, and total cost of…”
Get full text
Conference Proceeding -
8
A high-throughput reconfigurable processing array for neural networks
Published in 2017 27th International Conference on Field Programmable Logic and Applications (FPL) (01-09-2017)“…FPGA-based neural-networks typically leave performance on the table because the DSP resources run at less than a third of the peak clock rate. This paper…”
Get full text
Conference Proceeding -
9
FPGAs with 28Gb/s transceivers built with heterogeneous stacked-silicon interconnects
Published in 2012 IEEE Hot Chips 24 Symposium (HCS) (01-08-2012)“…This article consists of a collection of slides from the author's conference presentation on Xilinx's field programmable gate arrays (FPGA) that deploy 28 Gb/s…”
Get full text
Conference Proceeding -
10
A Data-Center FPGA Acceleration Platform for Convolutional Neural Networks
Published 17-09-2019“…Intensive computation is entering data centers with multiple workloads of deep learning. To balance the compute efficiency, performance, and total cost of…”
Get full text
Journal Article -
11
MLPerf Inference Benchmark
Published 06-11-2019“…Machine-learning (ML) hardware and software system demand is burgeoning. Driven by ML applications, the number of different ML inference systems has exploded…”
Get full text
Journal Article -
12
A framework for scaling future backplanes
Published in IEEE communications magazine (01-11-2012)“…As line interfaces in communications chassis transition to 100 Gb/s and higher per port, many in the industry question when electrical backplanes inside these…”
Get full text
Magazine Article -
13
Noise reduction using multi-frame motion estimation, with outlier rejection and trajectory correction
Published in 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing (1993)“…A novel noise reduction (NR) system, MMOT-NR, is proposed. It consists of a multiframe block-based motion estimator which uses the least trimmed square (LTS)…”
Get full text
Conference Proceeding -
14
The Princeton University behavioral synthesis system
Published in [1992] Proceedings 29th ACM/IEEE Design Automation Conference (1992)“…The Princeton University behavioral synthesis system (PUBSS) is a high-level synthesis system targeted to control-dominated machines. PUBSS compiles a very…”
Get full text
Conference Proceeding