Search Results - "Baas, Bevan M"
-
1
Parallel AES Encryption Engines for Many-Core Processor Arrays
Published in IEEE transactions on computers (01-03-2013)“…By exploring different granularities of data-level and task-level parallelism, we map 16 implementations of an Advanced Encryption Standard (AES) cipher with…”
Get full text
Journal Article -
2
Display Stream Compression Decoders for Fine-Grained Many-Core Processor Arrays
Published in IEEE transactions on circuits and systems. II, Express briefs (01-05-2021)“…This brief presents two software Display Stream Compression (DSC) video decoder designs for many-core processor arrays. The first design exploits fine-grained…”
Get full text
Journal Article -
3
KiloCore: A 32-nm 1000-Processor Computational Array
Published in IEEE journal of solid-state circuits (01-04-2017)“…A processor array containing 1000 independent processors and 12 memory modules was fabricated in 32-nm partially depleted silicon on insulator CMOS. The…”
Get full text
Journal Article -
4
A Low-Complexity Message-Passing Algorithm for Reduced Routing Congestion in LDPC Decoders
Published in IEEE transactions on circuits and systems. I, Regular papers (01-05-2010)“…A low-complexity message-passing algorithm, called Split-Row Threshold, is used to implement low-density parity-check (LDPC) decoders with reduced layout…”
Get full text
Journal Article -
5
A Low-Area Multi-Link Interconnect Architecture for GALS Chip Multiprocessors
Published in IEEE transactions on very large scale integration (VLSI) systems (01-05-2010)“…A new inter-processor communication architecture for chip multiprocessors is proposed which has a low area cost, flexible routing capability, and supports…”
Get full text
Journal Article -
6
Achieving High-Performance On-Chip Networks With Shared-Buffer Routers
Published in IEEE transactions on very large scale integration (VLSI) systems (01-06-2014)“…On-chip routers typically have buffers dedicated to their input or output ports for temporarily storing packets in case contention occurs on output physical…”
Get full text
Journal Article -
7
Hybrid Hardware/Software Floating-Point Implementations for Optimized Area and Throughput Tradeoffs
Published in IEEE transactions on very large scale integration (VLSI) systems (01-01-2017)“…Hybrid floating-point (FP) implementations improve software FP performance without incurring the area overhead of full hardware FP units. The proposed…”
Get full text
Journal Article -
8
Processor Tile Shapes and Interconnect Topologies for Dense On-Chip Networks
Published in IEEE transactions on very large scale integration (VLSI) systems (01-06-2014)“…We propose two eight-neighbor, two five-nearest-neighbor, and three six-nearest-neighbor interconnection topologies for many-core processor arrays-three of…”
Get full text
Journal Article -
9
A 167-Processor Computational Platform in 65 nm CMOS
Published in IEEE journal of solid-state circuits (01-04-2009)“…A 167-processor computational platform consists of an array of simple programmable processors capable of per-processor dynamic supply voltage and clock…”
Get full text
Journal Article Conference Proceeding -
10
A 1080p H.264/AVC Baseline Residual Encoder for a Fine-Grained Many-Core System
Published in IEEE transactions on circuits and systems for video technology (01-07-2011)“…This paper presents a baseline residual encoder for H.264/AVC on a programmable fine-grained many-core processing array that utilizes no application-specific…”
Get full text
Journal Article -
11
A low-power, high-performance, 1024-point FFT processor
Published in IEEE journal of solid-state circuits (01-03-1999)“…This paper presents an energy-efficient, single-chip, 1024-point fast Fourier transform (FFT) processor. The 460000-transistor design has been fabricated in a…”
Get full text
Journal Article -
12
Time-Scalable Mapping for Circuit-Switched GALS Chip Multiprocessor Platforms
Published in IEEE transactions on computer-aided design of integrated circuits and systems (01-05-2014)“…We study the problem of mapping concurrent tasks of an application to cores of a chip multiprocessor that utilize circuit-switched interconnect and global…”
Get full text
Journal Article -
13
AsAP: An Asynchronous Array of Simple Processors
Published in IEEE journal of solid-state circuits (01-03-2008)“…An array of simple programmable processors is implemented in 0.18 mum CMOS and contains 36 asynchronously clocked independent processors. Each processor…”
Get full text
Journal Article -
14
High Performance, Energy Efficiency, and Scalability With GALS Chip Multiprocessors
Published in IEEE transactions on very large scale integration (VLSI) systems (01-01-2009)“…Chip multiprocessors with globally asynchronous locally synchronous (GALS) clocking styles are promising candidates for processing computationally-intensive…”
Get full text
Journal Article -
15
A Split-Decoding Message Passing Algorithm for Low Density Parity Check Decoders
Published in Journal of signal processing systems (01-12-2010)“…A Split decoding algorithm is proposed which divides each row of the parity check matrix into two or multiple nearly-independent simplified partitions. The…”
Get full text
Journal Article -
16
Area efficient backprojection computation with reduced floating-point word width for SAR image formation
Published in 2015 49th Asilomar Conference on Signals, Systems and Computers (01-11-2015)“…The widths of data words in digital processors have a direct impact on area in application-specific ICs (ASICs) and FPGAs. Circuit area impacts energy…”
Get full text
Conference Proceeding Journal Article -
17
A Scalable Dual-Clock FIFO for Data Transfers Between Arbitrary and Haltable Clock Domains
Published in IEEE transactions on very large scale integration (VLSI) systems (01-10-2007)“…A robust, scalable, and power efficient dual-clock first-input first-out (FIFO) architecture which is useful for transferring data between modules operating in…”
Get full text
Journal Article -
18
Optimizing power of many-core systems by exploiting dynamic voltage, frequency and core scaling
Published in 2015 IEEE 58th International Midwest Symposium on Circuits and Systems (MWSCAS) (01-08-2015)“…To address the well-known "power wall" issue, many-core processors with dynamic voltage and frequency scaling (DVFS) are widely investigated. To further…”
Get full text
Conference Proceeding -
19
Hybrid floating-point modules with low area overhead on a fine-grained processing core
Published in 2014 48th Asilomar Conference on Signals, Systems and Computers (01-11-2014)“…This paper proposes Hybrid Floating-Point Modules (HFPMs) as a method to improve software floating-point (FP) throughput without incurring the area overhead of…”
Get full text
Conference Proceeding -
20
Scalable hardware-based power management for many-core systems
Published in 2014 48th Asilomar Conference on Signals, Systems and Computers (01-11-2014)“…Due to high levels integration, the design of many-core systems becomes increasingly challenging. Runtime dynamic voltage and frequency scaling (DVFS) is an…”
Get full text
Conference Proceeding