Search Results - "Goutis, Costas"
-
1
A methodology for speeding up matrix vector multiplication for single/multi-core architectures
Published in The Journal of supercomputing (01-07-2015)“…In this paper, a new methodology for computing the Dense Matrix Vector Multiplication, for both embedded (processors without SIMD unit) and general purpose…”
Get full text
Journal Article -
2
Design and implementation of totally-self checking SHA-1 and SHA-256 hash functions’ architectures
Published in Microprocessors and microsystems (01-09-2016)“…•Totally Self-Checking (TSC) design for widely used hash function: SHA-1•Totally Self-Checking (TSC) design for widely used hash function: SHA-256•Appropriate…”
Get full text
Journal Article -
3
A Methodology for Speeding Up Fast Fourier Transform Focusing on Memory Architecture Utilization
Published in IEEE transactions on signal processing (01-12-2011)“…Several SOA (state of the art) self-tuning software libraries exist, such as the Fastest Fourier Transform in the West (FFTW) for fast Fourier transform (FFT)…”
Get full text
Journal Article -
4
A Methodology for Speeding up MVM for Regular, Toeplitz and Bisymmetric Toeplitz Matrices
Published in Journal of signal processing systems (01-12-2014)“…The Matrix Vector Multiplication algorithm is an important kernel in most varied domains and application areas and the performance of its implementations…”
Get full text
Journal Article -
5
Resource aware mapping on coarse grained reconfigurable arrays
Published in Microprocessors and microsystems (01-03-2009)“…Coarse grain reconfigurable array architectures have become increasingly popular due to their flexibility, scalability and performance. However, the mapping of…”
Get full text
Journal Article -
6
Efficient high-performance implementation of JPEG-LS encoder
Published in Journal of real-time image processing (01-12-2008)“…A new design approach to create an efficient high-performance JPEG-LS encoder is proposed in this paper. The proposed implementation compresses the image data…”
Get full text
Journal Article -
7
Evaluation of design alternatives for the 2-D-discrete wavelet transform
Published in IEEE transactions on circuits and systems for video technology (01-12-2001)“…In this paper, the three main hardware architectures for the 2-D discrete wavelet transform (2-D-DWT) are reviewed. Also, optimization techniques applicable to…”
Get full text
Journal Article -
8
A Top-Down Design Methodology for Ultrahigh-Performance Hashing Cores
Published in IEEE transactions on dependable and secure computing (01-10-2009)“…Many cryptographic primitives that are used in cryptographic schemes and security protocols such as SET, PKI, IPSec, and VPNs utilize hash functions, which…”
Get full text
Journal Article -
9
Compiler assisted architectural exploration framework for coarse grained reconfigurable arrays
Published in The Journal of supercomputing (01-05-2009)“…Coarse Grain Reconfigurable Array (CGRA) architectures have been extensively used for accelerating time consuming loops. The design of such systems requires…”
Get full text
Journal Article -
10
High-speed FPGA implementation of secure hash algorithm for IPSec and VPN applications
Published in The Journal of supercomputing (01-08-2006)Get full text
Journal Article -
11
Design space exploration of an optimized compiler approach for a generic reconfigurable array architecture
Published in The Journal of supercomputing (01-05-2007)“…Several mesh-like coarse-grained reconfigurable architectures have been devised in the last few years accompanied with their corresponding mapping flows. One…”
Get full text
Journal Article -
12
Decoupled Processors Architecture for Accelerating Data Intensive Applications using Scratch-Pad Memory Hierarchy
Published in Journal of signal processing systems (01-06-2010)“…We present an architecture of decoupled processors with a memory hierarchy consisting only of scratch-pad memories, and a main memory. This architecture…”
Get full text
Journal Article -
13
Speedups from extending embedded processors with a high-performance coarse-grained reconfigurable data-path
Published in Journal of systems architecture (01-05-2008)“…In this paper, an embedded system that extends microprocessor cores with a high-performance coarse-grained reconfigurable data-path is introduced. The…”
Get full text
Journal Article -
14
Performance and Energy Consumption Improvements in Microprocessor Systems Utilizing a Coprocessor Data-Path
Published in Journal of signal processing systems (01-02-2008)“…The speedups and the energy reductions achieved in a generic single-chip microprocessor system by employing a high-performance data-path are presented. The…”
Get full text
Journal Article -
15
Partitioning methodology for heterogeneous reconfigurable functional units
Published in The Journal of supercomputing (01-10-2006)Get full text
Journal Article -
16
Speedups and Energy Reductions From Mapping DSP Applications on an Embedded Reconfigurable System
Published in IEEE transactions on very large scale integration (VLSI) systems (01-12-2007)“…This paper presents performance improvements and energy savings from mapping real-world benchmarks on an embedded single-chip platform that includes…”
Get full text
Journal Article -
17
Exploring the speedups of embedded microprocessor systems utilizing a high-performance coprocessor data-path
Published in The Journal of supercomputing (01-03-2007)Get full text
Journal Article -
18
Performance improvements from partitioning applications to FPGA hardware in embedded SoCs
Published in The Journal of supercomputing (01-02-2006)Get full text
Journal Article -
19
Memory accesses reordering for interconnect power reduction in sum-of-products computations
Published in IEEE transactions on signal processing (01-11-2002)“…Techniques for interconnect power consumption reduction in realizations of sum-of-products computations are presented. The proposed techniques reorder the…”
Get full text
Journal Article -
20
A method for partitioning applications in hybrid reconfigurable architectures
Published in Design automation for embedded systems (01-03-2005)“…In this paper, we propose a methodology for accelerating application segments by partitioning them between reconfigurable hardware blocks of different…”
Get full text
Journal Article