Search Results - "Ahmad, Masab"
-
1
A performance predictor for implementation selection of parallelized static and temporal graph algorithms
Published in Concurrency and computation (25-01-2022)“…Task‐based execution of graph workloads allows various ordered and unordered implementations, with tasks representing dependencies between graph vertices and…”
Get full text
Journal Article -
2
Advancing the State-of-the-Art in Hardware Trojans Detection
Published in IEEE transactions on dependable and secure computing (01-01-2019)“…Over the past decade, Hardware Trojans (HTs) research community has made significant progress towards developing effective countermeasures for various types of…”
Get full text
Journal Article -
3
In-Hardware Moving Compute to Data Model to Accelerate Thread Synchronization on Large Multicores
Published in IEEE MICRO (01-01-2020)“…In this article, the moving computation to data model (MC2D) is proposed to accelerate thread synchronization by pinning shared data to dedicated cores, and…”
Get full text
Journal Article -
4
Efficient Situational Scheduling of Graph Workloads on Single-Chip Multicores and GPUs
Published in IEEE MICRO (01-01-2017)“…Situational dynamic changes in graph analytic algorithm implementations give rise to efficiency challenges in concurrent hardware, such as GPUs and large-scale…”
Get full text
Journal Article -
5
GraphTuner: An Input Dependence Aware Loop Perforation Scheme for Efficient Execution of Approximated Graph Algorithms
Published in 2017 IEEE International Conference on Computer Design (ICCD) (01-11-2017)“…Graph algorithms have gained popularity and are utilized in high performance and mobile computing paradigms. Input dependence due to input graph changes leads…”
Get full text
Conference Proceeding -
6
Accelerating Graph Processing on Large-Scale Multicores
Published 01-01-2019“…With the ever-increasing amount of data and input variations, portable performance is becoming harder to exploit on today’s architectures. Computational setups…”
Get full text
Dissertation -
7
POSTER: Exploiting Multi-Level Task Dependencies to Prune Redundant Work in Relax-Ordered Task-Parallel Algorithms
Published in 2019 28th International Conference on Parallel Architectures and Compilation Techniques (PACT) (01-09-2019)“…Work-efficient task-parallel algorithms enforce ordering between tasks using queuing primitives. Such algorithms offer limited parallelism due to queuing…”
Get full text
Conference Proceeding -
8
Understanding Concurrency for Graph Workloads in Large Scale Multicores
Published 01-01-2016“…Algorithms operating on a graph setting are known to be highly irregular and un- structured. This leads to workload imbalance and data locality challenge when…”
Get full text
Dissertation -
9
Accelerating Synchronization in Graph Analytics Using Moving Compute to Data Model on Tilera TILE-Gx72
Published in 2018 IEEE 36th International Conference on Computer Design (ICCD) (01-10-2018)“…The shared memory cache coherence paradigm is prevalent in modern multicores. However, as the number of cores increases, synchronization between threads limits…”
Get full text
Conference Proceeding -
10
Power & throughput optimized lifting architecture for Wavelet Packet Transform
Published in 2014 IEEE International Symposium on Circuits and Systems (ISCAS) (01-06-2014)“…This paper presents area-power efficient architectures for the lifting based Wavelet Packet Transform (WPT). Using Daubechies 6 as an example, three different…”
Get full text
Conference Proceeding -
11
CRONO: A Benchmark Suite for Multithreaded Graph Algorithms Executing on Futuristic Multicores
Published in 2015 IEEE International Symposium on Workload Characterization (01-10-2015)“…Algorithms operating on a graph setting are known to be highly irregular and unstructured. This leads to workload imbalance and data locality challenge when…”
Get full text
Conference Proceeding -
12
GPU concurrency choices in graph analytics
Published in 2016 IEEE International Symposium on Workload Characterization (IISWC) (01-09-2016)“…Graph analytics is becoming ever more ubiquitous in today's world. However, situational dynamic changes in input graphs, such as changes in traffic and weather…”
Get full text
Conference Proceeding -
13
HeteroMap: A Runtime Performance Predictor for Efficient Processing of Graph Analytics on Heterogeneous Multi-Accelerators
Published in 2019 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS) (01-03-2019)“…With the ever-increasing amount of data and input variations, portable performance is becoming harder to exploit on today's architectures. Computational setups…”
Get full text
Conference Proceeding -
14
Accelerating Graph and Machine Learning Workloads Using a Shared Memory Multicore Architecture with Auxiliary Support for In-hardware Explicit Messaging
Published in 2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS) (01-05-2017)“…Shared Memory stands out as a sine qua non for parallel programming of many commercial and emerging multicore processors. It optimizes patterns of…”
Get full text
Conference Proceeding -
15
Efficient parallelization of path planning workload on single-chip shared-memory multicores
Published in 2015 IEEE High Performance Extreme Computing Conference (HPEC) (01-09-2015)“…Path planning problems greatly arise in many applications where the objective is to find the shortest path from a given source to destination. In this paper,…”
Get full text
Conference Proceeding -
16
Software-Hardware Managed Last-level Cache Allocation Scheme for Large-Scale NVRAM-Based Multicores Executing Parallel Data Analytics Applications
Published in 2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS) (01-05-2018)“…Developments in machine learning and graph analytics have seen these fields establish themselves as pervasive in a wide range of applications. Non-volatile…”
Get full text
Conference Proceeding -
17
M-MAP: Multi-factor memory authentication for secure embedded processors
Published in 2015 33rd IEEE International Conference on Computer Design (ICCD) (01-10-2015)“…The challenges faced in securing embedded computing systems against multifaceted memory safety vulnerabilities have prompted great interest in the development…”
Get full text
Conference Proceeding