Search Results - "Hur, Ibrahim"
-
1
Memory Prefetching Using Adaptive Stream Detection
Published in 2006 39th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO'06) (09-12-2006)“…We present Adaptive Stream Detection, a simple technique for modulating the aggressiveness of a stream prefetcher to match a workload's observed spatial…”
Get full text
Conference Proceeding -
2
Adaptive History-Based Memory Schedulers
Published in 37th International Symposium on Microarchitecture (MICRO-37'04) (04-12-2004)“…As memory performance becomes increasingly important to overall system performance, the need to carefully schedule memory operations also increases. This paper…”
Get full text
Conference Proceeding -
3
Modeling DRAM Timing in Parallel Simulators With Immediate-Response Memory Model
Published in IEEE computer architecture letters (01-07-2021)“…Accurately modeling memory timing in a processor simulator is crucial for obtaining accurate and useful performance predictions. DRAM has a complex timing and…”
Get full text
Journal Article -
4
HotTiles: Accelerating SpMM with Heterogeneous Accelerator Architectures
Published in 2024 IEEE International Symposium on High-Performance Computer Architecture (HPCA) (02-03-2024)“…Sparse Matrix Dense Matrix Multiplication (SpMM) is an important kernel with application across a wide range of domains, including machine learning and linear…”
Get full text
Conference Proceeding -
5
The Intel® Programmable and Integrated Unified Memory Architecture (PIUMA) Graph Analytics Processor
Published in IEEE MICRO (01-09-2023)“…High performance large scale graph analytics are essential to timely analyze relationships in big data sets. Conventional processor architectures suffer from…”
Get full text
Journal Article -
6
RIO: ROB-Centric In-Order Modeling of Out-of-Order Processors
Published in IEEE computer architecture letters (01-01-2021)“…Architectural studies of the cache and memory hierarchy need a fast simulation model for the processor core that accurately conveys the impact of memory…”
Get full text
Journal Article -
7
Breaking In-Order Branch Miss Recovery
Published in IEEE computer architecture letters (01-01-2020)“…Despite very accurate branch predictors, branch misses remain an important source of performance limiters, especially for irregular applications. To ensure…”
Get full text
Journal Article -
8
Multi-Stage CPI Stacks
Published in IEEE computer architecture letters (01-01-2018)“…CPI stacks are an intuitive way to visualize processor core performance bottlenecks. However, they often do not provide a full view on all bottlenecks, because…”
Get full text
Journal Article -
9
A comprehensive approach to DRAM power management
Published in 2008 IEEE 14th International Symposium on High Performance Computer Architecture (01-02-2008)“…This paper describes a comprehensive approach for using the memory controller to improve DRAM energy efficiency and manage DRAM power. We make three…”
Get full text
Conference Proceeding -
10
Efficient Asynchronous RPC Calls for Microservices: DeathStarBench Study
Published 27-09-2022“…Crucial in the performance of microservice applications is the efficient handling of RPC calls. We found that the asynchronous call implementation in a popular…”
Get full text
Journal Article -
11
Undersubscribed threading on clustered cache architectures
Published in 2014 IEEE 20th International Symposium on High Performance Computer Architecture (HPCA) (01-02-2014)“…Recent many-core processors such as Intel's Xeon Phi and GPGPUs specialize in running highly scalable parallel applications at high performance while…”
Get full text
Conference Proceeding -
12
Power-aware multi-core simulation for early design stage hardware/software co-optimization
Published in 2012 21st International Conference on Parallel Architectures and Compilation Techniques (PACT) (01-09-2012)“…Stringent performance targets and power constraints push designers towards building specialized workload-optimized systems across a broad spectrum of the…”
Get full text
Conference Proceeding -
13
DRAM Bandwidth and Latency Stacks: Visualizing DRAM Bottlenecks
Published in 2022 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS) (01-05-2022)“…For memory-bound applications, memory bandwidth utilization and memory access latency determine performance. DRAM specifications mention the maximum peak…”
Get full text
Conference Proceeding -
14
Accurate and Scalable Many-Node Simulation
Published 18-01-2024“…Accurate performance estimation of future many-node machines is challenging because it requires detailed simulation models of both node and network. However,…”
Get full text
Journal Article -
15
Resource-bounded multicore emulation using Beefarm
Published in Microprocessors and microsystems (01-11-2012)“…In this article, we present the Beefarm infrastructure for FPGA-based multiprocessor emulation, a popular research topic of the last few years both in FPGA and…”
Get full text
Journal Article -
16
Hash Table Scalability on Intel PIUMA
Published in 2020 IEEE High Performance Extreme Computing Conference (HPEC) (22-09-2020)“…The Intel PIUMA (Programmable and Integrated Unified Memory Architecture) is a scalable, massively multithreaded architecture designed to operate on…”
Get full text
Conference Proceeding -
17
Profiling and Optimizing Transactional Memory Applications
Published in International journal of parallel programming (01-02-2012)“…Many researchers have developed applications using transactional memory (TM) with the purpose of benchmarking different implementations, and studying whether…”
Get full text
Journal Article -
18
Discovering and understanding performance bottlenecks in transactional applications
Published in 2010 19th International Conference on Parallel Architectures and Compilation Techniques (PACT) (01-09-2010)“…Many researchers have developed applications using transactional memory (TM) with the purpose of benchmarking different implementations, and studying whether…”
Get full text
Conference Proceeding -
19
Simulating Wrong-Path Instructions in Decoupled Functional-First Simulation
Published in 2023 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS) (01-04-2023)“…Wrong-path speculative execution on an out-oforder processor core has no impact on an application's functionality and correctness, but it can impact…”
Get full text
Conference Proceeding -
20
Feedback mechanisms for improving probabilistic memory prefetching
Published in 2009 IEEE 15th International Symposium on High Performance Computer Architecture (01-02-2009)“…This paper presents three techniques for improving the effectiveness of the recently proposed adaptive stream detection (ASD) prefetching mechanism. The ASD…”
Get full text
Conference Proceeding