Search Results - "Tullsen, Dean M."
-
1
Context-Sensitive Decoding: On-Demand Microcode Customization for Security and Energy Management
Published in IEEE MICRO (01-05-2019)“…Modern instruction set decoders feature translation of native instructions into internal micro-ops to simplify the CPU design and improve instruction-level…”
Get full text
Journal Article -
2
Redefining the Role of the CPU in the Era of CPU-GPU Integration
Published in IEEE MICRO (01-11-2012)“…We've seen the quick adoption of GPUs as general-purpose computing engines in recent years, fueled by high computational throughput and energy efficiency…”
Get full text
Journal Article -
3
I See Dead µops: Leaking Secrets via Intel/AMD Micro-Op Caches
Published in 2021 ACM/IEEE 48th Annual International Symposium on Computer Architecture (ISCA) (01-06-2021)“…Modern Intel, AMD, and ARM processors translate complex instructions into simpler internal micro-ops that are then cached in a dedicated on-chip structure…”
Get full text
Conference Proceeding -
4
Heterogeneous chip multiprocessors
Published in Computer (Long Beach, Calif.) (01-11-2005)“…Heterogeneous (or asymmetric) chip multiprocessors present unique opportunities for improving system throughput, reducing processor power, and mitigating…”
Get full text
Journal Article -
5
MemPod: A Clustered Architecture for Efficient and Scalable Migration in Flat Address Space Multi-level Memories
Published in 2017 IEEE International Symposium on High Performance Computer Architecture (HPCA) (01-02-2017)“…In the near future, die-stacked DRAM will be increasingly present in conjunction with off-chip memories in hybrid memory systems. Research on this subject…”
Get full text
Conference Proceeding -
6
The case for colocation of high performance computing workloads
Published in Concurrency and computation (01-02-2016)“…Summary The current state of practice in supercomputer resource allocation places jobs from different users on disjoint nodes both in terms of time and space…”
Get full text
Journal Article -
7
Understanding idle behavior and power gating mechanisms in the context of modern benchmarks on CPU-GPU Integrated systems
Published in 2015 IEEE 21st International Symposium on High Performance Computer Architecture (HPCA) (01-02-2015)“…Overall energy consumption In modern computing systems Is significantly Impacted by Idle power. Power gating, also known as C6, Is an effective mechanism to…”
Get full text
Conference Proceeding -
8
Single-ISA Heterogeneous Multi-Core Architectures for Multithreaded Workload Performance
Published in International Symposium on Computer Architecture: Proceedings of the 31st annual international symposium on Computer architecture : Munchen, Germany; 19-23 June 2004 (02-03-2004)“…A single-ISA heterogeneous multi-core architecture is achip multiprocessor composed of cores of varying size, performance,and complexity. This paper…”
Get full text
Conference Proceeding -
9
Underclocked Software Prefetching: More Cores, Less Energy
Published in IEEE MICRO (01-07-2012)“…Power consumption is a concern for helper-thread prefetching that uses extra cores to speed up the single-thread execution, because power consumption increases…”
Get full text
Journal Article -
10
Eliminating Redundant Computation and Exposing Parallelism through Data-Triggered Threads
Published in IEEE MICRO (01-05-2012)“…Unlike threads in parallel programs created by conventional programming, data-triggered threads are initiated when a memory value is changed. By expressing…”
Get full text
Journal Article -
11
Handling long-latency loads in a simultaneous multithreading processor
Published in Proceedings of the annual International Symposium on Microarchitecture (2001)“…Simultaneous multithreading architectures have been defined previously with fully shared execution resources. When one thread in such an architecture…”
Get full text
Conference Proceeding Journal Article -
12
Resistive Computation: A Critique
Published in IEEE computer architecture letters (01-07-2014)“…Resistive Computation was suggested by [6] as an idea for tacking the power wall by replacing conventional CMOS logic with Magnetic Tunnel Junction (MTJ) based…”
Get full text
Journal Article -
13
Dynamic speculative precomputation
Published in Proceedings of the annual International Symposium on Microarchitecture (2001)“…A large number of memory accesses in memory-bound applications are irregular, such as pointer dereferences, and can be effectively targeted by thread-based…”
Get full text
Conference Proceeding Journal Article -
14
Editorial: Special Section on CMP Architectures
Published in IEEE transactions on parallel and distributed systems (01-08-2007)Get full text
Journal Article -
15
Inter-socket victim cacheing for platform power reduction
Published in 2010 IEEE International Conference on Computer Design (01-10-2010)“…On a multi-socket architecture with load below peak, as is often the case in a server installation, it is common to consolidate load onto fewer sockets to save…”
Get full text
Conference Proceeding -
16
A Tree Based Router Search Engine Architecture with Single Port Memories
Published in 32nd International Symposium on Computer Architecture (ISCA'05) (01-05-2005)“…Pipelined forwarding engines are used in core routers to meet speed demands. Tree-based searches are pipelined across a number of stages to achieve high…”
Get full text
Conference Proceeding -
17
An event-driven multithreaded dynamic optimization framework
Published in 14th International Conference on Parallel Architectures and Compilation Techniques (PACT'05) (2005)“…Dynamic optimization has the potential to adapt the program's behavior at run-time to deliver performance improvements over static optimization. Dynamic…”
Get full text
Conference Proceeding -
18
Conjoined-Core Chip Multiprocessing
Published in 37th International Symposium on Microarchitecture (MICRO-37'04) (04-12-2004)“…Chip Multiprocessors (CMP) and Simultaneous Multi-threading (SMT) are two approaches that have been proposed to increase processor efficiency. We believe these…”
Get full text
Conference Proceeding -
19
Harnessing ISA diversity: Design of a heterogeneous-ISA chip multiprocessor
Published in 2014 ACM/IEEE 41st International Symposium on Computer Architecture (ISCA) (01-06-2014)“…Heterogeneous multicore architectures have the potential for high performance and energy efficiency. These architectures may be composed of small…”
Get full text
Conference Proceeding -
20
Simultaneous multithreading: maximizing on-chip parallelism
Published in International Symposium on Computer Architecture: Proceedings of the 22nd annual international symposium on Computer architecture; 22-24 June 1995 (01-05-1995)“…This paper examines simultaneous multithreading, a technique permitting several independent threads to issue instructions to a superscalar's multiple…”
Get full text
Conference Proceeding