Search Results - "Patel, Pratyush"
-
1
A stress-induced source of phonon bursts and quasiparticle poisoning
Published in Nature communications (31-07-2024)“…The performance of superconducting qubits is degraded by a poorly characterized set of energy sources breaking the Cooper pairs responsible for…”
Get full text
Journal Article -
2
A server-based approach for predictable GPU access control
Published in 2017 IEEE 23rd International Conference on Embedded and Real-Time Computing Systems and Applications (RTCSA) (01-08-2017)“…We propose a server-based approach to manage a general-purpose graphics processing unit (GPU) in a predictable and efficient manner. Our proposed approach…”
Get full text
Conference Proceeding -
3
Towards Improved Power Management in Cloud GPUs
Published in IEEE computer architecture letters (01-07-2023)“…As modern server GPUs are increasingly power intensive, better power management mechanisms can significantly reduce the power consumption, capital costs, and…”
Get full text
Journal Article -
4
Splitwise: Efficient Generative LLM Inference Using Phase Splitting
Published in 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA) (29-06-2024)“…Generative large language model (LLM) applications are growing rapidly, leading to large-scale deployments of expensive and power-hungry GPUs. Our…”
Get full text
Conference Proceeding -
5
A server-based approach for predictable GPU access with improved analysis
Published in Journal of systems architecture (01-08-2018)“…We propose a server-based approach to manage a general-purpose graphics processing unit (GPU) in a predictable and efficient manner. Our proposed approach…”
Get full text
Journal Article -
6
Analytical Enhancements and Practical Insights for MPCP with Self-Suspensions
Published in 2018 IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS) (01-04-2018)“…Hardware accelerators such as GP-GPUs and DSPs are being increasingly used in computationally-intensive real-time and multimedia systems. System efficiency is…”
Get full text
Conference Proceeding -
7
TimerShield: Protecting High-Priority Tasks from Low-Priority Timer Interference (Outstanding Paper)
Published in 2017 IEEE Real-Time and Embedded Technology and Applications Symposium (RTAS) (01-04-2017)“…Timer interference arises when a high-priority realtime task is delayed by a timer interrupt that is intended for a lower-priority task. We demonstrate that…”
Get full text
Conference Proceeding -
8
The Virtual Block Interface: A Flexible Alternative to the Conventional Virtual Memory Framework
Published in 2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture (ISCA) (01-05-2020)“…Computers continue to diversify with respect to system designs, emerging memory technologies, and application memory demands. Unfortunately, continually…”
Get full text
Conference Proceeding -
9
Input-Dependent Power Usage in GPUs
Published 26-09-2024“…GPUs are known to be power-hungry, and due to the boom in artificial intelligence, they are currently the major contributors to the high power demands of…”
Get full text
Journal Article -
10
Splitwise: Efficient generative LLM inference using phase splitting
Published 30-11-2023“…Recent innovations in generative large language models (LLMs) have made their applications and use-cases ubiquitous. This has led to large-scale deployments of…”
Get full text
Journal Article -
11
POLCA: Power Oversubscription in LLM Cloud Providers
Published 24-08-2023“…Recent innovation in large language models (LLMs), and their myriad use-cases have rapidly driven up the compute capacity demand for datacenter GPUs. Several…”
Get full text
Journal Article -
12
Hybrid Computing for Interactive Datacenter Applications
Published 10-04-2023“…Field-Programmable Gate Arrays (FPGAs) are more energy efficient and cost effective than CPUs for a wide variety of datacenter applications. Yet, for…”
Get full text
Journal Article -
13
Low Energy Backgrounds and Excess Noise in a Two-Channel Low-Threshold Calorimeter
Published 21-10-2024“…We describe observations of low energy excess (LEE) events (background events observed in all light dark matter direct detection calorimeters) and noise in a…”
Get full text
Journal Article -
14
A Stress Induced Source of Phonon Bursts and Quasiparticle Poisoning
Published 14-08-2024“…Nat. Commun. 15, 6444 (2024) The performance of superconducting qubits is degraded by a poorly characterized set of energy sources breaking the Cooper pairs…”
Get full text
Journal Article -
15
A Server-based Approach for Predictable GPU Access with Improved Analysis
Published 19-09-2017“…We propose a server-based approach to manage a general-purpose graphics processing unit (GPU) in a predictable and efficient manner. Our proposed approach…”
Get full text
Journal Article -
16
The Virtual Block Interface: A Flexible Alternative to the Conventional Virtual Memory Framework
Published 19-05-2020“…Computers continue to diversify with respect to system designs, emerging memory technologies, and application memory demands. Unfortunately, continually…”
Get full text
Journal Article