Search Results - "Patel, Pratyush"

  • Showing 1 - 16 of 16 results
  1. 1
  2. 2

    A server-based approach for predictable GPU access control by Kim, Hyoseung, Patel, Pratyush, Wang, Shige, Rajkumar, Ragunathan (Raj)

    “…We propose a server-based approach to manage a general-purpose graphics processing unit (GPU) in a predictable and efficient manner. Our proposed approach…”
    Conference Proceeding
  3. 3

    Towards Improved Power Management in Cloud GPUs by Patel, Pratyush, Gong, Zibo, Rizvi, Syeda, Choukse, Esha, Misra, Pulkit, Anderson, Tom, Sriraman, Akshitha

    Published in IEEE computer architecture letters (01-07-2023)
    “…As modern server GPUs are increasingly power intensive, better power management mechanisms can significantly reduce the power consumption, capital costs, and…”
    Journal Article
  4. 4

    Splitwise: Efficient Generative LLM Inference Using Phase Splitting by Patel, Pratyush, Choukse, Esha, Zhang, Chaojie, Shah, Aashaka, Goiri, Inigo, Maleki, Saeed, Bianchini, Ricardo

    “…Generative large language model (LLM) applications are growing rapidly, leading to large-scale deployments of expensive and power-hungry GPUs. Our…”
    Conference Proceeding
  5. 5

    A server-based approach for predictable GPU access with improved analysis by Kim, Hyoseung, Patel, Pratyush, Wang, Shige, Rajkumar, Ragunathan (Raj)

    Published in Journal of systems architecture (01-08-2018)
    “…We propose a server-based approach to manage a general-purpose graphics processing unit (GPU) in a predictable and efficient manner. Our proposed approach…”
    Journal Article
  6. 6

    Analytical Enhancements and Practical Insights for MPCP with Self-Suspensions by Patel, Pratyush, Baek, Iljoo, Kim, Hyoseung, Rajkumar, Ragunathan

    “…Hardware accelerators such as GP-GPUs and DSPs are being increasingly used in computationally-intensive real-time and multimedia systems. System efficiency is…”
    Conference Proceeding
  7. 7

    TimerShield: Protecting High-Priority Tasks from Low-Priority Timer Interference (Outstanding Paper) by Patel, Pratyush, Vanga, Manohar, Brandenburg, Bjorn B.

    “…Timer interference arises when a high-priority realtime task is delayed by a timer interrupt that is intended for a lower-priority task. We demonstrate that…”
    Conference Proceeding
  8. 8
  9. 9

    Input-Dependent Power Usage in GPUs by Gregersen, Theo, Patel, Pratyush, Choukse, Esha

    Published 26-09-2024
    “…GPUs are known to be power-hungry, and due to the boom in artificial intelligence, they are currently the major contributors to the high power demands of…”
    Journal Article
  10. 10

    Splitwise: Efficient generative LLM inference using phase splitting by Patel, Pratyush, Choukse, Esha, Zhang, Chaojie, Shah, Aashaka, Goiri, Íñigo, Maleki, Saeed, Bianchini, Ricardo

    Published 30-11-2023
    “…Recent innovations in generative large language models (LLMs) have made their applications and use-cases ubiquitous. This has led to large-scale deployments of…”
    Journal Article
  11. 11

    POLCA: Power Oversubscription in LLM Cloud Providers by Patel, Pratyush, Choukse, Esha, Zhang, Chaojie, Goiri, Íñigo, Warrier, Brijesh, Mahalingam, Nithish, Bianchini, Ricardo

    Published 24-08-2023
    “…Recent innovation in large language models (LLMs), and their myriad use-cases have rapidly driven up the compute capacity demand for datacenter GPUs. Several…”
    Journal Article
  12. 12

    Hybrid Computing for Interactive Datacenter Applications by Patel, Pratyush, Lim, Katie, Jhunjhunwalla, Kushal, Martinez, Ashlie, Demoulin, Max, Nelson, Jacob, Zhang, Irene, Anderson, Thomas

    Published 10-04-2023
    “…Field-Programmable Gate Arrays (FPGAs) are more energy efficient and cost effective than CPUs for a wide variety of datacenter applications. Yet, for…”
    Journal Article
  13. 13
  14. 14
  15. 15

    A Server-based Approach for Predictable GPU Access with Improved Analysis by Kim, Hyoseung, Patel, Pratyush, Wang, Shige, Rajkumar, Ragunathan

    Published 19-09-2017
    “…We propose a server-based approach to manage a general-purpose graphics processing unit (GPU) in a predictable and efficient manner. Our proposed approach…”
    Journal Article
  16. 16

    The Virtual Block Interface: A Flexible Alternative to the Conventional Virtual Memory Framework by Hajinazar, Nastaran, Patel, Pratyush, Patel, Minesh, Kanellopoulos, Konstantinos, Ghose, Saugata, Ausavarungnirun, Rachata, Oliveira Jr., Geraldo Francisco de, Appavoo, Jonathan, Seshadri, Vivek, Mutlu, Onur

    Published 19-05-2020
    “…Computers continue to diversify with respect to system designs, emerging memory technologies, and application memory demands. Unfortunately, continually…”
    Journal Article