Search Results - "Acun, Bilge"
-
1
Beyond Efficiency: Scaling AI Sustainably
Published in IEEE MICRO (01-09-2024)“…Barroso’s seminal contributions in energy-proportional warehouse-scale computing launched an era where modern data centers have become more energy efficient…”
Get full text
Journal Article -
2
Understanding Training Efficiency of Deep Learning Recommendation Models at Scale
Published in 2021 IEEE International Symposium on High-Performance Computer Architecture (HPCA) (01-02-2021)“…The use of GPUs has proliferated for machine learning workflows and is now considered mainstream for many deep learning models. Meanwhile, when training…”
Get full text
Conference Proceeding -
3
Datacenter-Scale Analysis and Optimization of GPU Machine Learning Workloads
Published in IEEE MICRO (01-09-2021)“…In this article, we present a system to collectively optimize efficiency in a very large scale deployment of GPU servers for machine learning workloads at…”
Get full text
Journal Article -
4
MAD-Max Beyond Single-Node: Enabling Large Machine Learning Model Acceleration on Distributed Systems
Published in 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA) (29-06-2024)“…Training and deploying large-scale machine learning models is time-consuming, requires significant distributed computing infrastructures, and incurs high…”
Get full text
Conference Proceeding -
5
Towards realizing the potential of malleable jobs
Published in 2014 21st International Conference on High Performance Computing (HiPC) (01-12-2014)“…Malleable jobs are those which can dynamically shrink or expand the number of processors on which they are executing at runtime in response to an external…”
Get full text
Conference Proceeding -
6
Power, Reliability, and Performance: One System to Rule them All
Published in Computer (Long Beach, Calif.) (01-10-2016)“…In a design based on the Charm++ parallel programming framework, an adaptive runtime system dynamically interacts with a datacenter's resource manager to…”
Get full text
Journal Article -
7
Power Aware Heterogeneous Node Assembly
Published in 2019 IEEE International Symposium on High Performance Computer Architecture (HPCA) (01-02-2019)Get full text
Conference Proceeding -
8
Support for Power Efficient Proactive Cooling Mechanisms
Published in 2017 IEEE 24th International Conference on High Performance Computing (HiPC) (01-12-2017)“…Increasing scale of data centers and the density of server nodes pose significant challenges in producing power and energy efficient cooling infrastructures…”
Get full text
Conference Proceeding -
9
Mitigating Variability in HPC Systems and Applications for Performance and Power Efficiency
Published 01-01-2017“…Power consumption and process variability are two important, interconnected, challenges of future generation large-scale High Performance Computing (HPC) data…”
Get full text
Dissertation -
10
SecNDP: Secure Near-Data Processing with Untrusted Memory
Published in 2022 IEEE International Symposium on High-Performance Computer Architecture (HPCA) (01-04-2022)“…Today's data-intensive applications increasingly suffer from significant performance bottlenecks due to the limited memory bandwidth of the classical von…”
Get full text
Conference Proceeding -
11
Thermal aware automated load balancing for HPC applications
Published in 2013 IEEE International Conference on Cluster Computing (CLUSTER) (01-09-2013)“…As we move towards the exascale era, power and energy have become major challenges. Some of the supercomputers draw more than 10 megawatts, leading to high…”
Get full text
Conference Proceeding -
12
Beyond Efficiency: Scaling AI Sustainably
Published 07-06-2024“…Barroso's seminal contributions in energy-proportional warehouse-scale computing launched an era where modern datacenters have become more energy efficient and…”
Get full text
Journal Article -
13
Fine-Grained Energy Efficiency Using Per-Core DVFS with an Adaptive Runtime System
Published in 2019 Tenth International Green and Sustainable Computing Conference (IGSC) (01-10-2019)“…Dynamic voltage and frequency scaling (DVFS) is a well-known technique to reduce the power and/or energy consumption of various applications. While most…”
Get full text
Conference Proceeding -
14
Parallel programming with migratable objects: charm++ in practice
Published in Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (16-11-2014)“…The advent of petascale computing has introduced new challenges (e.g. heterogeneity, system failure) for programming scalable parallel applications. Increased…”
Get full text
Conference Proceeding -
15
Unlocking the Potential of Renewable Energy Through Curtailment Prediction
Published 28-05-2024“…A significant fraction (5-15%) of renewable energy generated goes into waste in the grids around the world today due to oversupply issues and transmission…”
Get full text
Journal Article -
16
CHAI: Clustered Head Attention for Efficient LLM Inference
Published 12-03-2024“…Large Language Models (LLMs) with hundreds of billions of parameters have transformed the field of machine learning. However, serving these models at inference…”
Get full text
Journal Article -
17
Carbon Responder: Coordinating Demand Response for the Datacenter Fleet
Published 14-11-2023“…The increasing integration of renewable energy sources results in fluctuations in carbon intensity throughout the day. To mitigate their carbon footprint,…”
Get full text
Journal Article -
18
MAD Max Beyond Single-Node: Enabling Large Machine Learning Model Acceleration on Distributed Systems
Published 04-10-2023“…Training and deploying large-scale machine learning models is time-consuming, requires significant distributed computing infrastructures, and incurs high…”
Get full text
Journal Article -
19
TT-Rec: Tensor Train Compression for Deep Learning Recommendation Models
Published 25-01-2021“…The memory capacity of embedding tables in deep learning recommendation models (DLRMs) is increasing dramatically from tens of GBs to TBs across the industry…”
Get full text
Journal Article -
20
Generative AI Beyond LLMs: System Implications of Multi-Modal Generation
Published in 2024 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS) (05-05-2024)“…As the development of large-scale Generative AI models evolve beyond text (1D) generation to include image (2D) and video (3D) generation, processing spatial…”
Get full text
Conference Proceeding