Search Results - "Moses, William S."
-
1
Polygeist: Raising C to Polyhedral MLIR
Published in 2021 30th International Conference on Parallel Architectures and Compilation Techniques (PACT) (01-09-2021)“…We present Polygeist, a new compilation flow that connects the MLIR compiler infrastructure to cutting edge polyhedral optimization tools. It consists of a C…”
Get full text
Conference Proceeding -
2
Retargeting and Respecializing GPU Workloads for Performance Portability
Published in 2024 IEEE/ACM International Symposium on Code Generation and Optimization (CGO) (02-03-2024)“…In order to come close to peak performance, accelerators like GPUs require significant architecture-specific tuning that understand the availability of shared…”
Get full text
Conference Proceeding -
3
Enabling Transformers to Understand Low-Level Programs
Published in 2022 IEEE High Performance Extreme Computing Conference (HPEC) (19-09-2022)“…Unlike prior approaches to machine learning, Transformer models can first be trained on a large corpus of unlabeled data with a generic objective and then on a…”
Get full text
Conference Proceeding -
4
Reverse-Mode Automatic Differentiation and Optimization of GPU Kernels via Enzyme
Published in SC21: International Conference for High Performance Computing, Networking, Storage and Analysis (14-11-2021)“…Computing derivatives is key to many algorithms in scientific computing and machine learning such as optimization, uncertainty quantification, and stability…”
Get full text
Conference Proceeding -
5
Scalable Automatic Differentiation of Multiple Parallel Paradigms through Compiler Augmentation
Published in SC22: International Conference for High Performance Computing, Networking, Storage and Analysis (01-11-2022)“…Derivatives are key to numerous science, engineering, and machine learning applications. While existing tools generate derivatives of programs in a single…”
Get full text
Conference Proceeding -
6
The Quantum Tortoise and the Classical Hare: A simple framework for understanding which problems quantum computing will accelerate (and which it will not)
Published 24-10-2023“…Quantum computing promises transformational gains for solving some problems, but little to none for others. For anyone hoping to use quantum computers now or…”
Get full text
Journal Article -
7
Instead of Rewriting Foreign Code for Machine Learning, Automatically Synthesize Fast Gradients
Published 04-10-2020“…Applying differentiable programming techniques and machine learning algorithms to foreign programs requires developers to either rewrite their code in a…”
Get full text
Journal Article -
8
Extracting Incentives from Black-Box Decisions
Published 12-10-2019“…An algorithmic decision-maker incentivizes people to act in certain ways to receive better decisions. These incentives can dramatically influence subjects'…”
Get full text
Journal Article -
9
The MLIR Transform Dialect. Your compiler is more powerful than you think
Published 05-09-2024“…To take full advantage of a specific hardware target, performance engineers need to gain control on compilers in order to leverage their domain knowledge about…”
Get full text
Journal Article -
10
Input-Gen: Guided Generation of Stateful Inputs for Testing, Tuning, and Training
Published 13-06-2024“…The size and complexity of software applications is increasing at an accelerating pace. Source code repositories (along with their dependencies) require vast…”
Get full text
Journal Article -
11
High-Performance GPU-to-CPU Transpilation and Optimization via High-Level Parallel Constructs
Published 01-07-2022“…While parallelism remains the main source of performance, architectural implementations and programming models change with each new hardware generation, often…”
Get full text
Journal Article -
12
Computational Complexity of Arranging Music
Published 14-07-2016“…This paper proves that arrangement of music is NP-hard when subject to various constraints: avoiding musical dissonance, limiting how many notes can be played…”
Get full text
Journal Article -
13
Tensor Comprehensions: Framework-Agnostic High-Performance Machine Learning Abstractions
Published 13-02-2018“…Deep learning models with convolutional and recurrent networks are now ubiquitous and analyze massive amounts of audio, image, video, text and graph data, with…”
Get full text
Journal Article