Search Results - "Jacobs, Sam"
-
1
Scalable Topological Data Analysis and Visualization for Evaluating Data-Driven Models in Scientific Applications
Published in IEEE transactions on visualization and computer graphics (01-01-2020)“…With the rapid adoption of machine learning techniques for large-scale applications in science and engineering comes the convergence of two grand challenges in…”
Get full text
Journal Article -
2
-
3
Enabling machine learning-ready HPC ensembles with Merlin
Published in Future generation computer systems (01-06-2022)“…With the growing complexity of computational and experimental facilities, many scientific researchers are turning to machine learning (ML) techniques to…”
Get full text
Journal Article -
4
Enabling rapid COVID-19 small molecule drug design through scalable deep learning of generative models
Published in The international journal of high performance computing applications (01-09-2021)“…We improved the quality and reduced the time to produce machine learned models for use in small molecule antiviral design. Our globally asynchronous…”
Get full text
Journal Article -
5
Scalable Topological Data Analysis and Visualization for Evaluating Data-Driven Models in Scientific Applications
Published in IEEE transactions on visualization and computer graphics (30-08-2019)“…With the rapid adoption of machine learning techniques for large-scale applications in science and engineering comes the convergence of two grand challenges in…”
Get full text
Journal Article -
6
Adaptive neighbor connection for PRMs: A natural fit for heterogeneous environments and parallelism
Published in 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems (01-11-2013)“…Probabilistic Roadmap Methods (PRMs) are widely used motion planning methods that sample robot configurations (nodes) and connect them to form a graph…”
Get full text
Conference Proceeding -
7
The anatomy of a distributed motion planning roadmap
Published in 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems (01-09-2014)“…In this paper, we evaluate and compare the quality and structure of roadmaps constructed from parallelizing sampling-based motion planning algorithms against…”
Get full text
Conference Proceeding -
8
Parallelizing Training of Deep Generative Models on Massive Scientific Datasets
Published in 2019 IEEE International Conference on Cluster Computing (CLUSTER) (01-09-2019)“…Training deep neural networks on large scientific data is a challenging task that requires enormous compute power, especially if no pre-trained models exist to…”
Get full text
Conference Proceeding -
9
On the weakly C-H···π hydrogen bonded complexes of sevoflurane and benzene
Published in Physical chemistry chemical physics : PCCP (21-08-2011)“…A vibrational assignment of the anaesthetic sevoflurane, (CF(3))(2)CHOCH(2)F, is proposed and its interaction with the aromatic model compound benzene is…”
Get more information
Journal Article -
10
Blind RRT: A probabilistically complete distributed RRT
Published in 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems (01-11-2013)“…Rapidly-Exploring Random Trees (RRTs) have been successful at finding feasible solutions for many types of problems. With motion planning becoming more…”
Get full text
Conference Proceeding -
11
Learning Interpretable Models Through Multi-Objective Neural Architecture Search
Published 16-12-2021“…Monumental advances in deep learning have led to unprecedented achievements across various domains. While the performance of deep neural networks is…”
Get full text
Journal Article -
12
Training Ultra Long Context Language Model with Fully Pipelined Distributed Transformer
Published 29-08-2024“…Large Language Models (LLMs) with long context capabilities are integral to complex tasks in natural language processing and computational biology, such as…”
Get full text
Journal Article -
13
Universal Checkpointing: Efficient and Flexible Checkpointing for Large Scale Distributed Training
Published 26-06-2024“…Existing checkpointing approaches seem ill-suited for distributed training even though hardware limitations make model parallelism, i.e., sharding model state…”
Get full text
Journal Article -
14
Large-Scale Industrial Alarm Reduction and Critical Events Mining Using Graph Analytics on Spark
Published in 2016 IEEE Second International Conference on Big Data Computing Service and Applications (BigDataService) (01-03-2016)“…In current industrial practice, thousands of industrial alarms generating millions of alarm events, are built into digital control systems typically found in…”
Get full text
Conference Proceeding -
15
DeepSpeed Ulysses: System Optimizations for Enabling Training of Extreme Long Sequence Transformer Models
Published 25-09-2023“…Computation in a typical Transformer-based large language model (LLM) can be characterized by batch size, hidden dimension, number of layers, and sequence…”
Get full text
Journal Article -
16
System Optimizations for Enabling Training of Extreme Long Sequence Transformer Models
Published in 2024 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) (27-05-2024)“…Long sequences are ubiquitous in NLP tasks such as document summarization, machine translation, and dialogue modeling [1]-[9]. Traditional approaches to…”
Get full text
Conference Proceeding -
17
Local randomization in neighbor selection improves PRM roadmap quality
Published in 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems (01-10-2012)“…Probabilistic Roadmap Methods (PRMs) are one of the most used classes of motion planning methods. These sampling-based methods generate robot configurations…”
Get full text
Conference Proceeding -
18
ZeRO++: Extremely Efficient Collective Communication for Giant Model Training
Published 16-06-2023“…Zero Redundancy Optimizer (ZeRO) has been used to train a wide range of large language models on massive GPUs clusters due to its ease of use, efficiency, and…”
Get full text
Journal Article -
19
Parallelizing Graph Neural Networks via Matrix Compaction for Edge-Conditioned Networks
Published in 2022 22nd IEEE International Symposium on Cluster, Cloud and Internet Computing (CCGrid) (01-05-2022)“…Graph neural networks (GNNs) are a powerful approach for machine learning on graph datasets. Such datasets often consist of millions of modestly-sized graphs,…”
Get full text
Conference Proceeding -
20
SUPER: SUb-Graph Parallelism for TransformERs
Published in 2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS) (01-05-2021)“…Transformer models have revolutionized the field of Natural Language Processing (NLP) and they achieve state-of-the-art performance in applications like…”
Get full text
Conference Proceeding