Search Results - "Grützmacher, Thomas"
-
1
A customized precision format based on mantissa segmentation for accelerating sparse linear algebra
Published in Concurrency and computation (10-08-2020)“…Summary In this work, we pursue the idea of radically decoupling the floating point format used for arithmetic operations from the format used to store the…”
Get full text
Journal Article -
2
Ginkgo - A math library designed to accelerate Exascale Computing Project science applications
Published in The international journal of high performance computing applications (01-11-2024)“…Large-scale simulations require efficient computation across the entire computing hierarchy. A challenge of the Exascale Computing Project (ECP) was to…”
Get full text
Journal Article -
3
Compressed basis GMRES on high-performance graphics processing units
Published in The international journal of high performance computing applications (05-08-2022)“…Krylov methods provide a fast and highly parallel numerical tool for the iterative solution of many large-scale sparse linear systems. To a large extent, the…”
Get full text
Journal Article -
4
Using Ginkgo's memory accessor for improving the accuracy of memory‐bound low precision BLAS
Published in Software, practice & experience (01-01-2023)“…The roofline model not only provides a powerful tool to relate an application's performance with the specific constraints imposed by the target hardware but…”
Get full text
Journal Article -
5
Toward a modular precision ecosystem for high-performance computing
Published in The international journal of high performance computing applications (01-11-2019)“…With the memory bandwidth of current computer architectures being significantly slower than the (floating point) arithmetic performance, many scientific…”
Get full text
Journal Article -
6
Multifacets of lossy compression for scientific data in the Joint-Laboratory of Extreme Scale Computing
Published in Future generation computer systems (01-02-2025)“…The Joint Laboratory on Extreme-Scale Computing (JLESC) was initiated at the same time lossy compression for scientific data became an important topic for the…”
Get full text
Journal Article -
7
Compressed basis GMRES on high-performance graphics processing units
Published in The international journal of high performance computing applications (01-03-2023)“…Krylov methods provide a fast and highly parallel numerical tool for the iterative solution of many large-scale sparse linear systems. To a large extent, the…”
Get full text
Journal Article -
8
Compression and load balancing for efficient sparse matrix‐vector product on multicore processors and graphics processing units
Published in Concurrency and computation (25-06-2022)“…Summary We contribute to the optimization of the sparse matrix‐vector product by introducing a variant of the coordinate sparse matrix format that balances the…”
Get full text
Journal Article -
9
Ginkgo: A high performance numerical linear algebra library
Published in Journal of open source software (31-08-2020)Get full text
Journal Article -
10
FRSZ2 for In-Register Block Compression Inside GMRES on GPUs
Published 23-09-2024“…The performance of the GMRES iterative solver on GPUs is limited by the GPU main memory bandwidth. Compressed Basis GMRES outperforms GMRES by storing the…”
Get full text
Journal Article -
11
Compressed Basis GMRES on High Performance GPUs
Published 25-09-2020“…Krylov methods provide a fast and highly parallel numerical tool for the iterative solution of many large-scale sparse linear systems. To a large extent, the…”
Get full text
Journal Article -
12
Variable-Size Batched Condition Number Calculation on GPUs
Published in 2018 30th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD) (01-09-2018)“…We present a kernel that is designed to quickly compute the condition number of a large collection of tiny matrices on a graphics processing unit (GPU). The…”
Get full text
Conference Proceeding -
13
Ginkgo: A Modern Linear Operator Algebra Framework for High Performance Computing
Published 30-06-2020“…In this paper, we present Ginkgo, a modern C++ math library for scientific high performance computing. While classical linear algebra libraries act on matrix…”
Get full text
Journal Article -
14
A Survey of Numerical Methods Utilizing Mixed Precision Arithmetic
Published 13-07-2020“…Within the past years, hardware vendors have started designing low precision special function units in response to the demand of the Machine Learning community…”
Get full text
Journal Article -
15
Gate-induced decoupling of surface and bulk state properties in selectively-deposited Bi$_2$Te$_3$ nanoribbons
Published in SciPost physics core (30-03-2022)“…Three-dimensional topological insulators (TIs) host helical Dirac surface states at the interface with a trivial insulator. In quasi-one-dimensional TI…”
Get full text
Journal Article -
16
High-Performance GPU Implementation of PageRank with Reduced Precision Based on Mantissa Segmentation
Published in 2018 IEEE/ACM 8th Workshop on Irregular Applications: Architectures and Algorithms (IA3) (01-11-2018)“…We address the acceleration of the PageRank al- gorithm for web information retrieval on graphics processing units (GPUs) via a modular precision framework…”
Get full text
Conference Proceeding