Search Results - "Grützmacher, Thomas" :: Katalog Arama

1
A customized precision format based on mantissa segmentation for accelerating sparse linear algebra by Grützmacher, Thomas, Cojean, Terry, Flegar, Goran, Göbel, Fritz, Anzt, Hartwig

Published in Concurrency and computation (10-08-2020)
“…Summary In this work, we pursue the idea of radically decoupling the floating point format used for arithmetic operations from the format used to store the…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
2
$Ginkgo - A math library designed to accelerate Exascale Computing Project science applications$
Ginkgo - A math library designed to accelerate Exascale Computing Project science applications by Cojean, Terry, Nayak, Pratik, Ribizel, Tobias, Beams, Natalie, Mike Tsai, Yu-Hsiang, Koch, Marcel, Göbel, Fritz, Grützmacher, Thomas, Anzt, Hartwig

Published in The international journal of high performance computing applications (01-11-2024)
“…Large-scale simulations require efficient computation across the entire computing hierarchy. A challenge of the Exascale Computing Project (ECP) was to…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
3
Compressed basis GMRES on high-performance graphics processing units by Aliaga, José I., Anzt, Hartwig, Grützmacher, Thomas, Quintana-Ortí, Enrique S., Tomás, Andrés E.

Published in The international journal of high performance computing applications (05-08-2022)
“…Krylov methods provide a fast and highly parallel numerical tool for the iterative solution of many large-scale sparse linear systems. To a large extent, the…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
4
Using Ginkgo's memory accessor for improving the accuracy of memory‐bound low precision BLAS by Grützmacher, Thomas, Anzt, Hartwig, Quintana‐Ortí, Enrique S.

Published in Software, practice & experience (01-01-2023)
“…The roofline model not only provides a powerful tool to relate an application's performance with the specific constraints imposed by the target hardware but…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
5
Toward a modular precision ecosystem for high-performance computing by Anzt, Hartwig, Flegar, Goran, Grützmacher, Thomas, Quintana-Ortí, Enrique S

Published in The international journal of high performance computing applications (01-11-2019)
“…With the memory bandwidth of current computer architectures being significantly slower than the (floating point) arithmetic performance, many scientific…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
6
Multifacets of lossy compression for scientific data in the Joint-Laboratory of Extreme Scale Computing by Cappello, Franck, Acosta, Mario, Agullo, Emmanuel, Anzt, Hartwig, Calhoun, Jon, Di, Sheng, Giraud, Luc, Grützmacher, Thomas, Jin, Sian, Sano, Kentaro, Sato, Kento, Singh, Amarjit, Tao, Dingwen, Tian, Jiannan, Ueno, Tomohiro, Underwood, Robert, Vivien, Frédéric, Yepes, Xavier, Kazutomo, Yoshii, Zhang, Boyuan

Published in Future generation computer systems (01-02-2025)
“…The Joint Laboratory on Extreme-Scale Computing (JLESC) was initiated at the same time lossy compression for scientific data became an important topic for the…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
7
Compressed basis GMRES on high-performance graphics processing units by Aliaga, José I, Anzt, Hartwig, Grützmacher, Thomas, Quintana-Ortí, Enrique S, Tomás, Andrés E

Published in The international journal of high performance computing applications (01-03-2023)
“…Krylov methods provide a fast and highly parallel numerical tool for the iterative solution of many large-scale sparse linear systems. To a large extent, the…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
8
Compression and load balancing for efficient sparse matrix‐vector product on multicore processors and graphics processing units by Aliaga, José I., Anzt, Hartwig, Grützmacher, Thomas, Quintana‐Ortí, Enrique S., Tomás, Andrés E.

Published in Concurrency and computation (25-06-2022)
“…Summary We contribute to the optimization of the sparse matrix‐vector product by introducing a variant of the coordinate sparse matrix format that balances the…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
9
Ginkgo: A high performance numerical linear algebra library by Anzt, Hartwig, Cojean, Terry, Chen, Yen-Chen, Flegar, Goran, Göbel, Fritz, Grützmacher, Thomas, Nayak, Pratik, Ribizel, Tobias, Tsai, Yu-Hsiang

Published in Journal of open source software (31-08-2020)

Get full text

Journal Article
QR Code
Save to List

Saved in:
10
FRSZ2 for In-Register Block Compression Inside GMRES on GPUs by Grützmacher, Thomas, Underwood, Robert, Di, Sheng, Cappello, Franck, Anzt, Hartwig

Published 23-09-2024
“…The performance of the GMRES iterative solver on GPUs is limited by the GPU main memory bandwidth. Compressed Basis GMRES outperforms GMRES by storing the…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
11
Compressed Basis GMRES on High Performance GPUs by Aliaga, José I, Anzt, Hartwig, Grützmacher, Thomas, Quintana-Ortí, Enrique S, Tomás, Andrés E

Published 25-09-2020
“…Krylov methods provide a fast and highly parallel numerical tool for the iterative solution of many large-scale sparse linear systems. To a large extent, the…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
12
Variable-Size Batched Condition Number Calculation on GPUs by Anzt, Hartwig, Dongarra, Jack, Flegar, Goran, Grutzmacher, Thomas

Published in 2018 30th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD) (01-09-2018)
“…We present a kernel that is designed to quickly compute the condition number of a large collection of tiny matrices on a graphics processing unit (GPU). The…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in:
13
Ginkgo: A Modern Linear Operator Algebra Framework for High Performance Computing by Anzt, Hartwig, Cojean, Terry, Flegar, Goran, Göbel, Fritz, Grützmacher, Thomas, Nayak, Pratik, Ribizel, Tobias, Tsai, Yuhsiang Mike, Quintana-Ortí, Enrique S

Published 30-06-2020
“…In this paper, we present Ginkgo, a modern C++ math library for scientific high performance computing. While classical linear algebra libraries act on matrix…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
14
A Survey of Numerical Methods Utilizing Mixed Precision Arithmetic by Abdelfattah, Ahmad, Anzt, Hartwig, Boman, Erik G, Carson, Erin, Cojean, Terry, Dongarra, Jack, Gates, Mark, Grützmacher, Thomas, Higham, Nicholas J, Li, Sherry, Lindquist, Neil, Liu, Yang, Loe, Jennifer, Luszczek, Piotr, Nayak, Pratik, Pranesh, Sri, Rajamanickam, Siva, Ribizel, Tobias, Smith, Barry, Swirydowicz, Kasia, Thomas, Stephen, Tomov, Stanimire, Tsai, Yaohung M, Yamazaki, Ichitaro, Yang, Urike Meier

Published 13-07-2020
“…Within the past years, hardware vendors have started designing low precision special function units in response to the demand of the Machine Learning community…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
15
Gate-induced decoupling of surface and bulk state properties in selectively-deposited Bi$_2$Te$_3$ nanoribbons by Rosenbach, Daniel, Moors, Kristof, Jalil, Abdur R., Kölzer, Jonas, Zimmermann, Erik, Schubert, Jürgen, Karimzadah, Soraya, Mussler, Gregor, Schüffelgen, Peter, Grützmacher, Detlev, Lüth, Hans, Schäpers, Thomas

Published in SciPost physics core (30-03-2022)
“…Three-dimensional topological insulators (TIs) host helical Dirac surface states at the interface with a trivial insulator. In quasi-one-dimensional TI…”

Get full text

Journal Article
QR Code
Save to List

Saved in:
16
High-Performance GPU Implementation of PageRank with Reduced Precision Based on Mantissa Segmentation by Grutzmacher, Thomas, Anzt, Hartwig, Scheidegger, Florian, Quintana-Orti, Enrique S.

Published in 2018 IEEE/ACM 8th Workshop on Irregular Applications: Architectures and Algorithms (IA3) (01-11-2018)
“…We address the acceleration of the PageRank al- gorithm for web information retrieval on graphics processing units (GPUs) via a modular precision framework…”

Get full text

Conference Proceeding
QR Code
Save to List

Saved in: