Search Results - "Grützmacher, Thomas"

  • Showing 1 - 16 results of 16
Refine Results
  1. 1

    A customized precision format based on mantissa segmentation for accelerating sparse linear algebra by Grützmacher, Thomas, Cojean, Terry, Flegar, Goran, Göbel, Fritz, Anzt, Hartwig

    Published in Concurrency and computation (10-08-2020)
    “…Summary In this work, we pursue the idea of radically decoupling the floating point format used for arithmetic operations from the format used to store the…”
    Get full text
    Journal Article
  2. 2

    Ginkgo - A math library designed to accelerate Exascale Computing Project science applications by Cojean, Terry, Nayak, Pratik, Ribizel, Tobias, Beams, Natalie, Mike Tsai, Yu-Hsiang, Koch, Marcel, Göbel, Fritz, Grützmacher, Thomas, Anzt, Hartwig

    “…Large-scale simulations require efficient computation across the entire computing hierarchy. A challenge of the Exascale Computing Project (ECP) was to…”
    Get full text
    Journal Article
  3. 3

    Compressed basis GMRES on high-performance graphics processing units by Aliaga, José I., Anzt, Hartwig, Grützmacher, Thomas, Quintana-Ortí, Enrique S., Tomás, Andrés E.

    “…Krylov methods provide a fast and highly parallel numerical tool for the iterative solution of many large-scale sparse linear systems. To a large extent, the…”
    Get full text
    Journal Article
  4. 4

    Using Ginkgo's memory accessor for improving the accuracy of memory‐bound low precision BLAS by Grützmacher, Thomas, Anzt, Hartwig, Quintana‐Ortí, Enrique S.

    Published in Software, practice & experience (01-01-2023)
    “…The roofline model not only provides a powerful tool to relate an application's performance with the specific constraints imposed by the target hardware but…”
    Get full text
    Journal Article
  5. 5

    Toward a modular precision ecosystem for high-performance computing by Anzt, Hartwig, Flegar, Goran, Grützmacher, Thomas, Quintana-Ortí, Enrique S

    “…With the memory bandwidth of current computer architectures being significantly slower than the (floating point) arithmetic performance, many scientific…”
    Get full text
    Journal Article
  6. 6
  7. 7

    Compressed basis GMRES on high-performance graphics processing units by Aliaga, José I, Anzt, Hartwig, Grützmacher, Thomas, Quintana-Ortí, Enrique S, Tomás, Andrés E

    “…Krylov methods provide a fast and highly parallel numerical tool for the iterative solution of many large-scale sparse linear systems. To a large extent, the…”
    Get full text
    Journal Article
  8. 8

    Compression and load balancing for efficient sparse matrix‐vector product on multicore processors and graphics processing units by Aliaga, José I., Anzt, Hartwig, Grützmacher, Thomas, Quintana‐Ortí, Enrique S., Tomás, Andrés E.

    Published in Concurrency and computation (25-06-2022)
    “…Summary We contribute to the optimization of the sparse matrix‐vector product by introducing a variant of the coordinate sparse matrix format that balances the…”
    Get full text
    Journal Article
  9. 9
  10. 10

    FRSZ2 for In-Register Block Compression Inside GMRES on GPUs by Grützmacher, Thomas, Underwood, Robert, Di, Sheng, Cappello, Franck, Anzt, Hartwig

    Published 23-09-2024
    “…The performance of the GMRES iterative solver on GPUs is limited by the GPU main memory bandwidth. Compressed Basis GMRES outperforms GMRES by storing the…”
    Get full text
    Journal Article
  11. 11

    Compressed Basis GMRES on High Performance GPUs by Aliaga, José I, Anzt, Hartwig, Grützmacher, Thomas, Quintana-Ortí, Enrique S, Tomás, Andrés E

    Published 25-09-2020
    “…Krylov methods provide a fast and highly parallel numerical tool for the iterative solution of many large-scale sparse linear systems. To a large extent, the…”
    Get full text
    Journal Article
  12. 12

    Variable-Size Batched Condition Number Calculation on GPUs by Anzt, Hartwig, Dongarra, Jack, Flegar, Goran, Grutzmacher, Thomas

    “…We present a kernel that is designed to quickly compute the condition number of a large collection of tiny matrices on a graphics processing unit (GPU). The…”
    Get full text
    Conference Proceeding
  13. 13

    Ginkgo: A Modern Linear Operator Algebra Framework for High Performance Computing by Anzt, Hartwig, Cojean, Terry, Flegar, Goran, Göbel, Fritz, Grützmacher, Thomas, Nayak, Pratik, Ribizel, Tobias, Tsai, Yuhsiang Mike, Quintana-Ortí, Enrique S

    Published 30-06-2020
    “…In this paper, we present Ginkgo, a modern C++ math library for scientific high performance computing. While classical linear algebra libraries act on matrix…”
    Get full text
    Journal Article
  14. 14
  15. 15

    Gate-induced decoupling of surface and bulk state properties in selectively-deposited Bi$_2$Te$_3$ nanoribbons by Rosenbach, Daniel, Moors, Kristof, Jalil, Abdur R., Kölzer, Jonas, Zimmermann, Erik, Schubert, Jürgen, Karimzadah, Soraya, Mussler, Gregor, Schüffelgen, Peter, Grützmacher, Detlev, Lüth, Hans, Schäpers, Thomas

    Published in SciPost physics core (30-03-2022)
    “…Three-dimensional topological insulators (TIs) host helical Dirac surface states at the interface with a trivial insulator. In quasi-one-dimensional TI…”
    Get full text
    Journal Article
  16. 16

    High-Performance GPU Implementation of PageRank with Reduced Precision Based on Mantissa Segmentation by Grutzmacher, Thomas, Anzt, Hartwig, Scheidegger, Florian, Quintana-Orti, Enrique S.

    “…We address the acceleration of the PageRank al- gorithm for web information retrieval on graphics processing units (GPUs) via a modular precision framework…”
    Get full text
    Conference Proceeding