Search Results - "Moreira, Jose E"

Refine Results
  1. 1
  2. 2

    IBM's POWER10 Processor by Starke, William J., Thompto, Brian W., Stuecheli, Jeff A., Moreira, Jose E.

    Published in IEEE MICRO (01-03-2021)
    “…The IBM POWER10 processor represents the 10th generation of the POWER family of enterprise computing engines. It is built on a balance of computation and…”
    Get full text
    Journal Article
  3. 3

    Compiling for the IBM Matrix Engine for Enterprise Workloads by de Carvalho, Joao P. L., Moreira, Jose E., Amaral, Jose Nelson

    Published in IEEE MICRO (01-09-2022)
    “…The matrix-multiply assist (MMA) facility is the latest addition to IBM’s power instruction set architecture and first shipped in the recently introduced…”
    Get full text
    Journal Article
  4. 4

    Fast matrix multiplication via compiler‐only layered data reorganization and intrinsic lowering by Kuzma, Braedy, Korostelev, Ivan, Carvalho, João P. L., Moreira, José E., Barton, Christopher, Araujo, Guido, Amaral, José Nelson

    Published in Software, practice & experience (01-09-2023)
    “…The resurgence of machine learning has increased the demand for high‐performance basic linear algebra subroutines (BLAS), which have long depended on libraries…”
    Get full text
    Journal Article
  5. 5

    The Blue Gene/L Supercomputer: A Hardware and Software Story by Moreira, Jose E, Salapura, Valentina, Almasi, George, Archer, Charles, Bellofatto, Ralph, Bergner, Peter, Bickford, Randy, Blumrich, Mathias, Brunheroto, Jose R, Bright, Arthur A, Brutman, Michael, Castanos, Jose G, Chen, Dong, Coteus, Paul, Crumley, Paul, Ellis, Sam

    “…The Blue Gene/L system at the Department of Energy Lawrence Livermore National Laboratory in Livermore, California is the world's most powerful supercomputer…”
    Get full text
    Journal Article
  6. 6

    The Case for Full-Throttle Computing: An Alternative Datacenter Design Strategy by Moreira, José E, Karidis, John P

    Published in IEEE MICRO (01-07-2010)
    “…The authors argue that the minimum cost of computing can be provided by consolidating real-time workloads onto relatively large servers, which can operate at…”
    Get full text
    Journal Article
  7. 7

    Modeling Matrix Engines for Portability and Performance by Tukanov, Nicholai, Srinivasaraghavan, Rajalakshmi, Moreira, Jose E., Low, Tze Meng

    “…Matrix engines, also known as matrix-multiplication accel-erators, capable of computing on 2D matrices of various data types are traditionally found only on…”
    Get full text
    Conference Proceeding
  8. 8

    C++ and Interoperability Between Libraries: The GraphBLAS C++ Specification by Brock, Benjamin, McMillan, Scott, Buluc, Aydin, Mattson, Timothy G., Moreira, Jose E.

    “…Interoperability between libraries is often hindered by incompatible data formats, which can necessitate creating new copies of data when transferring data…”
    Get full text
    Conference Proceeding
  9. 9

    Introduction to GraphBLAS 2.0 by Brock, Benjamin, Buluc, Aydin, Mattson, Timothy G., McMillan, Scott, Moreira, Jose E.

    “…The GraphBLAS is a set of basic building blocks for constructing graph algorithms in terms of linear algebra. They are first and foremost defined…”
    Get full text
    Conference Proceeding
  10. 10

    GraphBLAS: C++ Iterators for Sparse Matrices by Brock, Benjamin, McMillan, Scott, Buluc, Aydin, Mattson, Timothy G., Moreira, Jose E.

    “…Iteration over opaque, generic data structures is an important feature of many C++ libraries. Aggressive compiler optimization and inlining enables generic C++…”
    Get full text
    Conference Proceeding
  11. 11

    A Roadmap for the GraphBLAS C++ API by Brock, Benjamin, Buluc, Aydin, Mattson, Timothy G., McMillan, Scott, Moreira, Jose E.

    “…The GraphBLAS are building blocks for expressing graph algorithms in terms of linear algebra. Currently, the GraphBLAS are defined as a C API. Implementations…”
    Get full text
    Conference Proceeding
  12. 12

    Considerations for a Distributed GraphBLAS API by Brock, Benjamin, Buluc, Aydin, Mattson, Timothy G., McMillan, Scott, Moreira, Jose E., Pearce, Roger, Selvitopi, Oguz, Steil, Trevor

    “…The GraphBLAS emerged from an international effort to standardize linear-algebraic building blocks for computing on graphs and graph-structured data. The…”
    Get full text
    Conference Proceeding
  13. 13

    Delivering Teraflops: An Account of how Blue Gene was Brought to Life by Moreira, J.E.

    “…The Blue Gene/L system at the Department of Energy Lawrence Livermore National Laboratory in Livermore, California is the world's most powerful supercomputer…”
    Get full text
    Conference Proceeding
  14. 14

    Multitoroidal Interconnects For Tightly Coupled Supercomputers by Aridor, Y., Domany, T., Goldshmidt, O., Kliteynik, Y., Shmueli, E., Moreira, J.E.

    “…The processing elements of many modern tightly coupled multicomputers are connected via mesh or toroidal networks. Such interconnects are simple and highly…”
    Get full text
    Journal Article
  15. 15

    The GraphBLAS 3.0 Project by Kimmerer, Raye, Mattson, Timothy G., McMillan, Scott, Brock, Benjamin, Welch, Erik, Pelletier, Michel, Moreira, Jose E.

    “…The GraphBLAS C API is mature with an updated specification (version 2.1) and a compliant implementation (SuiteSparse GraphBLAS). We are now focused on…”
    Get full text
    Conference Proceeding
  16. 16

    Performance Evaluation of a Commercial Application, Trade, in Scale-out Environments by Dube, P., Hao Yu, Li Zhang, Moreira, J.E.

    “…Scale-out approach, in contrast to scale-up approach (exploring increasing performance by utilizing more powerful shared-memory servers), refers to deployment…”
    Get full text
    Conference Proceeding
  17. 17

    Supporting multidimensional arrays in Java by Moreira, José E., Midkiff, Samuel P., Gupta, Manish

    Published in Concurrency and computation (01-03-2003)
    “…The lack of direct support for multidimensional arrays in JavaTM has been recognized as a major deficiency in the language's applicability to numerical…”
    Get full text
    Journal Article
  18. 18

    Unlocking the Performance of the BlueGene/L Supercomputer by Almasi, George, Chatterjee, Siddhartha, Gara, Alan, Gunnels, John, Gupta, Manish, Henning, Amy, Moreira, Jose E., Walkup, Bob

    “…The BlueGene/L supercomputer is expected to deliver new levels of application performance by providing a combination of good single-node computational…”
    Get full text
    Conference Proceeding
  19. 19

    GraphBLAS C API: Ideas for future versions of the specification by Mattson, Timothy G., Yang, Carl, McMillan, Scott, Buluc, Aydin, Moreira, Jose E.

    “…The GraphBLAS C specification provisional release 1.0 is complete. To manage the scope of the project, we had to defer important functionality to a future…”
    Get full text
    Conference Proceeding
  20. 20

    Implementing the GraphBLAS C API by Moreira, Jose E., Kumar, Manoj, Horn, William P.

    “…This paper describes our implementation of the GraphBLAS C API. The implementation fully hides the internals of GraphBLAS objects from application programs,…”
    Get full text
    Conference Proceeding