Search Results - "Fukumoto, Naoto"

  • Showing 1 - 14 results of 14
Refine Results
  1. 1
  2. 2

    A traffic-aware memory-cube network using bypassing by Shikama, Yoshiya, Kawano, Ryuta, Matsutani, Hiroki, Amano, Hideharu, Nagasaka, Yusuke, Fukumoto, Naoto, Koibuchi, Michihiro

    Published in Microprocessors and microsystems (01-04-2022)
    “…Three-dimensional stack memory which provides both high-bandwidth access and large capacity is a promising technology for next-generation computer systems…”
    Get full text
    Journal Article
  3. 3

    Efficient Collision-Free MTTKRP Algorithm for Multi-core CPUs with Less Memory Usage by Nagasaka, Yusuke, Fukumoto, Naoto

    “…Tensor decomposition is often used to extract underlying features in the analysis of large and multi-dimensional data. For the tensor data with sparse…”
    Get full text
    Conference Proceeding
  4. 4

    Performance Analysis of Multi-Containerized MD Simulations for Low-Level Resource Allocation by Okuno, Shingo, Hirai, Akira, Fukumoto, Naoto

    “…This study discusses scheduling strategies to maximize ensemble throughput, which is the total throughput of multiple containers running simultaneously. Such a…”
    Get full text
    Conference Proceeding
  5. 5

    Towards Straggler-Tolerant and Accuracy-Aware Distributed DNN Training in Clouds by Okuno, Shingo, Miwa, Masahiro, Fukumoto, Naoto

    “…This study investigated how straggler mitigation affects accuracy during distributed training. While distributed training is one promising way to shorten…”
    Get full text
    Conference Proceeding
  6. 6

    Performance Analysis of Quantum Computer Simulators Across Different Environments by Aoki, Nozomi, Yamazaki, Masafumi, Hirai, Akira, Yamaoka, Mari, Fukumoto, Naoto, Kasagi, Akihiko, Oguchi, Masato

    “…Quantum computers can achieve extremely fast computations for certain problems using quantum properties. Due to these factors, research and development in the…”
    Get full text
    Conference Proceeding
  7. 7

    3D implemented SRAM/DRAM hybrid cache architecture for high-performance and low power consumption by Inoue, Koji, Hashiguchi, Shinya, Ueno, Shinya, Fukumoto, Naoto, Murakami, Kazuaki

    “…This paper introduces our research status focusing on 3D-implemented microprocessors. 3D-IC is one of the most interesting techniques to achieve…”
    Get full text
    Conference Proceeding
  8. 8

    mpiQulacs: A Scalable Distributed Quantum Computer Simulator for ARM-based Clusters by Tabuchi, Akihiro, Imamura, Satoshi, Yamazaki, Masafumi, Honda, Takumi, Kasagi, Akihiko, Nakao, Hiroshi, Fukumoto, Naoto, Nakashima, Kohta

    “…Quantum computer simulators running on classical computers are essential for developing real quantum computers and emerging quantum applications. In…”
    Get full text
    Conference Proceeding
  9. 9

    mpiQulacs: A Distributed Quantum Computer Simulator for A64FX-based Cluster Systems by Imamura, Satoshi, Yamazaki, Masafumi, Honda, Takumi, Kasagi, Akihiko, Tabuchi, Akihiro, Nakao, Hiroshi, Fukumoto, Naoto, Nakashima, Kohta

    Published 30-03-2022
    “…Quantum computer simulators running on classical computers are essential for developing real quantum computers and emerging quantum applications. In…”
    Get full text
    Journal Article
  10. 10
  11. 11

    Low-Latency Low-Energy Memory-Cube Networks using Dual-Voltage Datapaths by Shikama, Yoshiya, Kawano, Ryuta, Matsutani, Hiroki, Amano, Hideharu, Nagasaka, Yusuke, Fukumoto, Naoto, Koibuchi, Michihiro

    “…Three-dimensional stack memory that provides both high-bandwidth access and large capacity is a promising technology for next-generation computer systems…”
    Get full text
    Conference Proceeding
  12. 12

    Yet Another Accelerated SGD: ResNet-50 Training on ImageNet in 74.7 seconds by Yamazaki, Masafumi, Kasagi, Akihiko, Tabuchi, Akihiro, Honda, Takumi, Miwa, Masahiro, Fukumoto, Naoto, Tabaru, Tsuguchika, Ike, Atsushi, Nakashima, Kohta

    Published 29-03-2019
    “…There has been a strong demand for algorithms that can execute machine learning as faster as possible and the speed of deep learning has accelerated by 30…”
    Get full text
    Journal Article
  13. 13
  14. 14

    Analyzing the impact of data prefetching on Chip MultiProcessors by Fukumoto, N., Mihara, T., Inoue, K., Murakami, K.

    “…Data prefetching is a well known approach to compensating for poor memory performance, and has been employed in commercial processor chips. Although a number…”
    Get full text
    Conference Proceeding