額田 彰(ヌカダ アキラ)
- 論文
- GPUのキャッシュを考慮した疎行列ベクトル積計算手法の性能評価
長坂 侑亮; 額田 彰; 松岡 聡
情報処理学会研究報告/2014-HPC-144(5), 2014-05 - 疎行列ベクトル積計算を対象としたGPU向けメモリアクセス削減手法
長坂 侑亮; 額田 彰; 松岡 聡
情報処理学会研究報告/2015-HPC-151(8), 2015-09 - GraphCNN向けの疎行列積計算Batch最適化
長坂 侑亮; 額田 彰; 小島 諒介; 松岡 聡
情報処理学会研究報告/2018-HPC-167(7), 2018-12 - 小疎行列積計算のGPU最適化
長坂 侑亮; 額田 彰; 小島 諒介; 松岡 聡
情報処理学会研究報告/2019-HPC-168(19), 2019-03 - TSUBAME3.0におけるストレージ利用効率化のためのファイルシステムベンチマーク
野村 哲弘; 三浦 信一; 實本 英之; 額田 彰; 遠藤 敏夫
情報処理学会研究報告/2019-HPC-170(24), 2019-07 - 異種アクセラレータを持つTSUBAMEスーパーコンピュータのLinpack評価
遠藤敏夫; 額田 彰; 松岡聡
応用数理/20(2)/pp.29-36, 2010-06 - CUDAによる高速フーリエ変換
額田 彰
応用数理/20(2)/pp.37-43, 2010-06 - FFTSS: a High Performance Fast Fourier Transform Library
Nukada Akira
2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings/III/pp.980-983, 2006-05 - High Performance 3D Convolution for Protein Docking on IBM Blue Gene
Nukada Akira; Hourai Yuichiro; Nishida Akira; Akiyama Yu...
Parallel and Distributed Processing and Applications. ISPA 2007. Lecture Notes in Computer Science/4742/pp.958-969, 2007-08 - Bandwidth Intensive 3-D FFT kernel for GPUs using CUDA
Nukada Akira; Ogata Yasuhiko; Endo Toshio; Matsuoka Satoshi
SC '08: Proceedings of the 2008 ACM/IEEE Conference on Supercomputing, 2008-11 - Fast Conjugate Gradients with Multiple GPUs
Cevahir Ali; Nukada Akira; Matsuoka Satoshi
ICCS 2009: Computational Science – ICCS 2009/pp.893-903, 2009-05 - Auto-Tuning 3-D FFT Library for CUDA GPUs
Nukada Akira; Matsuoka Satoshi
SC '09: Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, 2009-11 - Linpack Evaluation on a Supercomputer with Heterogeneous Accelerators
Endo Toshio; Nukada Akira; Matsuoka Satoshi; Maruyama Naoya
2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS), 2010-04 - A High-Performance Fault-Tolerant Software Framework for Memory on Commodity GPUs
Maruyama Naoya; Nukada Akira; Matsuoka Satoshi
2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS), 2010-04 - High Performance Conjugate Gradient Solver on Multi-GPU Clusters Using Hypergraph Partitioning
Cevahir Ali; Nukada Akira; Matsuoka Satoshi
Computer Science - Research and Development/25(1-2)/pp.83-91, 2010-05 - Statistical Power Modeling of GPU Kernels Using Performance Counters
Nagasaka Hitoshi; Maruyama Naoya; Nukada Akira; Endo Tos...
GREENCOMP '10: Proceedings of the International Conference on Green Computing/pp.115-122, 2010-08 - An 80-Fold Speedup, 15.0 TFlops, Full GPU Acceleration of Non-Hydrostatic Weather Model ASUCA Production Code
Shimokawabe Takashi; Aoki Takayuki; Muroi Chiashi; Ishida...
SC '10: Proceedings of the 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, 2010-11 - Low-overhead diskless checkpoint for hybrid computing systems
Gomez Leonardo Bautista; Nukada Akira; Maruyama Naoya; Ca...
International Conference on High Performance Computing (HiPC 2010), 2010-12 - NVCR: A Transparent Checkpoint-Restart Library for NVIDIA CUDA
Nukada Akira; Takizawa Hiroyuki; Matsuoka Satoshi
20th Heterogeneity in Computing Workshop (HCW 2011)/pp.104-113, 2011-05 - Hamming Color Code for Dense and Robust One-shot 3D Scanning
Yamazaki Shuntaro; Nukada Akira; Mochimaru Masaaki
2011 British Machine Vision Conference/pp.96.1-96.9, 2011-08 - Peta-scale Phase-Field Simulation for Dendritic Solidification on the TSUBAME 2.0 Supercomputer
Shimokawabe Takashi; Aoki Takayuki; Takaki Tomohiro; Yama...
SC '11: Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis, 2011-11 - High Performance 3-D FFT using multiple CUDA GPUs
Nukada Akira; Maruyama Yutaka; Matsuoka Satoshi
Fifth Workshop on General Purpose Processing using Graphics Processing Units (GPGPU-5)/pp.57-63, 2012-03 - Scalable Multi-GPU 3-D FFT for TSUBAME 2.0 Supercomputer
Nukada Akira; Sato Kento; Matsuoka Satoshi
SC '12: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, 2012-11 - Mixed Precision AMG method for Many Core Accelerators
Sumiyoshi Yuki; Fujii Akihiro; Nukada Akira; Tanaka Teruo
nternational Workshop on Enhancing Parallel Scientific Applications with Accelerated HPC (ESAA 2014)/pp.127-132, 2014-08 - TSUBAME-KFC: a Modern Liquid Submersion Cooling Prototype towards Exascale Becoming the Greenest Supercomputer in the World
Endo Toshio; Nukada Akira; Matsuoka Satoshi
20th IEEE International Conference on Parallel and Distributed Systems (ICPADS 2014)/pp.360-367, 2014-12 - さらに表示...
- GPUのキャッシュを考慮した疎行列ベクトル積計算手法の性能評価