Number of the records: 1
Unleashing the Performance of ccNUMA Multiprocessor Architectures in Heterogeneous Stencil Computations
- 1.0495450 - ÚGN 2019 RIV DE eng J - Journal Article
Szustak, L. - Halbiniak, K. - Wyrzykowski, R. - Jakl, Ondřej
Unleashing the Performance of ccNUMA Multiprocessor Architectures in Heterogeneous Stencil Computations.
Journal of Supercomputing. Roč. 2018, June 2018 (2018), s. 1-13. ISSN 0920-8542. E-ISSN 1573-0484
Institutional support: RVO:68145535
Keywords : stencil codes * MPDATA * ccNUMA
OECD category: Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Impact factor: 2.157, year: 2018
https://link.springer.com/content/pdf/10.1007%2Fs11227-018-2460-0.pdf
This paper meets the challenge of harnessing the heterogeneous communication architecture of ccNUMA multiprocessors for heterogeneous stencil computations, an important example of which is the Multidimensional Positive Definite Advection Transport Algorithm (MPDATA). We propose a method for optimization of parallel implementation of heterogeneous stencil computations that is a combination of the islands-of-core strategy and ((Formula presented.))D decomposition. The method allows a flexible management of the trade-off between computation and communication costs in accordance with features of modern ccNUMA architectures. Its efficiency is demonstrated for the implementation of MPDATA on the SGI UV 2000 and UV 3000 servers, as well as for 2- and 4-socket ccNUMA platforms based on various Intel CPU architectures, including Skylake, Broadwell, and Haswell.
Permanent Link: http://hdl.handle.net/11104/0288426
File Download Size Commentary Version Access UGN_0495450.pdf 1 687.9 KB Publisher’s postprint open-access
Number of the records: 1