Počet záznamů: 1
Islands-of-Cores Approach for Harnessing SMP/NUMA Architectures in Heterogeneous Stencil Computations
- 1.
SYSNO ASEP 0482853 Druh ASEP C - Konferenční příspěvek (mezinárodní konf.) Zařazení RIV D - Článek ve sborníku Název Islands-of-Cores Approach for Harnessing SMP/NUMA Architectures in Heterogeneous Stencil Computations Tvůrce(i) Szustak, L. (PL)
Wyrzykowski, R. (PL)
Jakl, Ondřej (UGN-S) RIDCelkový počet autorů 3 Zdroj.dok. Parallel Computing Technologies. PaCT 2017. Lecture Notes in Computer Science. - Berlin : Springer Verlag, 2017 / Malyshkin V. - ISBN 978-3-319-62931-5 Rozsah stran s. 351-364 Poč.str. 14 s. Forma vydání Online - E Akce International Conference, PaCT 2017 /14./ Datum konání 04.09.2017 - 08.09.2017 Místo konání Nizhny Novgorod Země RU - Rusko Typ akce WRD Jazyk dok. eng - angličtina Země vyd. DE - Německo Klíč. slova shared memory system ; stencil computations ; work-load distribution Vědní obor RIV BA - Obecná matematika Obor OECD Applied mathematics CEP LD15105 GA MŠMT - Ministerstvo školství, mládeže a tělovýchovy Institucionální podpora UGN-S - RVO:68145535 UT WOS 000444105600034 EID SCOPUS 85028708294 DOI 10.1007/978-3-319-62932-2_34 Anotace SMP/NUMA systems are powerful HPC platforms which could be applied for a wide range of real-life applications. These systems provide large capacity of shared memory, and allow using the shared-variable programming model to take advantages of shared memory for inter-process communications and synchronizations. However, as data can be physically dispersed over many nodes, the access to various data items may require significantly different times. In this paper, we face the challenge of harnessing the heterogeneous nature of SMP/NUMA communications for a complex scientific application which implements the Multidimensional Positive Definite Advection Transport Algorithm (MPDATA), consisting of a set of heterogeneous stencil computations. When using our method of MPDATA workload distribution, which was successfully applied for small-scale shared memory systems with several CPUs and/or accelerators, significant performance losses are noticeable for larger SMP/NUMA systems, such as SGI UV 2000 server used in this work. To overcome this shortcoming, we propose a new islands-of-cores approach. It exposes a correlation between computation and communication for heterogeneous stencils, and enables an efficient management of trade-off between computation and communication costs in accordance with the features of SMP/NUMA systems. In consequence, when using the maximum configuration with 112 cores of 14 Intel Xeon E5-4627v2 3.3 GHz processors, the proposed approach accelerates the previous method more then 10 times, achieving about 390 Gflop/s, or approximately 30% of the theoretical peak performance. Pracoviště Ústav geoniky Kontakt Lucie Gurková, lucie.gurkova@ugn.cas.cz, Tel.: 596 979 354 Rok sběru 2018 Elektronická adresa https://link.springer.com/chapter/10.1007/978-3-319-62932-2_34#citeas
Počet záznamů: 1