Number of the records: 1  

Islands-of-Cores Approach for Harnessing SMP/NUMA Architectures in Heterogeneous Stencil Computations

    SYSNO ASEP0482853
    Document Type C - Proceedings Paper (int. conf.)
    R&D Document Type Conference Paper
    Title Islands-of-Cores Approach for Harnessing SMP/NUMA Architectures in Heterogeneous Stencil Computations
    Author(s) Szustak, L. (PL)
    Wyrzykowski, R. (PL)
    Jakl, Ondřej (UGN-S) RID
    Number of authors 3
    Source Title Parallel Computing Technologies. PaCT 2017. Lecture Notes in Computer Science. - Berlin : Springer Verlag, 2017 / Malyshkin V. - ISBN 978-3-319-62931-5
    Pages 351-364
    Number of pages 14
    Publication form Online - E
    Action International Conference, PaCT 2017 /14./
    Event date 04.09.2017 - 08.09.2017
    Event location Nizhny Novgorod
    Country RU - Russian Federation
    Event type WRD
    Language eng - English
    Country DE - Germany
    Keywords shared memory system ; stencil computations ; work-load distribution
    Subject RIV BA - General Mathematics
    OECD category Applied mathematics
    R&D Projects LD15105 GA MŠMT - Ministry of Education, Youth and Sports (MEYS)
    Institutional support UGN-S - RVO:68145535
    UT WOS000444105600034
    EID SCOPUS85028708294
    DOI 10.1007/978-3-319-62932-2_34
    Annotation SMP/NUMA systems are powerful HPC platforms that can be applied to a wide range of real-life applications. These systems provide a large capacity of shared memory and allow the shared-variable programming model to be used, taking advantage of shared memory for inter-process communication and synchronization. However, because data can be physically dispersed over many nodes, access times to different data items may differ significantly. In this paper, we face the challenge of harnessing the heterogeneous nature of SMP/NUMA communications for a complex scientific application that implements the Multidimensional Positive Definite Advection Transport Algorithm (MPDATA), which consists of a set of heterogeneous stencil computations. Our previous method of MPDATA workload distribution was successfully applied to small-scale shared memory systems with several CPUs and/or accelerators, but it shows significant performance losses on larger SMP/NUMA systems, such as the SGI UV 2000 server used in this work. To overcome this shortcoming, we propose a new islands-of-cores approach. It exposes a correlation between computation and communication for heterogeneous stencils, and enables efficient management of the trade-off between computation and communication costs in accordance with the features of SMP/NUMA systems. As a result, when using the maximum configuration with 112 cores of 14 Intel Xeon E5-4627v2 3.3 GHz processors, the proposed approach accelerates the previous method more than 10 times, achieving about 390 Gflop/s, or approximately 30% of the theoretical peak performance.
    Workplace Institute of Geonics
    Contact Lucie Gurková, lucie.gurkova@ugn.cas.cz, Tel.: 596 979 354
    Year of Publishing 2018
    Electronic address https://link.springer.com/chapter/10.1007/978-3-319-62932-2_34#citeas