Number of records: 1
Simulations and study of a new scheduling approach for distributed data production
SYSNO ASEP: 0506284
ASEP document type: C - Conference paper (international conference)
RIV classification: D - Article in proceedings
Title: Simulations and study of a new scheduling approach for distributed data production
Author(s): Makatun, Dzmitry (UJF-V)
Lauret, J. (US)
Rudová, H. (CZ)
Šumbera, Michal (UJF-V)
Total number of authors: 4
Article number: 012023
Source document: Journal of Physics Conference Series, 762. - Bristol : IOP Publishing, 2016 - ISSN 1742-6588
Number of pages: 7
Publication form: Print - P
Event: 17th International Workshop on Advanced Computing and Analysis Techniques in Physics Research (ACAT 2016)
Event dates: 18.01.2016 - 22.01.2016
Venue: Valparaiso
Country: CL - Chile
Event type: WRD
Document language: eng - English
Country of publication: GB - Great Britain
Keywords: STAR ; ATLAS ; detectors
RIV subject field: BG - Nuclear, atomic and molecular physics, accelerators
OECD field: Nuclear physics
CEP: GA13-20841S, GA ČR - Grant Agency of the Czech Republic; LG15001, GA MŠMT - Ministry of Education, Youth and Sports
Institutional support: UJF-V - RVO:61389005
UT WOS: 000439689600023
EID SCOPUS: 85002132012
DOI: https://doi.org/10.1088/1742-6596/762/1/012023
Annotation: Distributed data processing has found application in many fields of science (High Energy and Nuclear Physics (HENP), astronomy, and biology, to name only a few). We have focused our research on distributed data production, an essential part of computations in HENP. Building on our previous experience, we have recently proposed a new scheduling approach for distributed data production which is based on the network flow maximization model. It has polynomial complexity, providing the required scalability with respect to the size of computations. Our approach improves the overall data production throughput due to three factors: transferring input files in advance of their processing (which decreases I/O latency), balancing the network traffic (including splitting the load between several alternative transfer paths), and transferring files sequentially in a coordinated manner (which reduces the influence of possible network bottlenecks). In this contribution, we present the results of our new simulations based on the GridSim framework, which is one of the commonly used tools in the field of distributed computations.
In these simulations we study the behavior of standard scheduling approaches compared to our recently proposed approach in a realistic environment, relying on data from the STAR and ATLAS experiments and considering the influence of background traffic. The final goal of the research is to integrate the proposed scheduling approach into a real data production framework. To achieve this, we are steadily moving our simulations towards real use cases and studying the scalability of the model and the influence of the scheduling parameters on the quality of the solution.
Workplace: Ústav jaderné fyziky (Nuclear Physics Institute)
Contact: Markéta Sommerová, sommerova@ujf.cas.cz, Tel.: 266 173 228
Year of collection: 2020
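Illustrative note on the annotation: the record states only that the scheduling approach is based on the network flow maximization model and has polynomial complexity. The sketch below is not the authors' scheduler. It is a generic Edmonds-Karp maximum-flow computation (a standard polynomial-time algorithm) on a small made-up network. The node names (`source`, `storage`, `site_a`, `site_b`, `sink`) and all capacities are assumptions chosen for illustration, read loosely as files per planning interval that a link can carry or a site can process.

```python
from collections import defaultdict, deque


def max_flow(capacity, source, sink):
    """Edmonds-Karp: augment along shortest residual paths found by BFS."""
    # Residual graph with explicit zero-capacity reverse edges.
    residual = defaultdict(lambda: defaultdict(int))
    for u, edges in capacity.items():
        for v, c in edges.items():
            residual[u][v] += c
            residual[v][u] += 0

    total = 0
    while True:
        # Breadth-first search for a path with spare capacity.
        parent = {source: None}
        queue = deque([source])
        while queue and sink not in parent:
            u = queue.popleft()
            for v, c in residual[u].items():
                if c > 0 and v not in parent:
                    parent[v] = u
                    queue.append(v)
        if sink not in parent:
            return total  # no augmenting path left, flow is maximal

        # Find the bottleneck capacity along the path.
        bottleneck = float("inf")
        v = sink
        while parent[v] is not None:
            u = parent[v]
            bottleneck = min(bottleneck, residual[u][v])
            v = u

        # Push the bottleneck amount and update reverse edges.
        v = sink
        while parent[v] is not None:
            u = parent[v]
            residual[u][v] -= bottleneck
            residual[v][u] += bottleneck
            v = u
        total += bottleneck


# Hypothetical network: one storage element, two computing sites.
# Edges into "sink" stand for how many files a site can process.
example = {
    "source": {"storage": 10},
    "storage": {"site_a": 4, "site_b": 5},  # direct transfer links
    "site_a": {"site_b": 2, "sink": 3},     # site_a can also relay to site_b
    "site_b": {"sink": 6},
}

if __name__ == "__main__":
    print(max_flow(example, "source", "sink"))
```

In this toy network the relay edge from `site_a` to `site_b` lets the surplus that `site_a` cannot process reach `site_b`, which loosely mirrors the idea of splitting load between alternative transfer paths mentioned in the annotation.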