Research output: Contribution to journal › Conference article › peer-review
Federated data storage system prototype for LHC experiments and data intensive science. / Kiryanov, A.; Klimentov, A.; Krasnopevtsev, D.; Ryabinkin, E.; Zarochentsev, A.
In: Journal of Physics: Conference Series, Vol. 898, No. 6, 062016, 23.11.2017.
TY - JOUR
T1 - Federated data storage system prototype for LHC experiments and data intensive science
AU - Kiryanov, A.
AU - Klimentov, A.
AU - Krasnopevtsev, D.
AU - Ryabinkin, E.
AU - Zarochentsev, A.
PY - 2017/11/23
Y1 - 2017/11/23
N2 - The rapid increase of data volume from the experiments running at the Large Hadron Collider (LHC) has prompted the physics computing community to evaluate new data handling and processing solutions. Russian grid sites and university clusters, scattered over a large area, face the task of uniting their resources for future productive work while also providing an opportunity to support large physics collaborations. In our project we address the fundamental problem of designing a computing architecture to integrate distributed storage resources for LHC experiments and other data-intensive science applications, and to provide access to data from heterogeneous computing facilities. Our studies include the development and implementation of a federated data storage prototype for Worldwide LHC Computing Grid (WLCG) centres of different levels and university clusters within one national cloud. The prototype is based on computing resources located in Moscow, Dubna, Saint Petersburg, Gatchina and Geneva. The project aims to implement federated distributed storage for all kinds of operations, such as read/write/transfer and access via WAN, from grid centres, university clusters, supercomputers, and academic and commercial clouds. The efficiency and performance of the system are demonstrated using synthetic and experiment-specific tests, including real data processing and analysis workflows from the ATLAS and ALICE experiments, as well as compute-intensive bioinformatics applications (PALEOMIX) running on supercomputers. We present the topology and architecture of the designed system, report performance and statistics for different access patterns, and show how federated data storage can be used efficiently by physicists and biologists. We also describe how sharing data on a widely distributed storage system can lead to a new computing model and changes in computing style, for instance how a bioinformatics program running on a supercomputer can read and write data in the federated storage.
AB - The rapid increase of data volume from the experiments running at the Large Hadron Collider (LHC) has prompted the physics computing community to evaluate new data handling and processing solutions. Russian grid sites and university clusters, scattered over a large area, face the task of uniting their resources for future productive work while also providing an opportunity to support large physics collaborations. In our project we address the fundamental problem of designing a computing architecture to integrate distributed storage resources for LHC experiments and other data-intensive science applications, and to provide access to data from heterogeneous computing facilities. Our studies include the development and implementation of a federated data storage prototype for Worldwide LHC Computing Grid (WLCG) centres of different levels and university clusters within one national cloud. The prototype is based on computing resources located in Moscow, Dubna, Saint Petersburg, Gatchina and Geneva. The project aims to implement federated distributed storage for all kinds of operations, such as read/write/transfer and access via WAN, from grid centres, university clusters, supercomputers, and academic and commercial clouds. The efficiency and performance of the system are demonstrated using synthetic and experiment-specific tests, including real data processing and analysis workflows from the ATLAS and ALICE experiments, as well as compute-intensive bioinformatics applications (PALEOMIX) running on supercomputers. We present the topology and architecture of the designed system, report performance and statistics for different access patterns, and show how federated data storage can be used efficiently by physicists and biologists. We also describe how sharing data on a widely distributed storage system can lead to a new computing model and changes in computing style, for instance how a bioinformatics program running on a supercomputer can read and write data in the federated storage.
UR - http://www.scopus.com/inward/record.url?scp=85038422581&partnerID=8YFLogxK
U2 - 10.1088/1742-6596/898/6/062016
DO - 10.1088/1742-6596/898/6/062016
M3 - Conference article
AN - SCOPUS:85038422581
VL - 898
JO - Journal of Physics: Conference Series
JF - Journal of Physics: Conference Series
SN - 1742-6588
IS - 6
M1 - 062016
T2 - 22nd International Conference on Computing in High Energy and Nuclear Physics, CHEP 2016
Y2 - 10 October 2016 through 14 October 2016
ER -