Towards the integrated ALICE Online-Offline (O2) monitoring subsystem
- ALICE (A Large Ion Collider Experiment) is preparing for a major upgrade of the detector, readout and computing systemsfor LHC Run 3. A new facility called O2 (Online-Offline) will play a major role in data compression and event processing. To efficiently operate the experiment, we are designing a monitoring subsystem, which will provide a complete overview of the O2 overall health, detect performance degradation and component failures. The monitoring subsystem will receive and collect up to 600 kHz of performance metrics. It consists of a custom monitoring library and a server-side, distributed software covering five main functional tasks: parameter collection and processing, storage, visualisation and alarms. To select the most appropriate tools for these tasks, we evaluated three options: “Modular Stack”, Zabbix and the currently used ALICE Grid monitoring tool called MonALISA. The former one consists of a toolkit including collectd, Apache Flume, Apache Spark, InfluxDB, Grafana and Riemann. This paper describes the monitoring subsystem functional architecture. It goes through a complete evaluation of the three considered options, the selection process, risk assessment and justification for the final decision. The in-depth comparison includes functional features and throughput measurement to ensure the required processing and storage performance.
Verfasserangaben: | Vasco Miguel Chibante BarrosoORCiD, Domenico EliaORCiDGND, Costin Grigoraș, Andrés Gómez RamírezORCiDGND, Gioacchino Vino, Adam WegrzynekORCiD |
---|---|
URN: | urn:nbn:de:hebis:30:3-718075 |
DOI: | https://doi.org/10.1051/epjconf/201921403043 |
ISSN: | 2100-014X |
Titel des übergeordneten Werkes (Englisch): | EPJ Web of Conferences |
Verlag: | EDP Sciences |
Verlagsort: | Les Ulis |
Dokumentart: | Wissenschaftlicher Artikel |
Sprache: | Englisch |
Datum der Veröffentlichung (online): | 17.09.2019 |
Datum der Erstveröffentlichung: | 17.09.2019 |
Veröffentlichende Institution: | Universitätsbibliothek Johann Christian Senckenberg |
Beteiligte Körperschaft: | International Conference on Computing in High Energy and Nuclear Physics (23. : 2018 : Sofia) |
Datum der Freischaltung: | 02.02.2023 |
Jahrgang: | 214 |
Ausgabe / Heft: | 03043 |
Seitenzahl: | 8 |
HeBIS-PPN: | 505760770 |
Institute: | Physik / Physik |
DDC-Klassifikation: | 5 Naturwissenschaften und Mathematik / 53 Physik / 530 Physik |
Sammlungen: | Universitätspublikationen |
Lizenz (Deutsch): | Creative Commons - Namensnennung 4.0 |