Distributed Analysis in Production with RDataFrame

Guardado en:
Detalles Bibliográficos
Publicado en:EPJ Web of Conferences vol. 337 (2025)
Autor principal: Czurylo, Marta
Otros Autores: Padulano, Vincenzo Eduardo, Piparo, Danilo, Andrea Maria Ola Mejicanos
Publicado:
EDP Sciences
Materias:
Acceso en línea:Citation/Abstract
Full Text - PDF
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!

MARC

LEADER 00000nab a2200000uu 4500
001 3263154287
003 UK-CbPIL
022 |a 2101-6275 
022 |a 2100-014X 
024 7 |a 10.1051/epjconf/202533701007  |2 doi 
035 |a 3263154287 
045 2 |b d20250101  |b d20251231 
084 |a 182355  |2 nlm 
100 1 |a Czurylo, Marta 
245 1 |a Distributed Analysis in Production with RDataFrame 
260 |b EDP Sciences  |c 2025 
513 |a Conference Proceedings 
520 3 |a The ROOT software package provides the data format used in High Energy Physics by the LHC experiments. ROOT offers a data analysis interface called RDataFrame, which has proven to adapt well to the requirements of modern physics analyses. However, with the increasing data collected by the LHC experiments, the challenge to perform an efficient analysis expands. One of the solutions to ease this challenge is the leverage of modern high-performing distributed computing environments, for which RDataFrame provides an easy-to-use interface layer - the distributed RDataFrame.In this paper, we show that the distributed RDataFrame is out of the experimental testing phase, and it is now ready for production thanks to a stabilized user interface. We delve into recent improvements of the distributed RDataFrame, including memory management, C++ code inclusion, and Pythonizations of the interface that allow running the workflows seamlessly. This includes running the distributed RDataFrame on various Analysis Facilities, which is discussed towards the end of the paper. 
653 |a Data analysis 
653 |a Distributed memory 
653 |a Distributed processing 
653 |a Memory management 
700 1 |a Padulano, Vincenzo Eduardo 
700 1 |a Piparo, Danilo 
700 1 |a Andrea Maria Ola Mejicanos 
773 0 |t EPJ Web of Conferences  |g vol. 337 (2025) 
786 0 |d ProQuest  |t Advanced Technologies & Aerospace Database 
856 4 1 |3 Citation/Abstract  |u https://www.proquest.com/docview/3263154287/abstract/embedded/75I98GEZK8WCJMPQ?source=fedsrch 
856 4 0 |3 Full Text - PDF  |u https://www.proquest.com/docview/3263154287/fulltextPDF/embedded/75I98GEZK8WCJMPQ?source=fedsrch