DIANE Distributed Analysis Environment for semiinteractive simulation and
DIANE Distributed Analysis Environment for semiinteractive simulation and analysis in Physics Jakub T. Moscicki, CERN/IT CHEP 03 Jakub. Moscicki@cern. ch DIANE Project
The need for distribution do the analysis/simulation job in parallel tasks to speed up the work by using powerful, worldwide distributed computentional resources, acessing the data in mass storage systems otherwise too big to fit on your laptop. CHEP 03 Jakub. Moscicki@cern. ch DIANE Project
Practical Example example: simulation with analysis each task produces a file with histograms job result = sum of histograms produced by tasks master-worker model client starts a job workers perform tasks and produce histograms master integrates the results CHEP 03 Jakub. Moscicki@cern. ch DIANE Project
Tools at hand: local batch queue clusters/farms of PCs running batch queues use LSF or PBS to submit parallel analysis tasks producing histograms collect and post-process results by hand add all the resulting histogram files CHEP 03 Jakub. Moscicki@cern. ch DIANE Project
Tools at hand: global batch queue federation of clusters also known as a GRID use EDG Resource Broker to submit tasks CHEP 03 Jakub. Moscicki@cern. ch DIANE Project
Comments using middleware directly requires a lot of manual work integration of task results keeping track of failed task and resubmiting workers not easy to monitor the job progress and cancel jobs only one task per worker very inefficient if worker initialization time is long CHEP 03 Jakub. Moscicki@cern. ch DIANE Project
User Wishlist automatic integration of task results monitoring of job progress and individual tasks automatic error-recovery policies granularity of the size of the task may change independently of the number of workers -- natural load-balancing and optimization of performance fine tuning – workers may be mapped to threads, processed or machines depending on the context uniform, transparent and easy user interface and API hiding complexity of underlying middleware mechanisms the same API and UI is used when running local jobs and GRID jobs batch, interactive and semi-interactive operation mode CHEP 03 Jakub. Moscicki@cern. ch DIANE Project
Wishlist (cntd) a lightweight “add-on” framework which drives the execution of parallel jobs in master worker model over any specific middleware implementation: application oriented: target common HEP use cases independent from any particular analysis tool with layered and modular architecture which is easy to adapt to new environment: important for middleware transition integrated in modern scripting environment: e. g. python using standards: e. g. exploit AIDA for analysis making it easy to plug your favourite analysis tool To address these issues DIANE Project was set up in CERN/IT CHEP 03 Jakub. Moscicki@cern. ch DIANE Project
DIANE Overview DIANE R&D Project started in 2001 in CERN/IT with very limited resources (~1 FTE) collaboration with Geant 4 groups at CERN, INFN, ESA succesful prototypes running on LSF and EDG CHEP 03 Jakub. Moscicki@cern. ch DIANE Project
Applications of DIANE Examples of interdisciplinary applications Geant 4 simulation and analysis LHC: ntuple analysis and simulation radiotherapy: brachytherapy, IMRT space missions: ESA Bepi Colombo, LISA CHEP 03 Jakub. Moscicki@cern. ch DIANE Project
DIANE for HEP workgroup clusters features many users, many jobs diverse applications: ntuple analysis, simulation, . . . interactive. . . semi-interactive. . . batch ~ 100 s of machines dynamic environment CHEP 03 users may submit their analysis code mixed CPU and I/O intensive some applications may be preconfigured general analysis e. g. ntuple projections or experiment specific apps load balancing important Jakub. Moscicki@cern. ch DIANE Project
DIANE for Simulation in Medical Apps example: brachytherapy optimization of the treatment planning by MC simulation features CPU intensive few users, few jobs one preconfigured application interactive: seconds. . minutes ~ 10 s of machines ongoing joint collaboration with G 4 and hospital units in Torino, Italy CHEP 03 Jakub. Moscicki@cern. ch DIANE Project
DIANE for Simulation in Space Science LISA: MC simulation for gravitational waves experiment Bepi Colombo mission: HERMES experiment features CPU intensive big jobs (10 processor-years) preconfigured applications batch: days 1000+ machines requirements: error recovery important monitoring and diagnostics CHEP 03 Jakub. Moscicki@cern. ch DIANE Project
DIANE Prototype and Testing scalability tests 70 worker nodes 140 milion Geant 4 events CHEP 03 Jakub. Moscicki@cern. ch DIANE Project
DIANE Screenshot CHEP 03 Jakub. Moscicki@cern. ch DIANE Project
DIANE Web Interface CHEP 03 Jakub. Moscicki@cern. ch DIANE Project
References more informarion: cern. ch/diane www. ge. infn. it/geant 4/techtransf aida. freehep. org CHEP 03 Jakub. Moscicki@cern. ch DIANE Project
The end CHEP 03 Jakub. Moscicki@cern. ch DIANE Project
- Slides: 18