The ALICE Dashboard T 1T 2 ALICE tutorial
The ALICE Dashboard T 1/T 2 ALICE tutorial Pablo Saiz, Julia Andreeva, Benjamin Gaidioz, Ricardo Rocha, Irina Sidirova IT-GS-MND CERN IT Department CH-1211 Geneva 23 Switzerland www. cern. ch/it
Overview • Dashboard structure • Dashboard in production – – Job Monitoring SAM FTS monitoring Site status board • Conclusions Internet Services CERN IT Department CH-1211 Genève 23 Switzerland www. cern. ch/it T 1/T 2 ALICE tutorial -- Pablo. Saiz@cern. ch 2
Dashboard Framework Multiple clients: cli, web Multiple output formats: plain text, csv, xml, xhtml Collectors of information Common configuration and management Agents Web / HTTP Interface Data Access Layer (DAO) DB reading and writing via DAO layer Internet Services CERN IT Department CH-1211 Genève 23 Switzerland www. cern. ch/it Connection pooling Oracle DB T 1/T 2 ALICE tutorial Easy to add interface for a different backend -- Pablo. Saiz@cern. ch 3
Dashboard activities COMMON applications ALICE, ATLAS, CMS, LHCb, Job monitoring Site reliability Vlemed Experiment specific applications Data management monitoring for ATLAS Accounting information from Apel and Gratia for ATLAS (prototype) Experiment Dashboard CMS Internet Services CERN IT Department CH-1211 Genève 23 Switzerland www. cern. ch/it Transfer monitoring for ALICE Integration and commissioning Task monitoring for CMS analysis users (ATLAS on the way) IO rate monitoring between WN and SE (prototype) T 1/T 2 ALICE tutorial -- Production monitoring for ATLAS and CMS (prototypes) Site availability based on the Job Robot results of SAM tests monitoring Pablo. Saiz@cern. ch 4
http: //dashboard. cern. ch/alice Internet Services CERN IT Department CH-1211 Genève 23 Switzerland www. cern. ch/it T 1/T 2 ALICE tutorial -T 1/T 2 ALICE tutorial Pablo. Saiz@cern. ch <#num> -Pablo. Saiz@cern. ch 5
Job Monitoring v. Display all the jobs submitted by a VO o. Follow the status of the jobs v. Collect information from different sources o. RGMA, IC Real Time Monitor, BDII, Mon. ALISA, … v. Very useful for VO managers, site admin, users v. Possibility to get the output in different formats Internet Services CERN IT Department CH-1211 Genève 23 Switzerland www. cern. ch/it v. Deployed for ALICE, ATLAS, CMS, LHCb and Vle. Med T 1/T 2 ALICE tutorial -- Pablo. Saiz@cern. ch 6
Job Monitoring Internet Services CERN IT Department CH-1211 Genève 23 Switzerland www. cern. ch/it T 1/T 2 ALICE tutorial -- Pablo. Saiz@cern. ch 7
Job Monitoring Internet Services CERN IT Department CH-1211 Genève 23 Switzerland www. cern. ch/it T 1/T 2 ALICE tutorial -- Pablo. Saiz@cern. ch 8
FTS reliability v. Daily report on the success of transfers v. Drill down list of errors v. Integrated in the ALICE environment v. Extremely useful during the different ALICE challenges: PDC 06, PDC 07, CRC 08 Internet Services CERN IT Department CH-1211 Genève 23 Switzerland www. cern. ch/it v. Working on making it generic T 1/T 2 ALICE tutorial -- Pablo. Saiz@cern. ch 9
FTS reliability Internet Services CERN IT Department CH-1211 Genève 23 Switzerland www. cern. ch/it T 1/T 2 ALICE tutorial -- Pablo. Saiz@cern. ch 10
SAM monitoring v. Service Availability Monitoring v. Clickable plots to drill down: v. Site availability Service tests v. Links to the SAM results Internet Services CERN IT Department CH-1211 Genève 23 Switzerland www. cern. ch/it v. Originally, only for CMS v. ATLAS requested a similar interface v. Ongoing work to make it generic T 1/T 2 ALICE tutorial -- Pablo. Saiz@cern. ch 11
SAM monitoring Internet Services CERN IT Department CH-1211 Genève 23 Switzerland www. cern. ch/it T 1/T 2 ALICE tutorial -- Pablo. Saiz@cern. ch 12
SAM monitoring Internet Services CERN IT Department CH-1211 Genève 23 Switzerland www. cern. ch/it T 1/T 2 ALICE tutorial -- Pablo. Saiz@cern. ch 13
Site Status Board v. Table with status of the different sites for CMS v. Easy definition of new ‘metrics’ o. The ‘metrics’ can come from different sources v. Links to more detailed information v. Gridmap view Internet Services CERN IT Department CH-1211 Genève 23 Switzerland www. cern. ch/it v. Used for CMS site commissioning and offline shifts o. Deployed for LHCb and ALICE T 1/T 2 ALICE tutorial -- Pablo. Saiz@cern. ch 14
Site Status Board Internet Services CERN IT Department CH-1211 Genève 23 Switzerland www. cern. ch/it T 1/T 2 ALICE tutorial -- Pablo. Saiz@cern. ch 15
Site Status Board Internet Services CERN IT Department CH-1211 Genève 23 Switzerland www. cern. ch/it T 1/T 2 ALICE tutorial -- Pablo. Saiz@cern. ch 16
SSB gridmap Internet Services CERN IT Department CH-1211 Genève 23 Switzerland www. cern. ch/it T 1/T 2 ALICE tutorial -T 1/T 2 ALICE tutorial Pablo. Saiz@cern. ch -Pablo. Saiz@cern. ch <#num> 17
ALICE SSB For the time being, only SAM tests and maintenance Internet Services CERN IT Department CH-1211 Genève 23 Switzerland www. cern. ch/it T 1/T 2 ALICE tutorial -- Pablo. Saiz@cern. ch 18
Conclusions v. Several applications provided v. Job Monitoring v. FTD/FTS reliability v. SAM framework v. Site Status Board http: //dashboard. cern. ch /alice Internet Services CERN IT Department CH-1211 Genève 23 Switzerland www. cern. ch/it T 1/T 2 ALICE tutorial -- Pablo. Saiz@cern. ch 19
- Slides: 19