Enabling Grids for Escienc E Cream pilot service
Enabling Grids for E-scienc. E Cream pilot service Status and short-term plans Antonio Retico GDB 11 -Mar-09 - CERN www. eu-egee. org EGEE-III INFSO-RI-222667 EGEE and g. Lite are registered trademarks
Agenda Enabling Grids for E-scienc. E • Description of CREAM pilot (Phase 2) – Objectives – Partners – Access info Good Afternoon! • Recent history – – Development Deployment Operational tools Users • Next Steps EGEE-III INFSO-RI-222667 GDB - 1 Mar 09 - CERN 2
Agenda Enabling Grids for E-scienc. E • Description of CREAM pilot (Phase 2) – Objectives – Partners – Access info EGEE-III INFSO-RI-222667 GDB - 1 Mar 09 - CERN 3
Objectives of phase 2 Enabling Grids for E-scienc. E • Focus on ICE submission – A good number of CE needed to test scalability • Integration of operational tools EGEE-III INFSO-RI-222667 GDB - 1 Mar 09 - CERN 4
Partners (phase 2) Enabling Grids for E-scienc. E • Coordination: Antonio Retico (CERN) • JRA 1: Massimo Sgaravatto (INFN-PADOVA) – Development, support • SA 3: Alessio Gianelle (INFN-PADOVA), Gianni Pucciani (CERN) • SA 1: Angela Poschlad (FZK), Christian Neissner (PIC), Daniele Cesini (CNAF), Sara Bertocco (INFN-PADOVA) Danilo Dongiovanni (CNAF) • CMS: Andrea Sciaba’, Enzo Miccio • Alice: Patricia Mendez EGEE-III INFSO-RI-222667 GDB - 1 Mar 09 - CERN 5
Access info Enabling Grids for E-scienc. E • Home Page – https: //twiki. cern. ch/twiki/bin/view/LCG/Pps. Pilot. Cream • Contacts – egee-pps-pilot-cream@cern. ch EGEE-III INFSO-RI-222667 GDB - 1 Mar 09 - CERN 6
Agenda Enabling Grids for E-scienc. E • Recent history – – Development Deployment Operational tools Users EGEE-III INFSO-RI-222667 GDB - 1 Mar 09 - CERN 7
ICE CREAM submission Enabling Grids for E-scienc. E • high failure rate observed, analysed and causes fixed. – Now the system sustains correctly a submission rate of 40 jobs/min. – Performance issues with long lasting jobs due to the way proxy renewal is handled in ICE (bug #47911) • However better that the version currently in PROD EGEE-III INFSO-RI-222667 GDB - 1 Mar 09 - CERN 8
Deployment status Enabling Grids for E-scienc. E • # of CREAM instances in production growing faster – – ~20 nodes Effect of SA 1 recommendations Pro: direct pilot coordination not needed for that Con: the version of CREAM deployed has known flaws interacting with ICE • Pilot WMS for CMS already reconfigured to use production CREAM • What about PIC? What about FZK? – PIC publishing now pilot version both in PPS and production – FZK maintains two separate instances at two different versions EGEE-III INFSO-RI-222667 GDB - 1 Mar 09 - CERN 9
SAM and Nagios Enabling Grids for E-scienc. E • SAM – lcg-CE sensor cloned for CREAM – Direct submission tests added – Results available for PPS https: //ppssam. cern. ch: 8443/sam. py • Nagios tests under development EGEE-III INFSO-RI-222667 GDB - 1 Mar 09 - CERN 10
Users Enabling Grids for E-scienc. E • Alice working with both production and PPS CREAMs • CMS now testing ICE – Difficult interaction with PPS BDII § Need to publish PPS CEs in production § Not all sites like it EGEE-III INFSO-RI-222667 GDB - 1 Mar 09 - CERN 11
Next steps Enabling Grids for E-scienc. E • Bring CREAMs in pilot and production to run roughly at the same version • Move PIC and FZK from pilot mode production mode • Close phase 2 • Open phase 3 – CMS testing “pilot” ICE WMS using production instances of CREAM – INFN testbed. B (~ 30 CEs): Shall we use them to test Nick’s performance criteria? § Could be done by SA 3 using the pilot infrastructure at PADOVA for a fixed time-slot § Focus on fixing bug #47911: Performance problems in ICE when there many (thousands) proxy files to manage EGEE-III INFSO-RI-222667 GDB - 1 Mar 09 - CERN 12
A new version for production Enabling Grids for E-scienc. E • Set of patches identified that: – Fixes known weaknesses of the ICE+CREAM system currently in production – Corresponds to a version of Cream validated by Alice in the pilot – Fixes part of the performance issues experienced so far in ICE • Accelerated certification/deployment for these patches – functionality tests (e. g. SAM , Nagios) – not to insist on performance tests (the results are known) – improves the ice cream submission chain currently available EGEE-III INFSO-RI-222667 GDB - 1 Mar 09 - CERN 13
Candidates for production Enabling Grids for E-scienc. E • patch #2845: Second update of CREAM and CEMon Clients for slc 4/i 386 platform – Ready for integration • patch #2459: First update of ICE – Update of ICE, to be installed "on top" of WMS patch #2562 – Ready for integration but suffers for new bug #47996: Apparent database corruption when ICE exits fixed by developers – Probably will be obsoleted and released again • patch #2748: Third update of CREAM CE for slc 4/i 386 platform – various fixes for CREAM e BLAH - affecting the CE – With Provider expected within this week • patch #2750: YAIM-CREAM-CE 4 th update – Certified • patch #2830: [ YAIM-WMS ] New yaim-wms to properly configure the ICE section – Ready for integration EGEE-III INFSO-RI-222667 GDB - 1 Mar 09 - CERN 14
Questions? Enabling Grids for E-scienc. E ? EGEE-III INFSO-RI-222667 GDB - 1 Mar 09 - CERN 15
- Slides: 15