Enabling Grids for E-sciencE: CESGA Status Report
Javier Lopez, Alvaro Simon, Esteban Freire / CESGA
SA3 All Hands Meeting, Barcelona
www.eu-egee.org
EGEE-II INFSO-RI-031688
EGEE and gLite are registered trademarks

Outline
• Main Achievements since November
• Work on Deliverables/Milestones
• Issues/Problems
• Next Steps
EGEE User Forum

Main achievements since November

Infrastructure
• New infrastructure based on virtual machines
• This is the new common infrastructure for the SA3, PPS and Production testbeds @CESGA
• Frontend:
  – 1 x Dell PowerEdge 2950: 1 TB RAID 5 storage (golden images of all the services)
• Virtual Machines:
  – 4 x 10 Dell PowerEdge 1955: quad-core processors
• For SA3 services we use HVM machines: they allow us to run kernel 2.4 without modifying the OS

Infrastructure
• Advantages
  – We can increase or decrease the capacity on demand
  – Easy to test new releases in a clean environment
  – Failing upgrades can be rolled back using the LVM snapshot capability
• We have produced a document explaining our infrastructure:
  – https://swe-wiki.egee.cesga.es/cgibin/moin.cgi/XEN3_Virtual_Machines_-_CESGA-EGEE
  – More detailed documents are also available on request
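
The snapshot-based roll-back listed among the advantages can be sketched as follows. This is a minimal illustration, not the procedure used at CESGA: it assumes the virtual machine disks are LVM logical volumes on the host and that the installed LVM2 supports `lvconvert --merge`. The volume group and volume names (`vg_xen`, `sa3_ce`, `pre_upgrade`) and the helper functions are hypothetical. The functions only build the command lines; they do not run them.

```python
def snapshot_cmd(vg, lv, snap_name, size="2G"):
    """Build the command that snapshots a VM disk before an upgrade."""
    return ["lvcreate", "--snapshot", "--name", snap_name,
            "--size", size, f"/dev/{vg}/{lv}"]


def rollback_cmd(vg, snap_name):
    """Build the command that merges the snapshot back into its origin,
    discarding the changes made since the snapshot was taken."""
    return ["lvconvert", "--merge", f"/dev/{vg}/{snap_name}"]


def discard_cmd(vg, snap_name):
    """Build the command that drops the snapshot after a successful upgrade."""
    return ["lvremove", "--force", f"/dev/{vg}/{snap_name}"]


if __name__ == "__main__":
    # Hypothetical names, for illustration only.
    print(" ".join(snapshot_cmd("vg_xen", "sa3_ce", "pre_upgrade")))
    print(" ".join(rollback_cmd("vg_xen", "pre_upgrade")))
    print(" ".join(discard_cmd("vg_xen", "pre_upgrade")))
```

In this sketch the virtual machine would be stopped before the merge, since the origin volume cannot be merged while it is in use.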

SGE
• NOTE: This task is a joint effort between IC, LIP and CESGA
• Integration in LCG CE ready
  – RPM packages tested
  – YAIM scripts developed
  – Documentation updated
  – Ready for certification
• Re-distribution of Grid Engine:
  – Reviewed the license and sent it to the SA3 list for a second review
  – Re-distribution is allowed

Testing SGE
• Based on the Torque/Maui tests developed @GRNET
• Adapting the scripts
• Preliminary results available
  – Very slow submission
• Optimizing SGE configuration
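
To make "very slow submission" measurable, a submission test can time each submit call and report a rate. The sketch below is an assumption about how such a measurement could look, not the GRNET test scripts. `time_submissions` and `summarize` are hypothetical helpers, and the `submit` callable stands in for whatever command the site actually uses to submit a job.

```python
import statistics
import time


def time_submissions(submit, n_jobs):
    """Call submit() n_jobs times and return each call's latency in seconds."""
    latencies = []
    for _ in range(n_jobs):
        start = time.perf_counter()
        submit()
        latencies.append(time.perf_counter() - start)
    return latencies


def summarize(latencies):
    """Reduce a list of latencies to a few comparable figures."""
    total = sum(latencies)
    return {
        "jobs": len(latencies),
        "mean_s": statistics.mean(latencies),
        "max_s": max(latencies),
        "jobs_per_min": 60 * len(latencies) / total if total > 0 else float("inf"),
    }


if __name__ == "__main__":
    # Stand-in for a real submission, used only to exercise the timing code.
    def fake_submit():
        time.sleep(0.01)

    print(summarize(time_submissions(fake_submit, 5)))
```

Running the same measurement before and after a configuration change gives a like-for-like comparison of the submission rate.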

SGE on gLite CE
• IP ready (same as in LCG)
• Meeting with BLAH developer (David Rebatto) to understand the work required
• Required scripts are being developed @IC
• Testing will be done @CESGA
• APEL ready (Dave Kant)

Assigned Tasks
• Task #4759: Testing SGE
  – In progress
• Task #4600: Provide updated RPMs for SGE jobmanager and installation guide
  – Ready for certification

Issues/Problems
• Job submission tests:
  – Preliminary results show that the default SGE configuration needs to be optimized
• Improve the SGE configuration so that stdout and stderr files are sent back to the CE
• Modifications are required to run SGE on para-virtual machines
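
The stdout/stderr issue above concerns the SGE and jobmanager configuration, which is not shown here. As a related illustration only, the sketch below builds a `qsub` command line that sets the output and error paths explicitly with the standard `-o` and `-e` options. The helper name, job name and directory are hypothetical.

```python
import posixpath


def qsub_args(script, job_name, out_dir):
    """Build a qsub command line with explicit stdout and stderr paths."""
    out_path = posixpath.join(out_dir, f"{job_name}.out")
    err_path = posixpath.join(out_dir, f"{job_name}.err")
    return ["qsub", "-N", job_name, "-o", out_path, "-e", err_path, script]


if __name__ == "__main__":
    # Hypothetical job name and directory, for illustration only.
    print(" ".join(qsub_args("job.sh", "test1", "/tmp/jobs")))
```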

Next Steps
• SGE is working on an lcg-CE. Next step: CERTIFICATION
  – Add RPMs to the SA3 repository
  – Integrate the SGE YAIM scripts
• Tests for SGE lcg-CE (later they will be reused for glite-CE)
• SGE on glite-CE: work on integrating BLAH support has started
  – Other local middleware elements (GIIS, YAIM) remain basically unchanged for this glite-CE flavour
  – APEL ready
• Support for external SGE_QMASTER (IC and CESGA use this type of configuration in production)
• GridICE sensors for SGE
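
For the external qmaster case, a configuration script first needs to know whether the qmaster runs on the CE itself or on another host. The sketch below shows one way to check this, assuming the usual Grid Engine layout in which `$SGE_ROOT/$SGE_CELL/common/act_qmaster` holds the host name of the active qmaster. The function names are hypothetical, and comparing only short host names is a simplification.

```python
import os
import socket


def active_qmaster(sge_root, sge_cell="default"):
    """Return the host name stored in the cell's act_qmaster file."""
    path = os.path.join(sge_root, sge_cell, "common", "act_qmaster")
    with open(path) as handle:
        return handle.read().strip()


def qmaster_is_external(sge_root, sge_cell="default", local_host=None):
    """Return True when the active qmaster is not the local host.

    Only the short host names are compared (simplification).
    """
    local = (local_host or socket.getfqdn()).lower()
    master = active_qmaster(sge_root, sge_cell).lower()
    return master.split(".")[0] != local.split(".")[0]


if __name__ == "__main__":
    root = os.environ.get("SGE_ROOT")
    cell = os.environ.get("SGE_CELL", "default")
    if root:
        print("external qmaster:", qmaster_is_external(root, cell))
    else:
        print("SGE_ROOT is not set")
```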

References
• Xen Virtualization @CESGA
  – https://swe-wiki.egee.cesga.es/cgibin/moin.cgi/XEN3_Virtual_Machines_-_CESGA-EGEE
• SGE Wiki Page
  – https://twiki.cern.ch/twiki/bin/view/LCG/ImplementationOfSGE