Bridging EGEE to BOINC and Xtrem Web GIN
Bridging EGEE to BOINC and Xtrem. Web GIN : From interoperation to interoperability 1 The EDGe. S project receives Community research funding
Bridging EGEE to BOINC and Xtrem. Web GIN : From interoperation to interoperability Overview • • • Definitions : Service Grids and Desktop Grids Presentation of the EDGe. S project Bridge BOINC EGEE Bridge Xtrem. Web EGEE Bridge EGEE BOINC Bridge EGEE Xtrem. Web Architecture of the EDGe. S 3 G Bridge Desktop Grid Production Infrastructure OGF standards used for future interoperability Authors : Z. Balaton, G. Caillat, Z. Farkas, G. Fedak, G. Gombas, P. Kacsuk, 2 A. Kornafeld, J. Kovacs, H. He, O. Lodygensky, A. Marosi, E. Urbah
Bridging EGEE to BOINC and Xtrem. Web SG = Service Grid = Managed grid of managed computing clusters VOMS Admin VOMS Server X 509 proxy with VOMS extensions Accounting Logging & Bookkeeping Submits Job with X 509 proxy Sends back Output Sandbox Log Meta-scheduler (WMS) Gives Accounting and Auditing Manages Site Log Publishes available Resources Pushes Job Sends back Output Sandbox Accesses Data with X 509 proxy Authors : Site Admin Manages VO Gives Job Status Grid User Grid Admin Z. Balaton, G. Caillat, Z. Farkas, G. Fedak, G. Gombas, P. Kacsuk, 3 A. Kornafeld, J. Kovacs, H. He, O. Lodygensky, A. Marosi, E. Urbah Site Computing Resource Accesses Data with X 509 proxy Site Storage Resource
Bridging EGEE to BOINC and Xtrem. Web SG = Service Grid = Managed grid of managed computing clusters • Computing and Storage Resources are managed by trained staff inside Sites and are authenticated by X 509 certificates. • Users are authenticated by X 509 certificates or proxies. • Users belong to VOs and get a X 509 proxy from a VOMS server to : – Access data, – Submit jobs. • Executables are NOT authenticated. So trust is primarily between Sites and VOs. • Order of magnitude is typically 100 000 CPUs. • A meta-scheduler (WMS) pushes the jobs to resources with are both suitable and available. Examples : EGEE, Nordu. Grid, OSG, DEISA, … Authors : Z. Balaton, G. Caillat, Z. Farkas, G. Fedak, G. Gombas, P. Kacsuk, 4 A. Kornafeld, J. Kovacs, H. He, O. Lodygensky, A. Marosi, E. Urbah
Bridging EGEE to BOINC and Xtrem. Web DG = Desktop Grid = Loose grid scavenging idle resources Unit of Work = Application + Input Data Submits input data for an application Sends back results Requests Unit of Work Grid Server with Application Repository Grid User Sends Unit of Work Sends back results Computing Resource (often Desktop Computer) Owns Resource Certifies Application Accepts or Refuses an application on his resource Currently, for BOINC, both roles of ‘Application Manager’ and ‘Grid User’ are fulfilled by ‘BOINC Project Owners’. Authors : Application Manager Z. Balaton, G. Caillat, Z. Farkas, G. Fedak, G. Gombas, P. Kacsuk, 5 A. Kornafeld, J. Kovacs, H. He, O. Lodygensky, A. Marosi, E. Urbah Resource Owner (often volunteer)
Bridging EGEE to BOINC and Xtrem. Web DG = Desktop Grid = Loose grid scavenging idle resources • Computing and Storage Resources are owned by various Owners (it is often volunteer computing), but they are NOT managed and NOT authenticated. • Grid Servers are authenticated by a X 509 certificate. • Users are authenticated by the Grid Servers, but NOT by the Computing and Storage Resources. • Executables are certified by managers of the Grid Servers. So : – Resource Owners have to trust the Grid Servers, – BOINC sends each Work Unit to several Resource Owners, because BOINC does NOT fully trust them. • Order of magnitude can be 1 000 CPUs. • Starving Computing Resources pull Work Units from Grid Servers. Examples : BOINC, Xtrem. Web, x. Grid Authors : Z. Balaton, G. Caillat, Z. Farkas, G. Fedak, G. Gombas, P. Kacsuk, 6 A. Kornafeld, J. Kovacs, H. He, O. Lodygensky, A. Marosi, E. Urbah
Bridging EGEE to BOINC and Xtrem. Web Presentation of the EDGe. S project New FP 7 project started on 01/01/2008 WLCG (CERN) g. Lite (EGEE) • Integrate Service Grids and Desktop Grids • Enable very large number of computing resources (100 K-1 M processors) • Attract new scientific communities EDGe. S • Provide a Grid application development environment Current • Provide application repository and bridges for the execution in the SG-DG system Authors : ARC (Nordu. Grid) Boinc (Berkeley) VDT (OSG) Unicore (DEISA) Future Xtrem. Web (INRIA/IN 2 P 3) Z. Balaton, G. Caillat, Z. Farkas, G. Fedak, G. Gombas, P. Kacsuk, 7 A. Kornafeld, J. Kovacs, H. He, O. Lodygensky, A. Marosi, E. Urbah Xgrid (Apple)
Bridging EGEE to BOINC and Xtrem. Web Presentation of the EDGe. S project http: //www. edges-grid. eu Now, Interoperation : • Ad-hoc bridges and interfaces between EGEE, BOINC and Xtrem. Web. • A Mo. U between EDGe. S and EGEE has been signed on 23 Sept 2008. • Xtrem. Web users must have a X 509 certificate, be registered in a VO and submit their Jobs with a X 509 proxy. • BOINC Project Owners must have a X 509 certificate, be registered in a VO and store a medium-term X 509 proxy in a My. Proxy server. • All files must be transferred through the Input and Output sandboxes. In the future : • Interoperability using OGF standards, in order to bridge more Grids. • Better support of grid file access (GFAL, lcg_utils and Grid. FTP). Authors : Z. Balaton, G. Caillat, Z. Farkas, G. Fedak, G. Gombas, P. Kacsuk, 8 A. Kornafeld, J. Kovacs, H. He, O. Lodygensky, A. Marosi, E. Urbah
Bridging EGEE to BOINC and Xtrem. Web Bridge BOINC EGEE Work Unit BOINC jobwrapper client (simulating a large BOINC computing resource) BOINC Server 3 G jobwrapper Submission Credential access information 3 G jobwrapper Queue Manager & Job DB WUi+1 WUi+2 WUi+3 Config. file Grid Handler Interface 1 for each (BOINC server, BOINC Project Owner, EGEE VO) triple EDGe. S 3 G bridge Job Handler Interface BOINC Handler (WU = Work Unit) EGEE Plugin 1 for each (BOINC Project Owner, EGEE VO) pair Authors : WMS Jobi+1 Jobi+2 Short term X 509 proxy BOINC Project Owner EGEE Medium term X 509 proxy Z. Balaton, G. Caillat, Z. Farkas, G. Fedak, G. Gombas, P. Kacsuk, 9 A. Kornafeld, J. Kovacs, H. He, O. Lodygensky, A. Marosi, E. Urbah VOMS extensions VOMS Server My. Proxy
Bridging EGEE to BOINC and Xtrem. Web Bridge BOINC EGEE Solution = Inside EDGe. S bridge, marshalling of the BOINC Work Units into Job collections • For each (BOINC server, BOINC Project Owner, EGEE VO) triple, a separate Job Handler collects the BOINC Work Units and p lace them in a queue. • For each (BOINC Project Owner, EGEE VO) pair, a separate EGEE plugin : – Retrieves a short term X 509 Proxy for the BOINC Project Owner from a My. Proxy server, and VOMS extensions from a VOMS server, – Periodically processes new Work Units found in the queue : • It converts each Work Unit into an EGEE Job, • In order to reduce the usage of the EGEE WMS, it uses Collection possibilities of EGEE to submit many Jobs in one request described using JDL. Authors : Z. Balaton, G. Caillat, Z. Farkas, G. Fedak, G. Gombas, P. Kacsuk, 10 A. Kornafeld, J. Kovacs, H. He, O. Lodygensky, A. Marosi, E. Urbah
Bridging EGEE to BOINC and Xtrem. Web Bridge Xtrem. Web EGEE Pushes Pilot job EGEE VOMS Server g. Lite WMS Gives Pilot Job Status Submits mono-user Pilot Job with X 509 proxy Gives Pilot Job Status Mono-user Pilot Job Xtrem. Web Bridge X 509 proxy with VOMS extensions Requests User Jobs Submits User Job with X 509 proxy Xtrem. Web User Authors : Sends back Job Status and Results Sends User Jobs with X 509 proxy Manages User Job status Xtrem. Web Server Computing Element Requests only 1 User Job Sends 1 User Job with same X 509 proxy Sends back results directly Z. Balaton, G. Caillat, Z. Farkas, G. Fedak, G. Gombas, P. Kacsuk, 11 A. Kornafeld, J. Kovacs, H. He, O. Lodygensky, A. Marosi, E. Urbah
Bridging EGEE to BOINC and Xtrem. Web Bridge Xtrem. Web EGEE Solution = Xtrem. Web bridge : Gliding with a mono-user Pilot Job 1. A Xtrem. Web User submits to the Xtrem. Web server his User Job with a X 509 proxy. 2. At the request of the Xtrem. Web bridge, the Xtrem. Web server sends him the User Job with the X 509 proxy. 3. The Xtrem. Web bridge submits to a g. Lite WMS a mono-user Pilot Job with this X 509 proxy (job description in a JDL). 4. The g. Lite WMS pushes the Pilot Job to a Computing Element, which executes it. 5. The mono-user Pilot Job requests 1 User Job from the Xtrem. Web server, and stops itself if it receives none. 6. The Xtrem. Web server verifies that the requested User Job has a X 509 proxy, and sends the User Job and the X 509 proxy to the Pilot Job. 7. The Pilot Job verifies that the received X 509 proxy is the same as its own X 509 proxy, and executes the User Job. 8. At the end of the User Job, the Pilot Job sends the Job results directly to the Xtrem. Web server, then stops itself. Authors : Z. Balaton, G. Caillat, Z. Farkas, G. Fedak, G. Gombas, P. Kacsuk, 12 A. Kornafeld, J. Kovacs, H. He, O. Lodygensky, A. Marosi, E. Urbah
Bridging EGEE to BOINC and Xtrem. Web Bridge EGEE BOINC g. Lite WMS EGEE BDII Reports resources and performance Sends output Submits Job Logs events Pushes job EGEE LB Gets output Watches Logs events EGEE VOMS X 509 proxy with VOMS extensions Gets EXE EGEE User Authors : Information provider EDGe. S 3 G bridge LCG-CE for EDGe. S Queue Manager GRAM Job Manager for EDGe. S & Job DB Adds job Watches job Checks EXE EDGe. S Application Repository BOINC Computing Resource Z. Balaton, G. Caillat, Z. Farkas, G. Fedak, G. Gombas, P. Kacsuk, 13 A. Kornafeld, J. Kovacs, H. He, O. Lodygensky, A. Marosi, E. Urbah Generic Job WS Handler BOINC plugin (DCAPI) BOINC Service BOINC Server
Bridging EGEE to BOINC and Xtrem. Web Bridge EGEE BOINC Solution = Installation of a LCG-CE sending the EGEE Jobs to the EDGe. S bridge marshalling them into BOINC Works Units • Publish information to the BDII according to GLUE 1. 3 • EGEE producer – – – – • New GRAM job manager Gets job information from wrapper Checks if exe is validated in the EDGe. S application repository (GEMLCA) Checks if exe is supported by attached BOINC Gets files from WMS Adds job to 3 G bridge job Database Polls status of jobs in 3 G bridge job Database Gets results from 3 G bridge and uploads to LB BOINC plugin (DC-API) – – Authors : Use DC-API to generate BOINC WUs Jobs are read from the 3 G bridge DB 3 G DB entries are updated on events The plugin has already been implemented for the Cancer. Grid system Z. Balaton, G. Caillat, Z. Farkas, G. Fedak, G. Gombas, P. Kacsuk, 14 A. Kornafeld, J. Kovacs, H. He, O. Lodygensky, A. Marosi, E. Urbah
Bridging EGEE to BOINC and Xtrem. Web Bridge EGEE Xtrem. Web g. Lite WMS EGEE BDII Reports resources and performance Sends output Submits Job Logs events Pushes job EGEE LB Gets output Watches Logs events EGEE VOMS X 509 proxy with VOMS extensions Gets EXE EGEE User Authors : Information provider LCG-CE for Xtrem. Web GRAM Job Manager for Xtrem. Web Solution Inside a LCG-CE, installation of a GRAM jobmanager to marshal the EGEE Jobs into Xtrem. Web Jobs. Adds job Watches job Xtrem. Web Server Checks EXE EDGe. S Application Repository Z. Balaton, G. Caillat, Z. Farkas, G. Fedak, G. Gombas, P. Kacsuk, 15 A. Kornafeld, J. Kovacs, H. He, O. Lodygensky, A. Marosi, E. Urbah Xtrem. Web Computing Resource
Bridging EGEE to BOINC and Xtrem. Web Architecture of the EDGe. S 3 G Bridge Job with X 509 proxy BOINC Server WU BOINC Handlers EGEE g. Lite WMS LCG-CE for EDGe. S Handler for received jobs Generic handler for received jobs Authors : Job EGEE Handler Generic interface above grid plugins EDGe. S 3 G Bridge Job Database Scheduler Queue Manager Grid. Handler Interface User Work Unit Storage for received jobs Job Handler Interface Control path WU Job Grid plugin (submit jobs, update status, get output, . . . ) EGEE Plugins BOINC Plugins (DC-API) Job Z. Balaton, G. Caillat, Z. Farkas, G. Fedak, G. Gombas, P. Kacsuk, 16 A. Kornafeld, J. Kovacs, H. He, O. Lodygensky, A. Marosi, E. Urbah EGEE Control path Job WU WU Xtrem. Web Plugins g. Lite WMS BOINC Server Xtrem. Web Server
Bridging EGEE to BOINC and Xtrem. Web Desktop Grid Production Infrastructure Job EGEE User EDGe. S 3 G bridge BOINC plugin (DC-API) Public DG SZDG 30. 000 PCs Public DG Extremadura Grid 70. 000 PCs Local DG Uo. W Grid 1. 500 PCs Public DG EGEE@home Planned 10. 000 PCs BOINC Desktop Grids Authors : g. Lite WMS LCG-CE for BOINC Public DG Almere. Grid 3. 000 PCs Public DG EGEE Xtrem. Web 1. 000 PCs EGEE LCG-CE for Xtrem. Web Local DG IN 2 P 3 Grid 200 PCs Public DG INRIA Grid 300 PCs Xtrem. Web Desktop Grids Z. Balaton, G. Caillat, Z. Farkas, G. Fedak, G. Gombas, P. Kacsuk, 17 A. Kornafeld, J. Kovacs, H. He, O. Lodygensky, A. Marosi, E. Urbah
Bridging EGEE to BOINC and Xtrem. Web OGF standards used for future interoperability • GLUE 2. 0 in order to publish information to the BDII : Needs implementation by g. Lite. • BES to receive Job submissions : For example from Grid. Sphere Portal. • BES to submit Jobs : Needs availability of CREAM CE. • JSDL to describe Jobs : Needs implementation by g. Lite. Potentially : • AUTHZ for Authentication / Authorization • UR, RUS for Job logging and accounting • Byte. IO, SRM, Grid. FTP, DMI to manage data transfers • ACS for the GEMLCA application repository • SAGA, DRMAA for the methodology of application development Authors : Z. Balaton, G. Caillat, Z. Farkas, G. Fedak, G. Gombas, P. Kacsuk, 18 A. Kornafeld, J. Kovacs, H. He, O. Lodygensky, A. Marosi, E. Urbah
- Slides: 18