Using Grid Computing David Groep NIKHEF 2002 07
- Slides: 30
Using Grid Computing David Groep, NIKHEF 2002 -07 -15
The Grid, But Why? Physics @ CERN • LHC particle accellerator • operational in 2007 • 5 -10 Petabyte per year • 150 countries • > 10000 Users • lifetime ~ 20 years 40 M leve l 1 Hz (40 TB/ sec) s 75 K pecial ha Hz ( leve 75 G rdware 5 KH l 2 - em B/sec) bed z ( 5 leve GB/ ded l 3 sec) PC 10 (100 0 Hz s M B/se data c) r e c offli o ne a rding & naly sis
CPU & Data Requirements 5, 000 Estimated CPU Capacity at CERNat Estimated CPU capacity required CERN 4, 500 4, 000 K SI 95 3, 500 3, 000 2, 500 2, 000 1, 500 1, 000 Other experiments LHC experiments 500 0 Moore’s 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 year Jan 2000: 3. 5 K SI 95 law – some measure of the capacity technology advances provide for a constant number of processors or investment < 50% of the main analysis capacity will be at CERN
More Reasons Why ENVISAT • 3500 MEuro programme cost • 10 instruments on board • 200 Mbps data rate to ground • 400 Tbytes data archived/year • ~100 `standard’ products • 10+ dedicated facilities in Europe • ~700 approved science user projects
And More … Bio-informatics • For access to data –Large network bandwidth to access computing centers –Support of Data banks replicas (easier and faster mirroring) –Distributed data banks • For interpretation of data –GRID enabled algorithms BLAST on distributed data banks, distributed data mining
Common Ground • Large amounts of data • Distributed, ad-hoc user community • Problems are distributable • Need for resources grows faster than market • Network grows faster than the application needs • Willingness to share resources … • … if security and integrity is guaranteed
The One-Liner • Resource sharing and coordinated problem solving in dynamic multi-institutional virtual organisations
What is Grid computing? • Dependable, consistent and pervasive access • Combining resources from various organizations • `Virtual Organizations’ – user-based view on Grid • Technical challenges: – – transparent decisions for the user uniformity in access methods secure & crack resistant authentication, authorization, accounting (AAA) "a
Grid Middleware • Globus Project started 1997 • de facto-standard • Reference implementation of Gridforum standards • Large community effort • Basis of several projects, including EU-Data. Grid • Toolkit `bag-of-services' approach • Successful test beds, with single sign-on, etc…
In The Beginning • Distributed Computing – synchronous processing • High-Throughput Computing – asynchronous processing • On-Demand Computing – dynamic resources • Data-Intensive Computing – databases • Collaborative Computing – science Ian Foster and Carl Kesselman, editors, “The Grid: Blueprint for a New Computing Infrastructure, ” Morgan Kaufmann, 1999
Grid Architecture Make all resources talk standard protocols Promote interoperability of application toolkit, similar to interoperability of networks by Internet standards Applications Condor-G Application Toolkits DUROC MDS Replica. Srv Grid Services MPICH-G 2 Grid. FTP VLAM-G GRAM Grid Security Infrastructure (GSI) Condor MPI PBS Grid Fabric SUN Internet Linux
OGSA: new directions • Looks superficially like `web services’ • Based on common standards: – WSDL – SOAP – UDDI • Adds: – Transient services – State of distributed activities – Workflow, videoconf, distributed data analysis • Management of service instances • Grid Security Infrastructure
Looking for Resources • Resource Brokerage based on matchmaking (Condor) • Information Services Mesh – Meta-computing directory – Replica Catalogues Data. Grid http: //marianne. in 2 p 3. fr/
Submitting a Job
Locating a Replica • Grid Data Mirror Package • Moves data across sites • Replicates both files and individual objects • Catalogue used by Broker • Replica Location Service (giggle) • Read-only copies “owner” by the Replica Manager. http: //cmsdoc. cern. ch/cms/grid
Sending Your Data • Tape robots, disks, etc. share Grid. FTP interface • Supports single-sign-on and confidentiality • Optimize for high-speed >1 Gbit/s networks • In the future: automatic optimizations, bandwidth reservations, directory-enabled networking, …
Grid-enabled Databases? • Spit. Fire uniform access to persistent storage on the Grid • • Multiple roles support Compatible with GSI (single sign-on) though Co. G Uses standard stuff: JDBC, SOAP, XML Supports various back-end data bases http: //hep-proj-spitfire. web. cern. ch/hep-proj-spitfire/
Data. Grid Test Bed 1 • Data. Grid TB 1: – 14 countries – 21 major sites – Growing rapidly • Submitting Jobs: – Login only once, run everywhere – Cross administrative boundaries in a secure and trusted way – Mutual authorization
Dutch. Grid Platform • Dutch. Grid: – Test bed coordination – PKI security • Participation by Leiden Delft Amsterdam Enschede KNMI Utrecht Nijmegen NIKHEF: FOM, VU, Uv. A, Utrecht, Nijmegen KNMI, SARA AMOLF DAS-2 (ASCI): TUDelft, Leiden, VU, Uv. A, Utrecht Telematics Institute
And now for some Technical Details For Users
Start using the grid • All the necessary “client tools” are on all Linux and Solaris systems • You just need: – Credentials/tokens for the Grid (see next slides) – Authorization to use resources (you get all NIKHEF resources by default) – Information on which resources to use effectively
Your Grid Credentials • You will use resources across several domains – You may not care about security and authorization – But the remote site admin will ! • All communications are authenticated using X. 509 “Public Key” Certificates • The technology used to secure credit card transactions on the web (https: //……) • Uniquely binds name/affiliation to a digital token
Certification Authorities • CA’s act as trusted third parties • Remote sites trust the CA for a proper binding • They will not do authentication again, so only authorization left. • CA’s are highly valuable: crack one to impersonate others on the Grid (and abuse resources) • Registration Authorities do in-person ID checks
CA’s in Data. Grid • 10 National CA’s (one per EU country) • Each one has a detailed policy and practice statement • NIKHEF operates the CA for Dutch. Grid See http: //www. dutchgrid. nl/ca • Get a “certificate” from the Dutch. Grid CA before you can start using the Grid • It’s valuable, protect it with a pass phrase • One cert valid for all Data. Grid sites
The Proxy • A `proxy certificate’ is a limited-lifetime delegation without a pass phrase to protect it • Implements the single sign-on for Grid • Valid for 12 hours (by default) • Use it to: – Run your jobs – Get access to your data • Get it, by running grid-proxy-init
Now see for yourself
Getting a Certificate • Initialize your environment for the Grid • Use the Globus local guide from http: //www. dutchgrid. nl/Support/ • Send the result to ca@nikhef. nl you will be contacted by phone • Put the certificate (sent by mail) in your $HOME/. globus/usercert. pem • Or use the Web at http: //certificate. nikhef. nl/userhelp. html
Using the Grid • Request authorization: grid. support@nikhef. nl • Look what is out there using grid-info-search or http: //marianne. in 2 p 3. fr/datagrid/giis-browse. html • Try some local hosts: – bilbo, kilogram, triangel kilogram: davidg: 1009$ globus-job-run dommel. wins. uva. nl /usr/ucb/quota -v Disk quotas for random (uid 12 xxx): Filesystem usage quota limit timeleft files quota limit timeleft /home/random 13067 1500000 2000000 0 kilogram: davidg: 1010$ • Start running your analysis/MC/other jobs
Grid. FTP • Universal high-performance file transfer • Extends the FTP protocol with: – Single sign-on (GSI, GSSAPI, RFC 2228) – Parallel streams for speed-up – Striped access (ftp from multiple sites to be faster) • Clients: gsincftp, globus-url-copy.
What’s Next? • Some of the nice user-features to come: – Finding data files by characteristics (give me all golden decay’s) – Moving your job to where the data is – Automatic partitioning of jobs – Support true-interactive work – Better network utilisation (faster access to data) – ……… • If you are in the Data. Grid project, ask your WP leader for authorization in TB 1
- Servicdesk
- Utrecht theoretical physics
- Nikhef
- Lga vs pga socket
- Resource broker in grid computing
- Open grid service architecture
- Grid computing tutorial
- Disadvantages of grid computing
- Worldwide lhc computing grid
- Proc scaproc
- Sas grid computing
- Lgfl history of computing
- Conclusion of grid computing
- Derivatives of grid computing
- Grid and cloud computing lmu
- Grid computing ejemplos
- Challenges of grid computing
- Grid computing security
- Sas grid computing
- Introduction to grid computing
- Grid computing introduction
- Cern grid computing
- What is grid system
- Grid and cloud computing definition
- Grid computing infrastructure
- Globus grid computing
- Conventional computing and intelligent computing
- Expanding double brackets grid method
- Voegwoorde omdat
- Stompi oefeninge
- Voegwoord voorbeelde