The LHC Computing Grid: A World-Wide Computer Centre

The LHC Computing Grid: A World-Wide Computer Centre
For Olivier Dessibourg, Science Correspondent, Le Temps
Frédéric Hemmer, Deputy IT Department Head
April 20, 2007
Frédéric Hemmer, CERN, IT Department
The LHC Computing Grid – April 2007

The LHC Accelerator
The accelerator generates 40 million particle collisions (events) every second at the centre of each of the four experiments’ detectors.

LHC Data
• This rate is reduced by online computers that filter out a few hundred “good” events per second
• These events are recorded on disk and magnetic tape at 100-1,000 MegaBytes/sec
• ~15 PetaBytes per year for all four experiments
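As a rough consistency check on the figures above, the following sketch converts ~15 PetaBytes per year into an average sustained recording rate. It assumes decimal units (1 PB = 10^15 bytes) and that data is spread evenly over a full calendar year; neither assumption comes from the slides, and since the accelerator does not run all year the rate while running would be higher. The function and constant names are illustrative only.

```python
# Rough check: does ~15 PB/year fit the quoted 100-1,000 MB/s recording rate?
# Assumptions (not from the slides): decimal units, and data recorded
# evenly over a 365-day year.
PB = 10**15  # bytes in a petabyte (decimal)
MB = 10**6   # bytes in a megabyte (decimal)
SECONDS_PER_YEAR = 365 * 24 * 3600


def average_rate_mb_per_s(petabytes_per_year: float) -> float:
    """Average recording rate in MB/s for a given yearly data volume."""
    return petabytes_per_year * PB / SECONDS_PER_YEAR / MB


rate = average_rate_mb_per_s(15)
print(f"Average rate for 15 PB/year: {rate:.0f} MB/s")
```

Under these assumptions the yearly average falls inside the 100-1,000 MegaBytes/sec range quoted on the slide.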

The LHC Data Challenge
• The accelerator will be completed in 2007 and run for 10-15 years
• Experiments will produce about 15 million Gigabytes of data each year (about 20 million CDs!)
• LHC data analysis requires a computing power equivalent to ~100,000 of today's fastest PC processors
• Requires many cooperating computer centres, as CERN can only provide ~20% of the capacity
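The arithmetic behind these bullets can be sketched as follows. The CD capacity of about 700 MB and the use of decimal units are assumptions, not figures from the slides; the function names are hypothetical.

```python
# Back-of-the-envelope check of the "20 million CDs" and "~20%" figures.
# Assumptions (not from the slides): one CD holds about 700 MB,
# and 1 GB = 1,000 MB (decimal units).
CD_CAPACITY_GB = 0.7
DATA_PER_YEAR_GB = 15_000_000  # "about 15 million Gigabytes" per year
TOTAL_PROCESSORS = 100_000     # "~100,000 of today's fastest PC processors"
CERN_SHARE = 0.20              # "CERN can only provide ~20%"


def cds_needed(data_gb: float, cd_capacity_gb: float = CD_CAPACITY_GB) -> float:
    """Number of CDs required to hold data_gb gigabytes."""
    return data_gb / cd_capacity_gb


def processors_split(total: int, cern_share: float) -> tuple:
    """Return (processors at CERN, processors needed elsewhere)."""
    at_cern = round(total * cern_share)
    return at_cern, total - at_cern


print(f"CDs per year: {cds_needed(DATA_PER_YEAR_GB) / 1e6:.1f} million")
at_cern, elsewhere = processors_split(TOTAL_PROCESSORS, CERN_SHARE)
print(f"Processors at CERN: {at_cern}, at partner centres: {elsewhere}")
```

With a 700 MB disc the result is a little over 21 million CDs, in line with the slide's "about 20 million", and roughly 80,000 processor-equivalents have to come from outside CERN.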

Solution: the Grid
• Use the Grid to unite computing resources of particle physics institutes around the world
• The World Wide Web provides seamless access to information that is stored in many millions of different geographical locations
• The Grid is an infrastructure that provides seamless access to computing power and data storage capacity distributed over the globe

LCG Service Hierarchy
Tier-0 – the accelerator centre
• Data acquisition & initial processing
• Long-term data curation
• Data distribution to Tier-1 centres
Tier-1 – “online” to the data acquisition process, high availability
• Managed Mass Storage – grid-enabled data service
• All re-processing passes
• Data-heavy analysis
• National, regional support
Tier-1 centres
• Canada – TRIUMF (Vancouver)
• France – IN2P3 (Lyon)
• Germany – Karlsruhe
• Italy – CNAF (Bologna)
• Netherlands – NIKHEF/SARA (Amsterdam)
• Nordic countries – distributed Tier-1
• Spain – PIC (Barcelona)
• Taiwan – Academia Sinica (Taipei)
• UK – CLRC (Oxford)
• US – FermiLab (Illinois), Brookhaven (NY)
Tier-2 – ~100 centres in ~40 countries
• Simulation
• End-user analysis – batch and interactive
• Services, including Data Archive and Delivery, from Tier-1s

Distribution of Computing Services
[Charts: CPU, Disk, Tape]

LHC Computing Grid Project - a Collaboration
Building and operating the LHC Grid – a global collaboration between
• The physicists and computing specialists from the LHC experiments
• The national and regional projects in Europe and the US that have been developing Grid middleware
• The regional and national computing centres that provide resources for LHC
• The research networks
[Figure labels: Researchers; Computer Scientists & Software Engineers; Service Providers]

The Grid used by CERN and its partners
• The EGEE and OSG projects are the basis of the Worldwide LHC Computing Grid Project

The new European Network Backbone
• LCG working group with Tier-1s and national/regional research network organisations
• New GÉANT2 – research network backbone
• Strong correlation with major European LHC centres
• Swiss PoP at CERN

Impact of the LHC Computing Grid in Europe
• LCG has been the driving force for the European multiscience Grid EGEE (Enabling Grids for E-sciencE)
• EGEE is now a global effort, and the largest Grid infrastructure worldwide
• Co-funded by the European Commission (~130 M€ over 4 years)
• EGEE already used for >20 applications, including Bio-informatics, Education and Training, and Medical Imaging

The EGEE Project
• Infrastructure operation
  • Currently includes >200 sites across 40 countries
  • Continuous monitoring of grid services & automated site configuration/management
    http://gridportal.hep.ph.ic.ac.uk/rtm/
• Middleware
  • Production quality middleware distributed under business friendly open source licence
• User Support - Managed process from first contact through to production usage
  • Training
  • Documentation
  • Expertise in grid-enabling applications
  • Online helpdesk
  • Networking events (User Forum, Conferences etc.)
• Interoperability
  • Expanding interoperability with related infrastructures

Production service
Size of the infrastructure today:
• 237 sites in 45 countries
• ~36,000 CPU
• ~5 PB disk, + tape MSS
[Chart: CPU]

Workload
[Chart: Jobs run per month]
- 50-80000 jobs/day
- 19 M jobs run in 1st year
- ~8200 cpu-years (1/4 – 1/3 of available infrastructure – continuously)
- Non-LHC: ~10 k jobs/day, the level of the total in EGEE-1
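The workload figures can be cross-checked with a short sketch. It assumes the first year had 365 days and uses the ~36 000 CPU quoted on the "Production service" slide as the available capacity; pairing those two numbers is an assumption made here, not something the slides state. Function names are hypothetical.

```python
# Cross-check of the workload numbers quoted on the slide.
# Assumption (not from the slides): the "~36 000 CPU" figure from the
# production-service slide is used as the available capacity for the year.
JOBS_FIRST_YEAR = 19_000_000  # "19 M jobs run in 1st year"
CPU_YEARS_USED = 8200         # "~8200 cpu-years"
CPUS_AVAILABLE = 36_000       # "~36 000 CPU"


def jobs_per_day(jobs_per_year: float, days: int = 365) -> float:
    """Average number of jobs per day over one year."""
    return jobs_per_year / days


def utilisation(cpu_years: float, cpus: int, years: float = 1.0) -> float:
    """Fraction of the available CPU capacity that was consumed."""
    return cpu_years / (cpus * years)


print(f"Average jobs/day: {jobs_per_day(JOBS_FIRST_YEAR):,.0f}")
print(f"Utilisation: {utilisation(CPU_YEARS_USED, CPUS_AVAILABLE):.1%}")
```

The daily average comes out at around 52,000 jobs. Dividing by today's CPU count gives roughly 23%, slightly below the slide's 1/4 – 1/3; the infrastructure was presumably smaller earlier in the year, which would raise the true fraction.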

Use of the EGEE Infrastructure
>20 k jobs running simultaneously

Use for massive data transfer
Large LHC experiments now transferring ~1 PB/month each
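To put ~1 PB/month in network terms, the sketch below converts it to an average sustained rate. It assumes a 30-day month and decimal units, neither of which comes from the slides; real transfers are bursty, so peak rates would be higher. The function names are illustrative.

```python
# Convert a monthly transfer volume into an average sustained rate.
# Assumptions (not from the slides): a 30-day month, 1 PB = 10**15 bytes.
PB = 10**15
SECONDS_PER_MONTH = 30 * 24 * 3600


def average_mb_per_s(petabytes_per_month: float) -> float:
    """Average transfer rate in megabytes per second."""
    return petabytes_per_month * PB / SECONDS_PER_MONTH / 10**6


def average_gbit_per_s(petabytes_per_month: float) -> float:
    """Average transfer rate in gigabits per second."""
    return petabytes_per_month * PB * 8 / SECONDS_PER_MONTH / 10**9


print(f"1 PB/month is about {average_mb_per_s(1):.0f} MB/s "
      f"or {average_gbit_per_s(1):.2f} Gbit/s per experiment")
```

Under these assumptions each large experiment sustains a few gigabits per second on average.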

EGEE Grid Sites: Q1 2006
[Charts: sites, CPU]
EGEE: Steady growth over the lifetime of the project
EGEE: >180 sites, 40 countries, >24,000 processors, ~5 PB storage

Applications on EGEE
• More than 20 applications from 7 domains
  • Astrophysics
    • MAGIC, Planck
  • Computational Chemistry
  • Earth Sciences
    • Earth Observation, Solid Earth Physics, Hydrology, Climate
  • Financial Simulation
    • E-GRID
  • Fusion
  • Geophysics
    • EGEODE
  • High Energy Physics
    • 4 LHC experiments (ALICE, ATLAS, CMS, LHCb)
    • BaBar, CDF, DØ, ZEUS
  • Life Sciences
    • Bioinformatics (Drug Discovery, GPS@, Xmipp_MLrefine, etc.)
    • Medical imaging (GATE, CDSS, gPTM3D, SiMRI3D, etc.)
  • Multimedia
  • Material Sciences
  • …

Example: EGEE Attacks Avian Flu
• EGEE used to analyse 300,000 potential drug compounds against the bird flu virus, H5N1.
• 2000 computers at 60 computer centres in Europe, Russia, Taiwan and Israel ran for four weeks in April, the equivalent of 100 years on a single computer.
• Potential drug compounds now being identified and ranked
[Image: Neuraminidase, one of the two major surface proteins of influenza viruses, facilitating the release of virions from infected cells. Image courtesy Ying-Ta Wu, Academia Sinica.]
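The "100 years on a single computer" figure can be compared with the raw capacity of 2000 computers over four weeks. The sketch assumes four weeks means 28 days and a year means 365.25 days; the interpretation of the gap as partial utilisation is a guess, not something the slide states. Names are hypothetical.

```python
# Compare the quoted "100 years on a single computer" with the raw
# capacity of the machines involved.
# Assumptions (not from the slides): four weeks = 28 days,
# one year = 365.25 days.
COMPUTERS = 2000
RUN_DAYS = 28
DAYS_PER_YEAR = 365.25
QUOTED_SINGLE_COMPUTER_YEARS = 100


def raw_capacity_years(computers: int, days: float) -> float:
    """Computer-years available if every machine ran the whole time."""
    return computers * days / DAYS_PER_YEAR


capacity = raw_capacity_years(COMPUTERS, RUN_DAYS)
implied_share = QUOTED_SINGLE_COMPUTER_YEARS / capacity
print(f"Raw capacity: {capacity:.0f} computer-years")
print(f"Quoted figure corresponds to {implied_share:.0%} of that capacity")
```

The raw capacity is about 150 computer-years, so the quoted 100 years is consistent with machines that were shared with other work or not busy for the full four weeks.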

Example: Geocluster industrial application
• The first industrial application successfully running on EGEE
• Developed by the Compagnie Générale de Géophysique (CGG) in France, doing geophysical simulations for oil, gas, mining and environmental industries.
• EGEE technology helps CGG to federate its computing resources around the globe.

Evolution
[Diagram: European e-Infrastructure Coordination; EDG, EGEE-III; Testbeds, Routine Usage, Utility Service]
