The LHC and grids Welcome to this presentation! We’re so glad you came. While you’re here, you can explore many questions: What is CERN and the Large.

Slides:



Advertisements
Similar presentations
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Why Grids Matter to Europe Bob Jones EGEE.
Advertisements

Particle physics – the computing challenge CERN Large Hadron Collider –2007 –the worlds most powerful particle accelerator –10 petabytes (10 million billion.
Fighting Malaria With The Grid. Computing on The Grid The Internet allows users to share information across vast geographical distances. Using similar.
An overview of the EGEE project Bob Jones EGEE Technical Director DTI International Technology Service-GlobalWatch Mission CERN – June 2004.
An exciting Opportunity to…. Enhance your A Level studies Develop Key Communication Skills Make personal links with Research staff at a leading University.
Randall Sobie The ATLAS Experiment Randall Sobie Institute for Particle Physics University of Victoria Large Hadron Collider (LHC) at CERN Laboratory ATLAS.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
Searching for the Higgs – spearheading grid Tara Shears University of Liverpool.
The LHC Computing Grid – February 2008 The Worldwide LHC Computing Grid Dr Ian Bird LCG Project Leader 15 th April 2009 Visit of Spanish Royal Academy.
Computer Science Prof. Bill Pugh Dept. of Computer Science.
Data mining and statistical learning: lecture 1a Statistics and computer science for a data-rich world.
From the Web to the Grid – Feb TIFRAC – The Tata Institute of Fundamental Research Automatic Calculator The first full-scale, general purpose,
Beyond the Human Genome Project Future goals and projects based on findings from the HGP.
Welcome to this presentation! We’re so glad you came. While you’re here, you can explore many questions: What is CERN and the Large Hadron Collider (LHC)?
What is Cyberinfrastructure? Russ Hobby, Internet2 Clemson University CI Days 20 May 2008.
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
INFSO-RI Enabling Grids for E-sciencE V. Breton, 30/08/05, seminar at SERONO Grid added value to fight malaria Vincent Breton EGEE.
Introduction to Complexity Science Engineered Complexity.
To the Grid From the Web. From the Web to the Grid – 2007 Why was the Web invented at CERN? Science depends on free access to information and exchange.
… where the Web was born 11 November 2003 Wolfgang von Rüden, IT Division Leader CERN openlab Workshop on TCO Introduction.
Bob Jones Technical Director CERN - August 2003 EGEE is proposed as a project to be funded by the European Union under contract IST
To the Grid From the Web Dr. Francois Grey IT Department, CERN.
…building the next IT revolution From Web to Grid…
+ Big Data IST210 Class Lecture. + Big Data Summary by EMC Corporation ( More videos that.
Les Les Robertson LCG Project Leader High Energy Physics using a worldwide computing grid Torino December 2005.
Experimental Particle Physics Do you want to discover… What is the origin of mass ? Discover the Higgs boson with ATLAS Why is there more matter than anti-matter.
Your university or experiment logo here What is it? What is it for? The Grid.
Presentation of the A particle collision = an event Physicist's goal is to count, trace and characterize all the particles produced and fully.
| nectar.org.au NECTAR TRAINING Module 2 Virtual Laboratories and eResearch Tools.
Cyberinfrastructure Overview Russ Hobby, Internet2 ECSU CI Days 4 January 2008.
Dr. Andreas Wagner Deputy Group Leader - Operating Systems and Infrastructure Services CERN IT Department The IT Department & The LHC Computing Grid –
LHC Computing, CERN, & Federated Identities
M.C. Vetterli; SFU/TRIUMF Simon Fraser ATLASATLAS SFU & Canada’s Role in ATLAS M.C. Vetterli Simon Fraser University and TRIUMF SFU Open House, May 31.
Meeting with University of Malta| CERN, May 18, 2015 | Predrag Buncic ALICE Computing in Run 2+ P. Buncic 1.
LHC collisions rate: Hz New PHYSICS rate: Hz Event selection: 1 in 10,000,000,000,000 Signal/Noise: Raw Data volumes produced.
Hans Hoffmann, Director Technology Transfer & Scientific Computing, ( ) Why CERN provides a unique IT challenge for industry: from the Web to the.
ScotGRID is the Scottish prototype Tier 2 Centre for LHCb and ATLAS computing resources. It uses a novel distributed architecture and cutting-edge technology,
CompSci 280 S Introduction to Software Development
Semantic Web - caBIG Abstract: 21st century biomedical research is driven by massive amounts of data: automated technologies generate hundreds of.
2nd GEO Data Providers workshop (20-21 April 2017, Florence, Italy)
Introducing High Energy Physics Computing to the Secondary Classroom
Grid Computing: Running your Jobs around the World
How to build a particle hunter
Objectives Overview Explain why computer literacy is vital to success in today’s world Define the term, computer, and describe the relationship between.
Introduction to CERN F. Hahn / CERN PH-DT1 10. May 2007.
Clouds , Grids and Clusters
Volunteer Computing for Science Gateways
Grid site as a tool for data processing and data analysis
PROGRAMME 10:00 Introduction to presentations and tour (10‘) Francois Grey  10:10 CERN openlab student programme - CERN opencluster (05')    Stephen Eccles 
The LHC Computing Grid Visit of Mtro. Enrique Agüera Ibañez
Access Grid and USAID November 14, 2007
National e-Infrastructure Vision
A. Rama Bharathi Regd. No: 08931F0040 III M.C.A
Dagmar Adamova, NPI AS CR Prague/Rez
The LHC Computing Grid Visit of Her Royal Highness
Tour of CERN Computer Center
Dagmar Adamova (NPI AS CR Prague/Rez) and Maarten Litmaath (CERN)
Building a UK Computing Grid for Particle Physics
EUChinaGRID Applications
A high-performance computing facility for scientific research
Tour of CERN Computer Center
CERN, the LHC and the Grid
Grid Computing Done by: Shamsa Amur Al-Matani.
WIS Strategy – WIS 2.0 Submitted by: Matteo Dell’Acqua(CBS) (Doc 5b)
Particle Physics Theory
From the Web to the Grid – Feb 2006
Grid Application Model and Design and Implementation of Grid Services
The Worldwide LHC Computing Grid
Life Sciences Solutions
The LHC Computing Grid Visit of Professor Andreas Demetriou
Presentation transcript:

The LHC and grids Welcome to this presentation! We’re so glad you came. While you’re here, you can explore many questions: What is CERN and the Large Hadron Collider (LHC)? What is the Worldwide LHC Computing Grid? What can YOU do with grid computing?

Grid computing Now, let’s choose a beginning… Let’s start with the world’s largest machine: the Large Hadron Collider

What is the LHC? Millions of people from all over the world watched when the LHC was switched on in 2008. With its 27-km circumference, the LHC is the largest machine in the world. LHC stands for Large Hadron Collider. Particle physicists are using the LHC to collide protons at very high energies, aiming to learn more about the Universe, The LHC operates at about – 3000C, just above absolute zero, and speeds particles to almost the speed of light. The LHC involves four experiments, with detectors as ‘big as cathedrals’: ALICE, ATLAS, CMS and LHCb

Why the LHC? The Large Hadron Collider takes physics out of the textbooks and in to real life. By smashing particles together, the LHC is helping physicists to: - create new particles and identify their components - reveal the nature of the interactions between particles - create an environment similar to the one present at the origin of our Universe: the Big Bang

But what for? To answer fundamental questions about the Universe: How did the Universe begin? What is the origin of mass? What is the nature of antimatter? Can we find the mysterious “God” particle: the missing Higgs boson?

Where is the LHC? Mont Blanc, 4810 m Downtown Geneva

What is CERN? CERN is the world's largest particle physics centre, and home to the LHC More than 2500 staff scientists (physicists, engineers, and more) work at CERN, with some 6500 scientists visiting at any time. CERN brings together people from 500 universities representing 80 nationalities.

A little off topic, but… Ever heard of the Web? The World Wide Web was invented at CERN, to improve information sharing between physicists working all over the world!

What is CERN? CERN has made many important discoveries, but our current understanding of the Universe is still incomplete! To answer questions still open, physicists around the world have built the Large Hadron Collider (LHC) If the “Higgs boson” particle exists, the LHC will almost certainly find it

Finding the Higgs One way to find the Higgs boson is to look at what happens to particles after they collide at high energies inside the LHC. Physicists count, trace and characterize all the particles produced and fully reconstruct the process. If they can find a characteristic decay pattern producing 4 “muon” particles, they know they’re on the trail of the Higgs!

Finding the Higgs Starting from this event… Selectivity: 1 in 1013 Like looking for 1 person in a thousand world populations! Or for a needle in 20 million haystacks! Starting from this event… Physicists are looking for this “signature”

1 Megabyte (1MB) A digital photo 1 Gigabyte (1GB) = 1000MB A DVD movie 1 Terabyte (1TB) = 1000GB World annual book production 1 Petabyte (1PB) = 1000TB Annual production of one LHC experiment 1 Exabyte (1EB) = 1000 PB World annual information production Data, data, data!! But, the LHC produces 40 million collisions per second This is filtered down to 100 interesting collisions per second Each collision produces about one Megabyte of data = recording rate of 0.1 Gigabytes/sec 1010 collisions recorded each year = 10 Petabytes/year of data, plus analysis data! This is a MASSIVE data-handling challenge! ALICE CMS LHCb ATLAS

Data, data, data!! 15 Petabytes Where will we store all of these data? Balloon (30 Km) Data, data, data!! CD stack with 1 year LHC data! (~ 20 Km) The LHC will produce around 15 Petabytes of data every year! That’s about 20 million CDs each year! Concorde (15 Km) Where will we store all of these data? Mt. Blanc (4.8 Km)

Where will we find such computing power? Data, data, data!! LHC data analysis requires a computing power equivalent to ~ 100,000 of today's fastest PC processors! Where will we find such computing power?

CERN Computer Centre Nowhere near enough! High-throughput computing based on reliable “commodity” technology More than 5000 PCs with around 20,000 processor cores More than 8 petabytes (8 million Gigabytes) of disk storage and 18 petabytes of magnetic tape storage Nowhere near enough!

LHC Computing Problem: CERN alone can provide only a fraction of the resources necessary to crunch all that LHC data. Solution: Connect all the particle physics computing centers, uniting the computing resources of particle physicists in the world!  

Computing for the LHC: a problem? Grid computing: the solution!

LHC Computing Grid Mission: Strategy: Results: Install a functioning grid to help the LHC experiments collect and analyse data coming from the detectors Strategy: Integrate thousands of computers at hundreds of participating institutes worldwide into a global computing resource. Results: The Worldwide LCG launched in October 2008 with more than 100,000 processors from 140 institutions in 33 countries, producing a massive distributed supercomputer that will provide more than 7000 physicists around the world with near real-time access to LHC data, and the power to process it.

WLCG challenges The Worldwide LHC Computing Grid is able to: share data between thousands of scientists with multiple interests link major computer centres, not just PCs ensure all data accessible anywhere, anytime grow rapidly, yet remain reliable for more than a decade cope with different management policies of different centres ensure data security: more is at stake than just money! be up and running anytime the LHC is producing data

What is grid computing? But wait… Just to recap, what’s the difference between the World Wide Web and grid computing? The World Wide Web provides seamless access to information that is stored in many millions of different geographical locations Grid computing provides seamless access to computing power and data storage capacity distributed over the globe.

How do grids work? Grids rely on advanced software, called middleware, to ensure seamless communication between different computers and different parts of the world Grid search engines not only find the data a scientist needs, but also the data processing techniques and the computing power to carry out data analysis Grids can distribute computing tasks to wherever in the world there is spare capacity, and send the result to the scientists

How do grids work? Imagine you’re a scientist. Your grid middleware: Finds convenient places for the your “jobs” (computing tasks) to run Optimises use of globally dispersed computing resources Organises efficient access to scientific data Deals with authentication at different sites you are using Interfaces to local site authorisation and resource allocation policies Runs the jobs Monitors progress Recovers from problems … and …. Tells you when the work is complete and sends your results back!

Who else uses grids? Computational scientists & engineers: large scale modeling of complex structures Experimental scientists: storing and analyzing large data sets Collaborations: large scale multi-institutional projects Corporations: global enterprises and industrial partnership Environmentalists: climate monitoring and modeling Trainers & educators: virtual learning rooms and laboratories Computational scientists & engineers: systems and machines being developed now require very large scale modelling in their design and development. Experimental scientists: access to large data resources and computational power anywhere is the world. Collaborations: large scale projects often involve sites in different locations or different organisations. Corporations: global enterprises and large corporations have distributed sites, data, people and computational resources. Environmentalists: climatic monitoring and modeling require large amounts of data, collected and stored in different sites around the world, and huge computing power. Training & education: Society will benefit, with creation of virtual lecture rooms, distributed classes, resources and tutors.

Grid benefits More effective and seamless collaboration of dispersed communities, both scientific and commercial Ability to run large-scale applications comprising thousands of computers, for wide range of applications Transparent access to distributed resources from your desktop, or even your mobile phone The term “e-Science” has been coined to express these benefits

Enabling Grids for E-Science Creating a global grid for global e-science Mission: Deliver 24/7 grid service to European science; re-engineer grid middleware for production; market grid solutions to different scientific communities Results: EGEE currently involves more than 240 institutions in 45 countries, supporting science in more than 20 disciplines, including bioinformatics, medical imaging, education, climate change, energy, agriculture and more.

The EGEE Vision Grid computing is already changing the way science and much else is done around the world. What will the future hold? An international network of scientists will be able to model a new flood of the Danube in real time, using meteorological and geological data from several centers across Europe. A team of engineering students will be able to run the latest 3D rendering programs from their laptops using the Grid. A geneticist at a conference, inspired by a talk she hears, will be able to launch a complex biomolecular simulation from her mobile phone.

Grids in science There’s so much that grids can already do! Here are some examples in: Medicine (imaging, diagnosis and treatment ) Bioinformatics (study of the human genome and proteome to understand genetic diseases) Nanotechnology (design of new materials from the molecular scale) The environment (weather forecasting, earth observation, modeling and prediction of complex systems)

Medicine Digital image archives Collaborative virtual environments “Grids will enable a standardized, distributed digital mammography resource for improving diagnostic confidence" Digital image archives Collaborative virtual environments On-line clinical conferences “Grids make it possible to use large collections of images in new, dynamic ways, including medical diagnosis.” Medical/Healthcare Digital image archive systems, incorporating electronic images into patient records, are becoming widespread in hospitals. Two demonstrators show how images from selected categories of patients can be assembled from a variety of sources accessed across the Grid and can be used to compare one patient's set of images against the norm, whether it be for brain scans or mammograms. A third demonstrator shows how the Grid can be exploited to construct collaborative virtual environments for the visualisation of large volumes of data. Yet another project shows how the Grid can be used to facilitate on-line clinical conferences between consultants, providing them with all the detailed images and case notes for their patients and for similar cases, thus avoiding unproductive time while travelling between clinics and speeding up patient diagnoses. Wider benefits include on-the-fly customisation of images that assist in a critical task, e.g. surveillance, video security and satellite imaging, and the design of novel algorithms to extract meaningful features from data. “The ability to visualise 3D medical images is key to the diagnosis of pathologies and pre-surgical planning”

Bioinformatics Capturing the complex and evolving patterns of genetic information, determining the development of an embryo Understanding the genetic interactions that underlie the processes of life-form development, disease and evolution. “Every time a new genome is sequenced the result is compared in a variety of ways with other genomes. Each code is made of 3.5 billion pairs of chemicals…” Many technological advances centre on the availability of new and 'better' materials, with specific combinations of materials properties suited to the application, or on capturing the complex and evolving patterns of genetic information determining the development of an embryo. Archival, retrieval and visualisation of vast amounts of data, often stored in differing formats in distributed databases, is key to identifying these new materials and understanding the genetic interactions that underlie the processes of life-form development, disease and evolution. The Grid provides solutions that greatly accelerate these processes, as these demonstrators show. Wider benefits lie in pharmaceuticals, agrochemicals, food production, electronics manufacture and nano-technology from the faster, cheaper discovery of new catalysts, metals, polymers, organic and inorganic materials, and from the identification of new regions where species may thrive or proliferate, together with the definition of supportive or disruptive external influences.

Nanotechnology New and 'better' materials Benefits in pharmaceuticals, agrochemicals, food production, electronics manufacture from the faster, cheaper discovery of new catalysts, metals, polymers, organic and inorganic materials “Grids have the potential to store and analyze data on a scale that will support faster, cheaper synthesis of a whole range of new materials.” Many technological advances centre on the availability of new and 'better' materials, with specific combinations of materials properties suited to the application, or on capturing the complex and evolving patterns of genetic information determining the development of an embryo. Archival, retrieval and visualisation of vast amounts of data, often stored in differing formats in distributed databases, is key to identifying these new materials and understanding the genetic interactions that underlie the processes of life-form development, disease and evolution. The Grid provides solutions that greatly accelerate these processes, as these demonstrators show. Wider benefits lie in pharmaceuticals, agrochemicals, food production, electronics manufacture and nano-technology from the faster, cheaper discovery of new catalysts, metals, polymers, organic and inorganic materials, and from the identification of new regions where species may thrive or proliferate, together with the definition of supportive or disruptive external influences.

Environment Modeling and prediction of earthquakes Climate change studies and weather forecast Pollution control Socio-economic growth planning, financial modeling and performance optimization Measuring, modelling and predicting the effects of human-induced effects upon complex eco-systems such as the climate or upon the local environment create challenges that have not heretofore been easily solved. The Grid, and its emerging technologies, is making possible much more accurate measurements, models and predictions by permitting dynamically scheduled, collaborative computations by multi-disciplinary teams in geographically dispersed locations. Complex problems in biodiversity are being solved using a comprehensive global catalogue of life that enables questions to be answered such as 'where might an existing species be successfully introduced' and 'what is the impact of biological change on complex ecosystems'. For pollution control and climate prediction, this involves complex computations, perhaps with on-line variation of critical parameters by dispersed scientists, and even the use of thousands of independent computers, each contributing to the overall prediction using Monte Carlo simulation techniques. These challenges have been effectively intractable until now, even with today's best supercomputers. Wider benefits include other computationally intensive simulations, for instance in socio-economic growth planning, financial modelling and performance optimisation, scientific and design simulations (e.g. in the aeronautical, automotive and marine industries), medical simulation (e.g. understanding the blood flow through reconstruction of MRA data) and environmental simulation (e.g. air quality monitoring and visualisation of meteorological and climate data). “Federations of heterogeneous databases can be exploited through grid technology to solve complex questions about global issues such as biodiversity.”

Get involved! Volunteer computing grids, like SETI@home, rely on volunteers like you to donate spare computing cycles when their computer is idle. SETI@home spawned an entire industry: Xpulsar@home, Genome@home, Folding@home, evolutionary@home, FightAIDS@home, SARS@home... ...LHC@home and more!

Want more? www.eu-egee.org www.cern.ch/lcg lhcathome.cern.ch www.eu-egi.org www.gridcafe.org www.gridrepublic.org 33