Big Data Minds 2015 | DESY-IT | V. Gülzow | Seite 1 Untertitel durch Klicken bearbeiten Volker Guelzow DESY & HTW-Berlin Hamburg, Sept. 24th, 2015 Big.

Slides:



Advertisements
Similar presentations
Joint CASC/CCI Workshop Report Strategic and Tactical Recommendations EDUCAUSE Campus Cyberinfrastructure Working Group Coalition for Academic Scientific.
Advertisements

SCD in Horizon 2020 Ian Collier RAL Tier 1 GridPP 33, Ambleside, August 22 nd 2014.
Pre-Commercial Procurement proposal - HNSciCloud
The LHC Computing Grid – February 2008 The Worldwide LHC Computing Grid Dr Ian Bird LCG Project Leader 15 th April 2009 Visit of Spanish Royal Academy.
Bob Jones, CERN, IT department 4 October Why Procurement The activities within the Helix Nebula initiative have shown that public research organisations.
Ian Bird WLCG Workshop Okinawa, 12 th April 2015.
GridPP Steve Lloyd, Chair of the GridPP Collaboration Board.
Ian Bird LHCC Referees’ meeting; CERN, 11 th June 2013 March 6, 2013
EGI-Engage EGI-Engage Engaging the EGI Community towards an Open Science Commons Project Overview 9/14/2015 EGI-Engage: a project.
Procurement Innovation for Cloud Services in Europe CERN – 14 May 2014 Bob Jones (CERN) This document produced by Members of the Helix Nebula consortium.
1 European policies for e- Infrastructures Belarus-Poland NREN cross-border link inauguration event Minsk, 9 November 2010 Jean-Luc Dorel European Commission.
Computing for ILC experiment Computing Research Center, KEK Hiroyuki Matsunaga.
Advanced Computing Services for Research Organisations Bob Jones Head of openlab IT dept CERN This document produced by Members of the Helix Nebula consortium.
Storage and data services eIRG Workshop Amsterdam Dr. ir. A. Osseyran Managing director SARA
Helix Nebula The Science Cloud CERN – 14 May 2014 Bob Jones (CERN) This document produced by Members of the Helix Nebula consortium is licensed under a.
Slide 1 John Dyer TERENA ASPIRE Project Manager TF-MSP 28 September 2012 ASPIRE Foresight Study
Notur: - Grant f.o.m is 16.5 Mkr (was 21.7 Mkr) - No guarantees that funding will increase in Same level of operations maintained.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Vision for European DCIs Steven Newhouse Project Director, EGI-InSPIRE 15/09/2010.
Evolution, by tackling new challenges| CHEP 2015, Japan | Patrick Fuhrmann | 16 April 2015 | 1 Patrick Fuhrmann On behave of the project team Evolution,
A public-private partnership building a multidisciplinary cloud platform for data intensive science Bob Jones Head of openlab IT dept CERN This document.
Cloud Services for Research CERN – 26 June 2014 Bob Jones (CERN) This document produced by Members of the Helix Nebula consortium is licensed under a Creative.
This document produced by Members of the Helix Nebula Partners and Consortium is licensed under a Creative Commons Attribution 3.0 Unported License. Permissions.
Bob Jones Technical Director CERN - August 2003 EGEE is proposed as a project to be funded by the European Union under contract IST
RI EGI-InSPIRE RI EGI Future activities Peter Solagna – EGI.eu.
ESFRI & e-Infrastructure Collaborations, EGEE’09 Krzysztof Wrona September 21 st, 2009 European XFEL.
Ian Bird, WLCG MB; 27 th October 2015 October 27, 2015
Ian Bird CMS Computing & Software CERN, 15 th October Oct 2015 Ian Bird; CMS Offline & Computing1.
26/05/2005 Research Infrastructures - 'eInfrastructure: Grid initiatives‘ FP INFRASTRUCTURES-71 DIMMI Project a DI gital M ulti M edia I nfrastructure.
Helix Nebula The Science Cloud CERN – 13 June 2014 Alberto Di MEGLIO on behalf of Bob Jones (CERN) This document produced by Members of the Helix Nebula.
Slide David Britton, University of Glasgow IET, Oct 09 1 Prof. David Britton GridPP Project leader University of Glasgow UK-T0 Meeting 21 st Oct 2015 GridPP.
LHC Computing, CERN, & Federated Identities
1 The European Open Science Cloud: Open Day Event EMBL, Heidelberg, 20 January 2016 Joint Research Centre (JRC) The European Commission’s in-house science.
A European Open Science Cloud
The Helix Nebula Initiative EMBL – 20 January 2016 Maryline Lengert (ESA) This document produced by Members of the Helix Nebula consortium is licensed.
INFSO-RI Enabling Grids for E-sciencE The EGEE Project Owen Appleton EGEE Dissemination Officer CERN, Switzerland Danish Grid Forum.
Overview on European e-Infrastructure Augusto Burgueño DG CONNECT Porto, 18 June 2015 – GÉANT General Assembly.
Possibilities for joint procurement of commercial cloud services for WLCG WLCG Overview Board Bob Jones (CERN) 28 November 2014.
WP6 – Inter-operability with e- infrastructures Sergio Andreozzi Strategy and Policy Manager, EGI.eu This document produced by Members of the Helix Nebula.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI strategy and Grand Vision Ludek Matyska EGI Council Chair EGI InSPIRE.
Ian Bird WLCG Management Board; CERN, 20 th May 2014.
Interoperability and Integration of EGI with Helix Nebula - Workshop Sergio Andreozzi Strategy and Policy Manager (EGI.eu) 11/04/2013 EGI Community.
3rd Helix Nebula Workshop on Interoperability among e-Infrastructures and Commercial Clouds Carmela ASERO, EGI.eu 17 September 2013, Madrid
3 nd Helix Nebula Workshop on Interoperability among e-Infrastructures and Commercial Clouds Sergio Andreozzi Strategy and Policy Manager, EGI.eu EGI Technical.
Economical opportunities stemming from data and computing e- infrastructures Stakeholders consultation on computing and data for the WP Brussels,
Helix Nebula Workshop On Interoperability among Public And Community Clouds Session 2: Networking Connectivity Convener: Carmela ASERO, EGI.eu19 September.
Ian Bird, CERN 1 st February Dec 2015
IPCEI on High performance computing and big data enabled application: a pilot for the European Data Infrastructure Antonio Zoccoli INFN & University of.
Summary and next steps for the future Bob Jones, CERN Second Helix Nebula Review 26 June 2014 This document produced by Members of the Helix Nebula consortium.
WP9– Evaluation, roadmap & development plan Rupert Lueck EMBL – 26 June
EGI-InSPIRE EGI-InSPIRE RI EGI Federated Cloud business models and role in HNX Sergio Andreozzi Strategy and Policy Manager.
European Perspective on Distributed Computing Luis C. Busquets Pérez European Commission - DG CONNECT eInfrastructures 17 September 2013.
The Helix Nebula marketplace 13 May 2015 Bob Jones, CERN.
EGI-Engage EGI Webinar - Introduction - Gergely Sipos EGI.eu / MTA SZTAKI 6/26/
Eu-T0 Laura Perini 28 febbraio 20141Laura ws ccr.
EGI-InSPIRE EGI-InSPIRE RI The European Grid Infrastructure Steven Newhouse Director, EGI.eu Project Director, EGI-InSPIRE 29/06/2016CoreGrid.
WP6 – Inter-operability with e-Infrastructures Sergio Andreozzi - WP6 Task Leader Strategy and Policy Manager, EGI.eu Helix Nebula - 1st Year Review 1.
EGI-InSPIRE RI EGI Compute and Data Services for Open Access in H2020 Tiziana Ferrari Technical Director, EGI.eu
RI EGI-InSPIRE RI Pre-OMB meeting Preparation for the Workshop “EGI towards H2020” NGI_UK John Gordon and.
EGI-InSPIRE EGI-InSPIRE RI EGI strategy towards the Open Science Commons Tiziana Ferrari EGI-InSPIRE Director at EGI.eu.
Ian Bird LHCC Referees; CERN, 2 nd June 2015 June 2,
Work Plan for the Second Period Bob Jones, CERN First Helix Nebula Review 03 July This document produced by Members of the Helix Nebula consortium.
EGI-InSPIRE RI An Introduction to European Grid Infrastructure (EGI) March An Introduction to the European Grid Infrastructure.
Cremlin Kick Off | DESY-IT | V. Gülzow | Seite 1 Untertitel durch Klicken bearbeiten e-Infrastructures & Big Data Handling Volker Guelzow DESY Moscow,
H2020, COEs and PRACE.
INFN Computing Outlook The Bologna Initiative
LifeWatch, costing and funding
EGI-Engage Engaging the EGI Community towards an Open Science Commons
Antonella Fresa Technical Coordinator
EGI Webinar - Introduction -
14th International IEEE eScience Conference
Presentation transcript:

Big Data Minds 2015 | DESY-IT | V. Gülzow | Seite 1 Untertitel durch Klicken bearbeiten Volker Guelzow DESY & HTW-Berlin Hamburg, Sept. 24th, 2015 Big Data Management - Motor der Wissenschaft -

Big Data Minds 2015 | DESY-IT | V. Gülzow | Seite 2 Introduction What is „Big Data“, what DESY? > Volume -> PB/year > Velocity -> data ingest, analysis less time critical (compared f.i. to stock exchange) > Variety -> well structured data (compared to social media data) > Veracity -> high (compared to social media data) > Value -> high because the science is in the data

Big Data Minds 2015 | DESY-IT | V. Gülzow | Seite 3 > Data Sources for DESY

Big Data Minds 2015 | DESY-IT | V. Gülzow | Seite 4 Particle Physics needs

Big Data Minds 2015 | DESY-IT | V. Gülzow | Seite 5 Higgs Discovery

Big Data Minds 2015 | DESY-IT | V. Gülzow | Seite 6 Higgs Discovery # of Analysis Jobs

Big Data Minds 2015 | DESY-IT | V. Gülzow | Seite 7 Completed WLCG Jobs per Tier-2 Site DESY Atlas CMS

Big Data Minds 2015 | DESY-IT | V. Gülzow | Seite 8 Higgs Discovery Data Volume

Big Data Minds 2015 | DESY-IT | V. Gülzow | Seite 9 Data Requirements from Photon DESY

Big Data Minds 2015 | DESY-IT | V. Gülzow | Seite 10 PETRA III 1010 Köcherfliege (Limnephilus flacivornis) Kopf + Thorax Courtesy: Dr. F. Beckmann

Big Data Minds 2015 | DESY-IT | V. Gülzow | Seite 11 Investigation of a van Gogh painting

Big Data Minds 2015 | DESY-IT | V. Gülzow | Seite 12 Other examples  3D-time dependent  Pattern recognition  Fast image analyis (f.i. through neural networks)  Virtual reality (Cave )

Big Data Minds 2015 | DESY-IT | V. Gülzow | Seite 13 New Challenges TypeFrame size Frame rate Peak rate Avail. Pilatus 6M 2463 x 2527 x 425 Hz4.6Gb/sNow AGIPD ( Module) 128 x 512 x 2 x 352 x 14bit 4.5 MHz (10 Hz) 6.1 Gb/s2015 Eiger1k x 1k x 22 kHz30 Gb/snow Lambda3 x 1536 x 512 x 22 kHz60 Gb/snow Percival (1S) 4k x 4k x 2120 Hz60 Gb/s2015 Percival (4S) 8k x 8k x 2120 Hz240 Gb/s Late 2015

Big Data Minds 2015 | DESY-IT | V. Gülzow | Seite 14 Methods are different for light sources

Big Data Minds 2015 | DESY-IT | V. Gülzow | Seite 15 The DESY ICT Eco-System Detect or Centr al Stora ge Dat a Analys is Loc al Research er Remot e Research er Loc al Cac he Onl ine Archi ve, Outs ide Simulati ons Grid NAF HPC Farm Cloud Visual izatio n Data&Metadata Management Software Data Policies Technical Infrastructure User Management/AAI networks

Big Data Minds 2015 | DESY-IT | V. Gülzow | Seite 16 Hard- & Software technology evolution

Big Data Minds 2015 | DESY-IT | V. Gülzow | Seite 17 Untertitel durch Klicken bearbeiten Electronic systems market value in 2014 was ~1.5 Trillion $ 10 biggest segments Moderate growth rates Maturing markets HEP is here ~15M$ out of 52B$ From Bernd Panzer, Cern CAGR = Compound Annual Growth Rate End-Use Markets

Big Data Minds 2015 | DESY-IT | V. Gülzow | Seite 18 Processors INTEL, Qualcomm, Samsung, AMD, IBM Graphics INTEL, Nvidia, AMD Hard Disk Drives Western Digital, Seagate, Toshiba DRAM memory Samsung, SK Hynix, Micron NAND Flash memory Samsung, Toshiba, SanDisk, Micron, Hynix, INTEL Solid State Disks Samsung, INTEL, SanDisk, Toshiba, Micron FPGA Xilinx, Altera (currently being bought by INTEL) Tape Storage HP, Fuji, IBM, SpectraLogic, ORACLE Only a few large companies are dominating the various components markets Market Dominance Few companies capable of large scale investments, majority fabless companies Favour evolutionary (adiabatic) changes of technology Clear bias against ‘disruptive’ new technologies (memristor, holographic storage, DNA storage,quantum computing, non-volatile memory, etc.)

Big Data Minds 2015 | DESY-IT | V. Gülzow | Seite 19 Hardware technology perspective > 20% increase of compute power/year per $ > 15% increase of disk capacity/year per $ > Tape will still improve very much but the role will change > Only a few vendors, this is risky > Evolution, no disruptive changes > Application development for multicore/GPU‘s needed > A rapid network development to Tbit/s

Big Data Minds 2015 | DESY-IT | V. Gülzow | Seite 20 dCache – OwnCloud Data Management Spinning Disks Tape, Blue-ray … Unlimited hierarchical Storage Space NFS 4.1 CDMI WEB 2.0 dCache SSD’S

Big Data Minds 2015 | DESY-IT | V. Gülzow | Seite 21 dCache Big Data Cloud LOFAR antenna Huge amounts of data X-FEL (Free Electron Lasers) Fast Ingest

Big Data Minds 2015 | DESY-IT | V. Gülzow | Seite 22 Towards the DESY Strategy 2 Examples: > Speed > HNSciCloud

Big Data Minds 2015 | DESY-IT | V. Gülzow | Seite 23 The Speed Project for Beamlines: Cooperation DESY/IBM

Big Data Minds 2015 | DESY-IT | V. Gülzow | Seite 24 The Architecture

Big Data Minds 2015 | DESY-IT | V. Gülzow | Seite 25 April 28, 2015 The HNSciCloud Project European Open Science Cloud Pilot Project > Bring together the stakeholders  Research Infrastructures (ESFRI, etc.)  Research Organisations (WLCG tier-1 etc.)  European e-Infrastructures (GEANT, EGI, PRACE, EUDAT, OpenAIRE)  Commercial cloud service providers (Helix Nebula, etc.)  End-users including the long-tail of science > Deliver the pilot  Technical architecture for the hybrid cloud  Security model compatible with EU data protection legislation  Assemble and deploy a 5% scale prototype  Verify the business model to ensure it can be sustained beyond the pilot  Governance structure avoiding monopoly of any research group or service provider > Roadmap for full-scale implementation > Today still more cost effective to operate our own facilities, but this situation is expected to change -> spot market > Hybrid model gives us flexibility  Does not save staff effort as we still need to operate services there, as well as maintaining in-house services

Big Data Minds 2015 | DESY-IT | V. Gülzow | Seite 26 April 28, 2015 Project Consortium > Includes buyers and experts in the preparation, execution and promotion of the procurement > Idea to use EC procurement calls to co-fund exploratory joint-procurement of cloud services > Proposal submitted on April 14 > Duration:30 month EGI.eu – Integration with e-infrastructures TRUST-IT - Comms/Dissem. CERN, DESY, EMBL-EBI, KIT, IN2P3, INFN, PIC, SARA/Nikhef, STFC BuyersExpertsConsortium Sub-contractor experts: Strategic Blue - Cloud Financial Broker Pinsent Mason – Cloud legal advisor Trento Network - EC PCP Legal Advisor The buyers are public organisations that commit to contribute to a joint procurement

Big Data Minds 2015 | DESY-IT | V. Gülzow | Seite 27 … and a lot more to do > AAI: access to federated resources (adapting existing solutions from other projects like EduGain based solutions) > Security, policies, life cycle management (like „how long? Ownership? Who has access? What kind of data?..) > Portals for scientific and industrial users (like access to resources, virtual accounting, industrial usage,..) > Open Access

Big Data Minds 2015 | DESY-IT | V. Gülzow | Seite 28 The DESY Big Data strategy in a nutshell (1) > Develop high sophisticated „Big Data“ management solutions fully alined to the research topics at DESY > … fitting on site experiments and off site experiments > Close cooperation with the scientists, directly in scientific projects > Find solutions in cooperation with other Lab‘s > Apply for third party funding > Cooperation with industry > Assure 24x7 operation > Serve Eu-Xfel (and others) on a full cost model

Big Data Minds 2015 | DESY-IT | V. Gülzow | Seite 29 The DESY Big Data strategy in a nutshell (2) > Share resources as much as possible between communities > Prepare for a hybrid model „data on site – compute partially on the spot market“ > Development of data management software (-> dCache) > Development of data portals -> Gamma Portal > Extend DESY Data Cloud > Provide excellent analysis facilities, local and Cloud based > Define Data Policies with experiments > Offer Long Term Data Preservation > Continously upgrade networking

Big Data Minds 2015 | DESY-IT | V. Gülzow | Seite 30 Final No Computing - no Science