Dr Tim Smith CERN/IT For the visit of the Alliance of German Science Organizations.

Slides:



Advertisements
Similar presentations
Partnering with Faculty / researchers to Enhance Scholarly Communication Caroline Mutwiri.
Advertisements

Wei Lu 1, Kate Keahey 2, Tim Freeman 2, Frank Siebenlist 2 1 Indiana University, 2 Argonne National Lab
Opening the Research Data Lifecycle Workshop Capturing and Sharing Research Data Simon Coles School of Chemistry, University of Southampton, U.K.
Pulling it all together… with thanks to Sheila Anderson.
Replicating Data from the Large Electron Positron (LEP) collider at CERN (Aleph Experiment) Under the DPHEP umbrella Marcello Maggi/INFN –Bari Tommaso.
The Library behind the scene How does it work ? The Library behind the scenes 1 JINR / CERN Grid and advanced information systems 2012 Anne Gentil-Beccot.
EU-GRID Work Program Massimo Sgaravatto – INFN Padova Cristina Vistoli – INFN Cnaf as INFN members of the EU-GRID technical team.
23/04/2008VLVnT08, Toulon, FR, April 2008, M. Stavrianakou, NESTOR-NOA 1 First thoughts for KM3Net on-shore data storage and distribution Facilities VLV.
©STFC/Keith G Jeffery Metadata in the European e-Infrastructure Metadata in the European e-Infrastructure Keith G Jeffery Science and Technology.
1 Bridging Clouds with CernVM: ATLAS/PanDA example Wenjing Wu
Digital preservation Hydra Europe, LSE 24 April 2015 Anders Conrad.
XXII International Symposium on Nuclear Electronics & Computing NEC’09 TOWARDS OPEN ACCESS PUBLISHING AT JINR I.A. Filozova, V.V. Korenkov, G. Musulmanbekov.
CERN – IT Department CH-1211 Genève 23 Switzerland t CERN Open Source Collaborative tools: Digital Library Software Tim Smith CERN/IT.
The DSpace Course Module – An introduction to DSpace.
OpenAIRE e-Infrastructure & Support for Open Access in FP7 and Horizon 2020 MedOANet Conference Athens, 17 October 2013 Birgit Schmidt University of Goettingen,
Central Reconstruction System on the RHIC Linux Farm in Brookhaven Laboratory HEPIX - BNL October 19, 2004 Tomasz Wlodek - BNL.
InDiCo 20 April 2004 EPFL, Lausanne Integrated Digital Conferencing JY Le Meur CERN
1 Kittikul Kovitanggoon*, Burin Asavapibhop, Narumon Suwonjandee, Gurpreet Singh Chulalongkorn University, Thailand July 23, 2015 Workshop on e-Science.
1 Data services and computing. 2 We tend to be dealt the computing environment in which we must operate. Few of us have enough influence to steer the.
European Organization for Nuclear Research Organisation Européenne pour la Recherche Nucléaire High-Energy Physics Data Delivering Data in Science ICSTI.
E.Soundararajan R.Baskaran & M.Sai Baba Indira Gandhi Centre for Atomic Research, Kalpakkam.
CERN openlab V Technical Strategy Fons Rademakers CERN openlab CTO.
LIBRARY SERVICES Strategies for gaining and maintaining academic support for the institutional open access.
CERN – IT Department CH-1211 Genève 23 Switzerland t Working with Large Data Sets Tim Smith CERN/IT Open Access and Research Data Session.
Enterprise Solutions Chapter 10 – Enterprise Content Management.
ATLAS WAN Requirements at BNL Slides Extracted From Presentation Given By Bruce G. Gibbard 13 December 2004.
23.March 2004Bernd Panzer-Steindel, CERN/IT1 LCG Workshop Computing Fabric.
CERN – IT Department CH-1211 Genève 23 Switzerland t Data Publishing Tim Smith CERN/IT.
The GridPP DIRAC project DIRAC for non-LHC communities.
Introduction to the VO ESAVO ESA/ESAC – Madrid, Spain.
LHC Computing, CERN, & Federated Identities
Institutional Repositories July 2007 Intellectual property management : the DISA experience Dr D Peters DISA: Digital Innovation South Africa.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Data Management Highlights in TSA3.3 Services for HEP Fernando Barreiro Megino,
CASTOR project status CASTOR project status CERNIT-PDP/DM October 1999.
ATLAS Distributed Computing perspectives for Run-2 Simone Campana CERN-IT/SDC on behalf of ADC.
CERN IT Department CH-1211 Genève 23 Switzerland t The Tape Service at CERN Vladimír Bahyl IT-FIO-TSI June 2009.
The GridPP DIRAC project DIRAC for non-LHC communities.
ATLAS Distributed Computing ATLAS session WLCG pre-CHEP Workshop New York May 19-20, 2012 Alexei Klimentov Stephane Jezequel Ikuo Ueda For ATLAS Distributed.
Open Science (publishing) as-a-Service Paolo Manghi (OpenAIRE infrastructure) Institute of Information Science and Technologies Italian Research Council.
Meeting with University of Malta| CERN, May 18, 2015 | Predrag Buncic ALICE Computing in Run 2+ P. Buncic 1.
VI/ CERN Dec 4 CMS Software Architecture vs Hybrid Store Vincenzo Innocente CMS Week CERN, Dec
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No EUDAT Aalto Data.
William J Nixon Setting up a Repository. Introduction Key Features to consider (and review) Wide Range of Technology Available –Best fit for purpose –Clear.
European Organization For Nuclear Research CERN Accelerator Logging Service Overview Focus on Data Extraction for Offline Analysis Ronny Billen & Chris.
DIRAC for Grid and Cloud Dr. Víctor Méndez Muñoz (for DIRAC Project) LHCb Tier 1 Liaison at PIC EGI User Community Board, October 31st, 2013.
EGI-InSPIRE RI EGI Compute and Data Services for Open Access in H2020 Tiziana Ferrari Technical Director, EGI.eu
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
Database 12.2 and Oracle Enterprise Manager 13c Liana LUPSA.
THE ATLAS COMPUTING MODEL Sahal Yacoob UKZN On behalf of the ATLAS collaboration.
Enhancements to Galaxy for delivering on NIH Commons
Accessing the VI-SEEM infrastructure
Open Exeter Project Team
WP18, High-speed data recording Krzysztof Wrona, European XFEL
OpenAIRE in 8 Minutes Tony Ross-Hellauer State and University Library,
Future Database Challenges
Introduction to Data Management in EGI
Tim Smith CERN Geneva, Switzerland
Data Fundamentals A. D. Smith – September 26, 2011.
Dagmar Adamova (NPI AS CR Prague/Rez) and Maarten Litmaath (CERN)
Opening Big Data; in small and large chunks
Ákos Frohner EGEE'08 September 2008
Vision for CERN IT Department
VI-SEEM Data Repository
Zenodo: A Research Data Repository for All
Ch 4. The Evolution of Analytic Scalability
A [very] short introduction to Data Management
Building an open library without walls : Archiving of particle physics data and results for long-term access and use Joanne Yeomans CERN Scientific Information.
Status of Grids for HEP and HENP
Dataverse for citing and sharing research data
EOSC-hub Contribution to the EOSC WGs
Presentation transcript:

Dr Tim Smith CERN/IT For the visit of the Alliance of German Science Organizations

[Oct 2013] - 2 As Designed: W. LHC Computing Grid Distributed Data Management – Limited Network resources – Optimize / minimize movement – File placement logic – Deterministic / Static Site Data Management – HSMs – Transparent file access and movement Disk-Tape migration/recall

[Oct 2013] - 3 Research Data Infrastructure of today Distributed Data Management – Network: a resource to schedule – Dynamic data placement – Data transfer services – Expt replica management rules Site Data Management – Indep. technology choices – Decoupled tiers – Disk caches Managed by owners – Bulk 3 rd party migration to tertiary by owners AAA: any data, any time, any where

[Oct 2013] - 4 CERN Infrastructure of tomorrow Connectivity (100 Gbps) 2015: 15k servers, 300k VMs

[Oct 2013] - 5 Big Data … in small pieces Long tail of science Big facilities Data Size x (a small number) x (a large number) Dedicated Big Data Stores

[Oct 2013] - 6

[Oct 2013] - 7 Naming Zenodotus of Ephesus – First librarian of the Ancient Library of Alexandria – First recorded use of metadata

[Oct 2013] - 8 Features

[Oct 2013] - 9 Communities

[Oct 2013] - 10 Deposit

[Oct 2013] - 11 HEP: Data Reduction / Analysis Publication Reduced Reconstructed Raw Researchers T2s, T1s Analysis Coordinators T1s Production Managers T0, T1s File Size # Files

[Oct 2013] - 12 HEP: More than Data Papers Tabular Data Correlation Matrices Internal Notes Wikis Presentations Quality monitoring data Filter / selection algorithms Formatters Calibration Data Conditions Data Log Books Researchers T2s, T1s Analysis Coordinators T1s Production Managers T0, T1s Workflows Contextual metadata SW: 10M LoC

[Oct 2013] - 13 Deposit

[Oct 2013] - 14 Differentiating Features Easy to use and attractive – DropBox integration – Drag-n-drop deposition Low barriers – Little fixed metadata Open on input as well as output – No restrictions on type of data – No restrictions on format of data – No restrictions on licences Distributed community curation

[Oct 2013] - 15 Retro/Per -spective OpenAIRE – FP7 Open Access pilot for peer reviewed articles OpenAIREplus – FP7 OA pilot for publications and research data CERN – Cloud Service

[Oct 2013] - 16 Interested Communities Workshops – Proceedings and presentations Projects – Research output and project artifacts Research Groups – Datasets – snapshots of a live store Universities – Datasets and articles Libraries – Newsletters – Data not fitting in traditional repositories Publishers – Publication/subsidiary datasets and software – Scanned and annotated logbooks Young Radiation Oncologists’ Conference

[Oct 2013] - 17 Perceived Attraction Trust / Security / Know-how – LHC data is thought safe there – Bit Preservation & Media Migration Longevity – An institute with a clear future – A memory institution for HEP Not a company Not a profit enterprise – No tricks and changes

[Oct 2013]