Modelling and Data Centre Requirements: CEDA ESGF UV-CDAT Conference 09-11 December 2014 Philip Kershaw, Centre for Environmental Data Archival, RAL Space,

Slides:



Advertisements
Similar presentations
Professor Dave Delpy Chief Executive of Engineering and Physical Sciences Research Council Research Councils UK Impact Champion Competition vs. Collaboration:
Advertisements

12 August 2004 Strategic Alignment By Maria Rojas.
1 ALL EPSRC VISITS EPSRC plans and priorities. 2 DIGITAL ECONOMY EPSRC lead AHRC ESRC MRC ENERGY EPSRC lead BBSRC ESRC NERC STFC NANOSCIENCE THROUGH ENGINEERING.
© 2005, CARE USA. All rights reserved. PARIS PROGRAM APPROCH At CARE Bangladesh.
The Role of Environmental Monitoring in the Green Economy Strategy K Nathan Hill March 2010.
Connecting the dots: A Family Care model that protects children.
Local Education and Training Boards Adam C Wardle Managing Director, Yorkshire and the Humber Local Education and Training Board.
The Research Workflow Revolution: The Impact of Web 2.0 And Emerging Networking Tools On Research Workflow Bill Russell Communications Director 4 th April.
Introduction to Research Data Management Services, January 2013 Research Data Management Infrastructure The Current Context.
Public engagement and lifelong learning: old wine in a new bottle, or a blended malt? Paul Manners Director, National Co-ordinating Centre for Public Engagement.
December 2008 MRC Data Support Services (DSS) Chris Morris 13 th February 2009 Sharing Research Data: Pioneers, Policies and Protocols The seventh cat.
Cloud Computing Special Interest Group Cloud Computing for the UK Research Community Workshop December 2013 Philip Kershaw, STFC Rutherford Appleton.
Data-intensive Research Policy In Ireland A brief overview By J.-C. Desplat.
VO Sandpit, November 2009 NERC Big Data And what’s in it for NCEO? June 2014 Victoria Bennett CEDA (Centre for Environmental Data Archival)
Improvement Service / Scottish Centre for Regeneration Project: Embedding an Outcomes Approach in Community Regeneration & Tackling Poverty Effectively.
Facilitating Multi Stakeholder Processes and Social Learning Herman Brouwer/ Karèn Verhoosel Centre for Development Innovation MSP Generic Process.
THE JOINED UP WORLD OF E-RESEARCH Professor Neil McLean National Technical Standards Adviser to the Department of Education Science and Training (DEST)
Questions from a patient or carer perspective
PopMedNet Software Development Life Cycle Chayim Herzig-Marx Harvard Pilgrim Health Care Institute Daniel Dee Lincoln Peak Partners.
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
Thomas Hacker Barb Fossum Matthew Lawrence Open Science Grid May 19, 2011.
Strategic Information Systems Planning
Don Von Dollen Senior Program Manager, Data Integration & Communications Grid Interop December 4, 2012 A Utility Standards and Technology Adoption Framework.
Sustainable Development and HEFCW Higher Education Academy Conference Edinburgh, 24 January 2006 Alyson Thomas, Senior Economic Development Manager, HEFCW.
SoundSoftware.ac.uk: Software sustainability for the audio and music researcher Chris Cannam, Mark Plumbley, Luís Figueira Centre for Digital Music Queen.
Strategic Planning & the Duty to Co-operate Andrew Pritchard Director of Policy & Infrastructure.
EGI-Engage EGI-Engage Engaging the EGI Community towards an Open Science Commons Project Overview 9/14/2015 EGI-Engage: a project.
User requirements for and concerns about a European e-Infrastructure Steven Newhouse, Director.
Copyright © 2004 Sherif Kamel Information Systems Planning Sherif Kamel The American University in Cairo.
Climate Sciences: Use Case and Vision Summary Philip Kershaw CEDA, RAL Space, STFC.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
TEMPLATE DESIGN © An increasing world population, industrial development, globalization and changing weather and climate.
Governance Sub-Committee Report: A Proposal to Measure Progress Toward Realizing the NSDI Vision NGAC Governance Sub-Committee December 2, 2009.
DASISH Final Conference Common Solutions to Common Problems.
Organisational Journey Supporting self-management
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Collaborating with iPlant.
The Future of the iPlant Cyberinfrastructure: Coming Attractions.
Doug Tody E2E Perspective EVLA Advisory Committee Meeting December 14-15, 2004 EVLA Software E2E Perspective.
Dr. Fran Berman, RPI Feedback from BRDI Sponsor Forum 11/11 January 29, 2012 Fran Berman.
Data Publication and Quality Control Procedure for CMIP5 / IPCC-AR5 Data WDC Climate / DKRZ:
GBIF Mid Term Meetings 2011 Biodiversity Data Portals for GBIF Participants: The NPT Global Biodiversity Information Facility (GBIF) 3 rd May 2011.
EPA Geospatial Segment United States Environmental Protection Agency Office of Environmental Information Enterprise Architecture Program Segment Architecture.
UKOLN is supported by: Digital Preservation Benefits Tools Project Dissemination Workshop Dr Liz Lyon, Associate Director, UK Digital Curation Centre Director,
Aims to: ● Generate commercial advantage for the College ● Enhance economic and social impact through delivery of an integrated programme of knowledge.
UK Environmental Observation Framework.
E-Science Research Councils awarded e-Science funds ” science increasingly done through distributed global collaborations enabled by the Internet, using.
Ruth Pordes November 2004TeraGrid GIG Site Review1 TeraGrid and Open Science Grid Ruth Pordes, Fermilab representing the Open Science.
Architecture View Models A model is a complete, simplified description of a system from a particular perspective or viewpoint. There is no single view.
Partnership Working Page 1 Celebrating Research and Partnership Working Conference Thursday 15 th October 2015 Dr Mashuq Ally – Assistant Director Equalities,
Kathy Corbiere Service Delivery and Performance Commission
TDRp Implementation Challenges David Vance, Executive Director Peggy Parskey, Assistant Director October 23, 2014.
What does the Devolution Agreement Say? £6bn of Health & Social care budgets transferred to the region. This covers the whole of the health and care system.
A centre of expertise in digital information management UKOLN is supported by: Functional Requirements Eprints Application Profile Working.
The Modeling Circle Courtesy M. Lautenschlager, DKRZ.
Co-funded by European Commission eContentplus Sharing Practices and Experiences on the Authoring and Adaptation of Open Educational Resources Alexander.
A way to develop software that emphasizes communication, collaboration, and integration between development and IT operations teams.
ETICS An Environment for Distributed Software Development in Aerospace Applications SpaceTransfer09 Hannover Messe, April 2009.
HLC Criterion Five Primer Thursday, Nov. 5, :40 – 11:40 a.m. Event Center.
Using a Simple Knowledge Organization System to facilitate Catalogue and Search for the ESA CCI Open Data Portal EGU, 21 April 2016 Antony Wilson, Victoria.
1 Open Science Grid: Project Statement & Vision Transform compute and data intensive science through a cross- domain self-managed national distributed.
DATA DEVOLUTION IN BRISTOL Bristol City Council 21/09/2015.
Research Councils UK and the research funding landscape Name Job title Research Councils UK.
ROBERT LOWSON EEA COORDINATOR GMES BUREAU.
EGI-Engage Engaging the EGI Community towards an Open Science Commons
Project Overview and EOSC Governance
Enabling step change in your PPM Maturity
Scotland’s Digital Health and Care Strategy
Brian Matthews STFC EOSCpilot Brian Matthews STFC
RDA uptake activities and plans: ESGF
Expand portfolio of EGI services
Presentation transcript:

Modelling and Data Centre Requirements: CEDA ESGF UV-CDAT Conference December 2014 Philip Kershaw, Centre for Environmental Data Archival, RAL Space, STFC

Centre for Environmental Data Archival CEDA Archive snapshot – variety + complexity challenge 3.0 PB of allocated archive 2.3 PB used in 2,176 “filesets” totalling 152M files Our CMIP5 is 1.2 PB in 1,174 “filesets” totalling 3.2M files

CEDA’s Engagement with ESGF Overarching requirement comes through NERC (UK Natural Environment Research Council): – to maximise the UK's contributions to the CMIP cycle and – Exploit the data for the user communities Supplementary requirements related to CEDA's stakeholders and associated services. International collaboration has been a key to meeting these objectives: – engaging with shared software development effort was more likely to result in systems fit for purpose and – build a community upon which to create common tools and services. The current operation and support burden with ESGF together with other commitments is placing a big strain

Consistency, conformance to standards, performance of services within ESGF Issues around the ingest pipeline and consistency of metadata – “It takes two days to write a script to handle tens to hundreds of parallel wget threads, and six months to deal with all the failure modes associated with mis-configured information” – There are many opportunities in the process for de-synchronisation – Need a single source of authority for information Uptime and reliability of services – We’re interconnected and reliant on one another – But lack of reliability and responsiveness to issues of any one service affects people’s perception of the whole of the federation and of individual partners – There are key services which have a high profile and larger impact – It needs a practical re-assessment e.g. Should we be in the business of running IdPs?

Governance Need clarity about the scope of governance in each of the contexts: – Projects and data – The operational system – The software What drives requirements – The science – User communities – The data centres: the system is not sustainable if it cannot be integrated into the data management infrastructures of the institutions that are operating it.

Operations and Support Need to create a virtuous circle of experience from operations feeding back into software development drivers – Complexity increases exponentially with number of deployments. This is a Federation – Do something simple and do it well Establish processes and decision gates – Process for a new project joining the federation Should it join at all? – What does it gain for project and for the existing communities using ESGF? – Process for releases and patching – does the severity of a security alert warrant major disruption? – Process for publishing … other processes … Clearly delineate between project specific and federation-wide scope Resourcing - People and skills, funding Metrics for level of service – SLA, uptime – If a given provider can’t meet perhaps they shouldn’t be doing it or perhaps we’re doing the wrong things

Future Priorities for our Engagement CEDA needs to serve a number of projects and communities over above ESGF – We can’t continue to run parallel systems – Need to integrate component by component as required and support for interfaces Need to resolve governance and, Operations and support – How can these be resourced? – Simplifying what we run could be more effective Publishing is a high priority for CEDA to contribute to and improve – both from a point of view of software – best practice for consistency and good version control