VO Sandpit, November 2009 NERC Big Data And what’s in it for NCEO? June 2014 Victoria Bennett CEDA (Centre for Environmental Data Archival)

Slides:



Advertisements
Similar presentations
NATURAL ENVIRONMENT RESEARCH COUNCIL NERC Data Policy Why NERC considers data management an important activity. Mark Thorley.
Advertisements

VO Sandpit, November 2009 CEDA Storage Dr Matt Pritchard Centre for Environmental Data Archival (CEDA)
STFC and the UK e-Infrastructure Initiative The Hartree Centre Prof. John Bancroft Project Director, the Hartree Centre Member, e-Infrastructure Leadership.
Research Computing and Facilitating Services CLMS Symposium 28 th June 2012 Clare Gryce Head of Research Computing & Facilitating Services.
The role of the ISIC facility for Climate and Environmental Monitoring from Space (CEMS) in the development of Quality Assured Datasets and Downstream.
Driving Innovation Robert Lowson National Contact Point FP7 Space Cork 17 September 2012 The United Kingdom; a major force in space 1.
QCloud Queensland Cloud Data Storage and Services 27Mar2012 QCloud1.
Leicester Database & Archive Service J. D. Law-Green, J. P. Osborne, R. S. Warwick X-Ray & Observational Astronomy Group, University of Leicester What.
Modelling and Data Centre Requirements: CEDA ESGF UV-CDAT Conference December 2014 Philip Kershaw, Centre for Environmental Data Archival, RAL Space,
Space research Horizon Work Programme. Horizon 2020 Space work programme Work programme covers Discussion in space shadow programme committee.
VO Sandpit, November 2009 CEDA Mission: “curation and facilitation” “Managing complex datasets and accompanying information for reuse and repurpose” Sam.
NERC Environmental Big Data Capital Bid Neil Lonie & Andrew Brooks.
GridPP Tuesday, 23 September 2003 Tim Phillips. 2 Bristol e-Science Vision National scene Bristol e-Science Centre Issues & Challenges.
UK Space Agency Saleh Ahmed Assistant Director, UK Space Agency
CEMS: The Facility for Climate and Environmental Monitoring from Space Victoria Bennett, ISIC/CEDA/NCEO RAL Space.
1 European policies for e- Infrastructures Belarus-Poland NREN cross-border link inauguration event Minsk, 9 November 2010 Jean-Luc Dorel European Commission.
What is EGI? The European Grid Infrastructure enables access to computing resources for European scientists from all fields of science, from Physics to.
Climate Sciences: Use Case and Vision Summary Philip Kershaw CEDA, RAL Space, STFC.
The UK Space Agency Overview for CEOS September 2012.
VO Sandpit, November 2009 e-Infrastructure to enable EO and Climate Science Dr Victoria Bennett Centre for Environmental Data Archival (CEDA)
‘intelligent openness’ The common objective of an RCUK data policy Gregor McDonagh
JASMIN and CEMS: The Need for Secure Data Access in a Virtual Environment Cloud Workshop 23 July 2013 Philip Kershaw Centre for Environmental Data Archival.
14 Aug 08DOE Review John Huth ATLAS Computing at Harvard John Huth.
VO Sandpit, November 2009 CEDA Metadata Steve Donegan/Sam Pepler.
Satellites, Ground Segment, and Data Access Evolution at DLR K
Virtualisation & Cloud Computing at RAL Ian Collier- RAL Tier 1 HEPiX Prague 25 April 2012.
VO Sandpit, November 2009 e-Infrastructure for Climate and Atmospheric Science Research Dr Matt Pritchard Centre for Environmental Data Archival (CEDA)
Slide David Britton, University of Glasgow IET, Oct 09 1 Prof. David Britton GridPP Project leader University of Glasgow UK-T0 Meeting 21 st Oct 2015 GridPP.
Required Data Centre and Interoperable Services: CEDA
Welcome to the PRECIS training workshop
DiRAC-3 – The future Jeremy Yates, STFC DiRAC HPC Facility.
CEOS WGISS-40 COSMO-SkyMed Ground Segment & other Ground Segment Developments Richard Lowe – Group Manger, EO Systems & Operations.
Data Preservation at Rutherford Lab David Corney 9 th July 2010 KEK.
Support to scientific research on seasonal-to-decadal climate and air quality modelling Pierre-Antoine Bretonnière Francesco Benincasa IC3-BSC - Spain.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI strategy and Grand Vision Ludek Matyska EGI Council Chair EGI InSPIRE.
Sensors and Instrumentation Computational and Data Challenges in Environmental Modelling Dr Peter M Allan Director, Hartree Centre, STFC.
Supporting the “Solving Business Problems with Environmental Data” Competition 24 th October 2013 Vlad Stoiljkovic.
Using a Simple Knowledge Organization System to facilitate Catalogue and Search for the ESA CCI Open Data Portal EGU, 21 April 2016 Antony Wilson, Victoria.
British Antarctic Survey Polar Science For Planet Earth (PSPE) Images can be downloaded here from the BAS image collection here:
IPCEI on High performance computing and big data enabled application: a pilot for the European Data Infrastructure Antonio Zoccoli INFN & University of.
Building European Scientific Cloud Computing Infrastructure An overview by Marc-Elian Bégin, SixSq 1.
IBERGRID as RC Total Capacity: > 10k-20K cores, > 3 Petabytes Evolving to cloud (conditioned by WLCG in some cases) Capacity may substantially increase.
ESA UNCLASSIFIED – For Official Use Scientific exploitation…. Ws input to the round table DD/MM/YYYY.
Earth System Modelling: an HPC perspective Mike Ashworth & Rupert Ford Scientific Computing Department and STFC Hartree Centre STFC Daresbury Laboratory.
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No Support to scientific.
EGI-InSPIRE EGI-InSPIRE RI The European Grid Infrastructure Steven Newhouse Director, EGI.eu Project Director, EGI-InSPIRE 29/06/2016CoreGrid.
EGI-InSPIRE RI EGI Compute and Data Services for Open Access in H2020 Tiziana Ferrari Technical Director, EGI.eu
17 July 2007IPDA Steering Committee UK Data Activities Peter Allan, STFC/RAL Mark Leese, Open University.
Sentinel Data Access for Africa 11 December 2014 Sentinels Data Access for Africa.
Copernicus Data & Information Access Service → DIAS 28 September 2016.
2017/18 SANSA Annual Performance Plan
Workshop on the Future of Big Data Management June 2013 Philip Kershaw
Tools and Services Workshop
Joslynn Lee – Data Science Educator
University of Reading 29 March 2011
Opportunities for Collaboration
Copernicus Data & Information Access Services DIAS.
Dagmar Adamova, NPI AS CR Prague/Rez
Presentation on Copernicus Dissemination
UK Status and Plans Scientific Computing Forum 27th Oct 2017
Introduction to XSEDE Resources HPC Workshop 08/21/2017
Copernicus Programme European Commission CEOS Plenary 2017
JASMIN Success Stories
EGI – Organisation overview and outreach
EISCAT-3D: a data centric design for extreme scale computing
Cyberinfrastructure for the Life Sciences
Copernicus DIAS Objectives
Technologies, Tools, Methods for capacity building: UK
Head of Reserach & Enterprise Partnerships, University of Leicester
Presentation transcript:

VO Sandpit, November 2009 NERC Big Data And what’s in it for NCEO? June 2014 Victoria Bennett CEDA (Centre for Environmental Data Archival)

VO Sandpit, November 2009 Outline CEDA and EO Data evolution NERC Big Data NERC’s Big Data Facilities JASMIN and CEMS

UK Earth Observation scientists use super-data- cluster for Big Data processing and analysis CCI SST (Reading) GlobAlbedo (UCL) LST (Leicester) CCI Cloud (RAL)

VO Sandpit, November 2009 CEDA Evolution

VO Sandpit, November 2009 EO Data Volumes: CEDA

VO Sandpit, November 2009 Big EO Data Datasets are getting bigger AATSR Level 1 + Level 2 Day: 15 GB Year : ~5.5 TB Sentinel-3A core products (land + marine) L1+L2 Day: 2170 GB Sentinel 3: ~790 TB …. 172,000 DVDs... And there’s Sentinel 1-A/B, 2-A/B,.. etc

NERC Big Data NERC Environmental Big Data; BIS allocated £13m capital funding to support ‘Big Data’ Between NERC is investing in: Compute and storage capacities of JASMIN Development of the academic component of CEMS Cloud-based software infrastructure to support environmental science (NERC Environmental Workbench) Environmental Big Data capital assets across the research community: New digital assets, equipment for new data, processing and storage hardware, software to share, explore and visualise data

Access: NERC Data Centres Further Information & data discovery service: British Oceanographic Data Centre NERC Earth Observation Data Centre National Geoscience Data Centre Polar Data Centre Environmental Information Data Centre Solar System Data Centre

JASMIN & CEMS: Big Data Facilities JASMIN (super data cluster) - storage & services (CEDA) - scientific computation - access to high volume & complex data CEMS facility – Climate and Environmental Monitoring from Space

VO Sandpit, November 2009 JASMIN-1 JASMIN is configured as a storage and analysis environment Two types of compute: a virtual/cloud environment, configured for flexibility a batch compute environment, configured for performance Both sets of compute connected to 5 PB of parallel fast disk GWS Lotus Bespoke VMs

VO Sandpit, November 2009 JASMIN-2

VO Sandpit, November 2009 JASMIN-2 NCEO’s Academic CEMS is a Virtual Organisation on JASMIN Data Services Link to Sat Apps Catapult Supporting NERC-wide science NERC community and Met Office Virtual Organisations “Managed” and “Un-managed” Cloud

VO Sandpit, November 2009 Using CEMS on JASMIN-2 JASMIN is carved up into consortia for different areas of NERC science Consortium managers are responsible for approving resource requests Similar process for NERC HPC allocation “EO and Climate Services” is one of 8 consortia GWS Lotus Bespoke VMs Use of JASMIN Unmanaged Cloud

VO Sandpit, November 2009 Who is using CEMS on JASMIN? CEDA/NCEO Data Centre Long term curation and dissemination of NCEO datasets Third party datasets needed by science community Please complete our survey! NCEO projects ESA and EC projects in NCEO community Academic CEMS Usage (June 2014) GWS22 ; 1500 TB VMs48 Login users71 Data download users360 ; 130 TB (1 yr) Talks/posters at this conference: 7 Processing, storage, analysis and dissemination of EO Big Data: typically global long term environmental data from satellites

February 2014: 1,000,000 th job run on Lotus Said Kharbouche, UCL, GlobAlbedo project

VO Sandpit, November 2009 EO Data Volumes: CEDA and CEMS

VO Sandpit, November 2009 Why use CEMS on JASMIN? Storage Processing Fast I/O* Data: CMIP5 archive (>1 PB), CEDA archives (> 1PB) – BADC, NEODC all on the same hardware : SCIENCE Satellite Applications Catapult Link: innovative applications, commercial services, exploitation of research data products, collaboration opportunities : IMPACT * #1 in the world for I/O performance?

Sat Apps Data Discovery Hub

VO Sandpit, November 2009 BIS Consultation On proposals for long- term capital investment in science and research Closes 4 th July

VO Sandpit, November 2009 Thanks for your attention