Future Data Access and Analysis Architectures – Discussion

Slides:



Advertisements
Similar presentations
ASCR Data Science Centers Infrastructure Demonstration S. Canon, N. Desai, M. Ernst, K. Kleese-Van Dam, G. Shipman, B. Tierney.
Advertisements

1 Alternate Title Slide: Presentation Name Goes Here Presenter’s Name Infrastructure Solutions Division Date GIS Perfct Ltd. Autodesk Value Added Reseller.
Jisc Data Spring Pitch: Cloud Workbench Ben Butchart EDINA.
National Earth Science Infrastructure Program AuScope Limited Headquarters School of Earth Sciences University of Melbourne Victoria 3010 Tel
National Spatial Data Infrastructure The Spatial Information Services Stack Dr Robert Woodcock.
AIP-2 Kickoff Workshop End-to-end use case: Discovery, access, and use with variations Doug Nebert GEOSS AIP-2 Kickoff September 2008.
2010 WGISS Expected Outcomes Matha Maiden and Pakorn Apaphant Oct 2, 2009 WGISS 28 Pretoria, South Africa.
WGISS Richard Moreno CNES SIT Workshop Agenda Item #7 WGISS working group CEOS SIT Technical Workshop EUMETSAT, Darmstadt, Germany 17 th -18 th September.
CEOS Data Cube Open Source Software Status Brian Killough CEOS Systems Engineering Office (SEO) WGISS-40 Harwell, Oxfordshire, UK September 30, 2015 (remote.
SEE Grid Information Services Roadmap Dr Rob Woodcock CSIRO.
Page 1 CSISS Center for Spatial Information Science and Systems 09/12/2006 Center for Spatial Information Science and Systems (CSISS) George Mason University.
OGC’s role in GEO: Results from the Architectural Implementation Pilot (AIP) George Percivall Open Geospatial Consortium GEO Task IN-05 Coordinator
Future Data Architectures FDA-AHT SIT Tech Workshop 2016 Side Meeting CEOS Strategic Implementation Team Tech Workshop Oxford, UK 13 th September 2016.
Incoming Themes for 2017 Frank Kelly, USGS
Future Data Access and Analysis Architectures - Overview
NextGEOSS data hub incl. alpha release
CNES DATA CUBE ERWANN poupart
USGS EROS LCMAP System Status Briefing for CEOS
Committee on Earth Observation Satellites
By: Raza Usmani SaaS, PaaS & TaaS By: Raza Usmani
WGISS Working Group for Information Systems & Services
GISELA & CHAIN Workshop Digital Cultural Heritage Network
Moderate Resolution Sensor Interoperability (MRI) Framework
Data Analytics and CERN IT Hadoop Service
CEOS Data Cube Report Agenda Item #7 September 13, 2017
SEO Capacity Building Agenda #15 Brian Killough
WGISS Working Group for Information Systems & Services
INTAROS WP5 Data integration and management
FDA in 2017 A CEOS Chair Theme
Bridges and Clouds Sergiu Sanielevici, PSC Director of User Support for Scientific Applications October 12, 2017 © 2017 Pittsburgh Supercomputing Center.
Presentation on Copernicus Dissemination
Space Data Services Session 2: Space Data Country Outreach and Delivery Agenda Item #3 Brian Killough CEOS Systems Engineering Office (SEO) February 25,
Graduation Project Kick-off presentation - SET
GEOSS EVOLVE Osamu Ochiai GEO Secretariat CEOS WGISS-43
Bringing processing Close to the data Richard MORENO CEOS - WGISS
FDA context Steven Hosford On behalf of FDA Co-chairs
Future Data Architectures Status Report
WGCV Work Plan Actions K. . Thome NASA WGCV Plenary # 43
CEOS Workplan FDA actions
Exploitation Platforms and Common Reference Architecture
Open Data Cubes Cloud Services Experiences and Lessons Learned
Topics NOAA support for: CWIC Infrastructure NOAA as a Data Provider
FDA Objectives and Implementation Planning
GEOSS Future Products Workshop March 26-28, 2013 NOAA
Keep Your Digital Media Assets Safe and Save Time by Choosing ImageVault to be Your Digital Asset Management Solution, Hosted in Microsoft Azure Partner.
Committee on Earth Observation Satellites
Purpose and Objectives of the FDA Big Data Workshop
WIS Strategy – WIS 2.0 Submitted by: Matteo Dell’Acqua(CBS) (Doc 5b)
Committee on Earth Observation Satellites
FDA Update & Implications
Review of Chair Priorities
Interoperability and standards for statistical data exchange
Common Solutions to Common Problems
SESSION 2: GLOBAL DATA FLOWS STUDY
FDA-08 FDA Whitepaper Update
FDA Topics Going Forward…???
GISELA & CHAIN Workshop Digital Cultural Heritage Network
Quoting and Billing: Commercialization of Big Data Analytics
MSDI training courses feedback MSDIWG10 March 2019 Busan
Analysis Ready Data Strategy for CEOS
Brian Killough (NASA, SEO), Mirko Albani (ESA, WGISS Chair)
Microsoft Virtual Academy
WGISS 47 FDA Elements Demonstration Workshop
WP6 – EOSC integration J-F. Perrin (ILL) 15th Jan 2019
Open Data Cube Demo and FDA Interfaces
VC/WG/AHT Working Day Outcomes
Roadmap and short term activities on interoperability of Data Cubes
NOAA OneStop and the Cloud
SEO Report to WGISS-48 Brian Killough
CEOS Analysis Ready Data Strategy
Presentation transcript:

Future Data Access and Analysis Architectures – Discussion Dr Robert Woodcock – WGISS 43 CSIRO

What do WE need to know? Examples of topics from OGC ORM: Enterprise view Why and the value proposition Geospatial Information Information specifications Spatial referencing Geographies features Geospatial Services Services architectures OGC Web Services Reusable Patterns for Deployment Publish, find, bind pattern Geospatial portal and clients Spatial Data Infrastructures Geoprocessing workflows Implementation of standards

Future Data Architectures Principles “Analysis for the masses” Interaction Discovery and Download (files) User provides compute In-place analysis of ready-to- use data User application, data and compute combine from different parties on-demand Integration Single sensor discovery Integration a user problem Comparable observations Global multi-sensor analysis routine Interoperability Discovery of files Emphasis on access Refined discovery of pixels Emphasis on usability for analysis Interfaces Independent application development over data Agency stores and distributes data APIs, Virtual Labs enabled by standards Use of third parties for storage, distribution and analysis This slide shows the architectural principles that change. Discussion will be mostly providing an brief explanation of each component – most likely only emphasising a few as there will likely not be time for all. In some respects the before and after picture is really just a repeat of slides 4 and 5 but it starts to tie into architectural practices – so it leads us from the survey and analysis into the next step – how to make it work in practice. Which the AHT has touched on and is recommending is explored more fully in 2017 with both pilots and more technical work…so leads into next slide.

What do WE need to know? Themes that need attention Doing and learning Pilots of different architectures/approaches – engaging users and other key stakeholders (e.g. partner governments, world bank, etc). Includes producing “ARD” – make it real to find the challenges. Technical frameworks Architectural recommendations and best practices. ARD framework. User Requirements framework. Community dimension Fostering engagement, e.g. open source community, algorithm sharing, etc. Measuring engagement Strategic dimension What do we need to do together to support uptake of satellite EO data from the “CEOS Constellation” e.g. in support of key global agendas? What should CEOS do to support agencies to do their own things, e.g. best practices etc? What should be ‘outside scope’ for CEOS? Connection to CARD4l

CEOS Awesomeness #1 Sensor Integratability Services Application packages for TEP, OGC Contect OGC TB13 Single sign-on authentication and identity federation Needs to be machine to machine too Data Cube tech (and analysis tech) Open Data Cube SciDB Earth Server (Rasdamann) Geohazard TEP Cassandra and related tech stack Software Libraries and Languages Python and R Apache spark Architecture – a lot of similarity. Reference architecture, shared services implementation?

CEOS Awesomeness #2 Lessons and experience to be shared: CNES Orfeo toolbox Jupyter notebooks (Python and R) Machine learning libraries (MLLib) TEP Lessons Savings from Cloud based test environments (and SPOT) Design patterns – there are a lot of similar patterns Web Services pyWPS, WPS, Geoserver, W*S, Domain Services too… Tech futures Pyramidal Data Cube for multi-resolution (and OGC DGGS) DC Consumer and Server split and deployment Containerisation (Docker) S3- native architectures

CEOS Awesomeness #3 Global Data Cube network Interoperability experiment? Different Data Cube technology with same workflow? Analytics Tools/Services BFAST, WTSS, TWDTW, CCDC, … A2Z Algorithm portability and sharing Registry of solutions? Data provenance and ancillary data Cloud Masks, etc pixel level General and pixel level consistency Web Portals All different branding for good reason but… Can we use similar components and frameworks (e.g. Angular.js)

CEOS Awesomeness #4 Validation of results – can we work on this together? Super sites? CBERS ARD – Brian getting onto this already Capacity building? Files vs Web Objects (e.g. AWS S3) – performance, organisation, mindset challenges, experience lessons Compliance-as-a-Service Govt-commercial relationship for common large archives in Cloud User characterisation and success metrics – making it manageable At scale this is hard – much to learn though can be done

How do we get there? Grow a community of contributors to underpin FDA technology (e.g. Data Cube) … CEOS, countries, domain experts Prototype demonstration projects … 3 now + more Develop a FDA Learning Center with documentation, training tools, and a user forum … end of 2017 Conduct frequent training workshops and webinars? Agency focus on ARD creation and provision - WGCV Leverage WGISS exisiting and expand capabilities Engage key stakeholders who can help with scaling and sustainability Continue and support existing WGISS efforts on: Discovery Search Engine Optimization  (search relevancy, keyword search, persistent identifiers) Access Common Standards for Interoperability of Product Formats (metadata/data) and Application Program Interface (API) Exploration of Emerging Big Data services including Cloud Computing