Location Powers: Big Data Webinar

Slides:



Advertisements
Similar presentations
V. Chandrasekar (CSU), Mike Daniels (NCAR), Sara Graves (UAH), Branko Kerkez (Michigan), Frank Vernon (USCD) Integrating Real-time Data into the EarthCube.
Advertisements

, Increasing Discoverability and Accessibility of NASA Atmospheric Science Data Center (ASDC) Data Products with GIS Technology ASDC Introduction The Atmospheric.
, Implementing GIS for Expanded Data Accessibility and Discoverability ASDC Introduction The Atmospheric Science Data Center (ASDC) at NASA Langley Research.
material assembled from the web pages at
Construction of an Environmental Information Network Information Systems May 2004, St. Petersburg.
Big Data Analytics Large-Scale Data Management Big Data Analytics Data Science and Analytics How to manage very large amounts of data and extract value.
Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.
Directions in eScience Interoperability and Science Clouds June Interoperability in Action – Standards Implementation.
Copyright © 2016 Pearson Education, Inc. Modern Database Management 12 th Edition Jeff Hoffer, Ramesh Venkataraman, Heikki Topi CHAPTER 11: BIG DATA AND.
Copyright, Open Geospatial Consortium Making Location Count Peer-to-Peer File Sharing An Answer to the SDI blues North Carolina GIS Conference February,
OGC ® Geospatial Track Wrap-up Apache Big Data Conference.
OGC’s role in GEO: Results from the Architectural Implementation Pilot (AIP) George Percivall Open Geospatial Consortium GEO Task IN-05 Coordinator
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
The Derivitec Risk Portal Provides Powerful, Cost-Effective Risk Management Solutions, Powered by Azure, that Deploy in Minutes MICROSOFT AZURE ISV PROFILE:
® ® Big Geospatial Data: Open Approaches to Loosely-Coupled Ecosystems George Percivall CTO, Chief Engineer Open Geospatial Consortium
1 Panel on Merge or Split: Mutual Influence between Big Data and HPC Techniques IEEE International Workshop on High-Performance Big Data Computing In conjunction.
Data Analytics (CS40003) Introduction to Data Lecture #1
George Percivall Delft, The Netherlands 22 March 2017
Geoffrey Fox Panel Talk: February
AuraPortal Cloud Helps Empower Organizations to Organize and Control Their Business Processes via Applications on the Microsoft Azure Cloud Platform MICROSOFT.
Datacube projects in ESA
Open Platform 3.0™ Overview – 3rd August 2016 Dr Christopher J Harding
2nd GEO Data Providers workshop (20-21 April 2017, Florence, Italy)
USGS EROS LCMAP System Status Briefing for CEOS
Connected Living Connected Living What to look for Architecture
Viewing Data-Driven Success Through a Capability Lens
DocFusion 365 Intelligent Template Designer and Document Generation Engine on Azure Enables Your Team to Increase Productivity MICROSOFT AZURE APP BUILDER.
Interoperability between Earth observations and Earth science models Session Introduction ESIP 2016 Winter Meeting, January 6-8, 2016 Liping Di Center.
Status and Challenges: January 2017
Open Weather Weather on the Web
Overview of MDM Site Hub
Free Cloud Management Portal for Microsoft Azure Empowers Enterprise Users to Govern Their Cloud Spending and Optimize Cloud Usage and Planning MICROSOFT.
Making maps for the world’s urban environments
Flanders Marine Institute (VLIZ)
Connected Living Connected Living What to look for Architecture
NSF start October 1, 2014 Datanet: CIF21 DIBBs: Middleware and High Performance Analytics Libraries for Scalable Data Science Indiana University.
IreckonU Offers a Powerful Hospitality Software Solution, Seamlessly Integrating Existing Hospitality Systems and Services on the Powerful Microsoft Azure.
Data Quality: Practice, Technologies and Implications
Sell Global, Feel Local by Leveraging eShopWorld
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Copyright © 2016 Open Geospatial Consortium
With Help from the Microsoft Azure Cloud,
SMART GROUND platform overview
NSF : CIF21 DIBBs: Middleware and High Performance Analytics Libraries for Scalable Data Science PI: Geoffrey C. Fox Software: MIDAS HPC-ABDS.
iSERVOGrid Architecture Working Group Brisbane Australia June
Designed for Big Data Visual Analytics, Zoomdata Allows Business Users to Quickly Connect, Stream, and Visualize Data in the Microsoft Azure Platform MICROSOFT.
Yellowfin: An Azure-Compatible Business Intelligence Platform That Connects People with Their Data for Better Decision Making MICROSOFT AZURE APP BUILDER.
Introduction to D4Science
On-Premises, or Deployed in a Hybrid Environment
Tutorial Overview February 2017
DeFacto Planning on the Powerful Microsoft Azure Platform Puts the Power of Intelligent and Timely Planning at Any Business Manager’s Fingertips Partner.
Accelerate Your Self-Service Data Analytics
FDA Objectives and Implementation Planning
Built on the Powerful Microsoft Azure Platform, the SiouxApp “Project-Server” Helps to Manage Projects and More with App Enhancement Tools MICROSOFT AZURE.
Dell Data Protection | Rapid Recovery: Simple, Quick, Configurable, and Affordable Cloud-Based Backup, Retention, and Archiving Powered by Microsoft Azure.
Carl Data Solutions Collects Utility Sensor and Meter Data to Provide Advanced Reporting, Alarming, and Analytics with Microsoft Azure MICROSOFT AZURE.
One-Stop Shop Manages All Technical Vendor Data and Documentation and is Globally Deployed Using Microsoft Azure to Support Asset Owners/Operators MICROSOFT.
Scanning the environment: The global perspective on the integration of non-traditional data sources, administrative data and geospatial information Sub-regional.
XtremeData on the Microsoft Azure Cloud Platform:
WIS Strategy – WIS 2.0 Submitted by: Matteo Dell’Acqua(CBS) (Doc 5b)
Improve Patient Experience with Saama and Microsoft Azure
$1M a year for 5 years; 7 institutions Active:
Bird of Feather Session
School Districts Can Analyze and Report on Data Across Multiple Systems with EdWire, a Powerful Integration Solution that Utilizes Microsoft Azure MICROSOFT.
ArcGIS Online – The Road Ahead
Brian Killough (NASA, SEO), Mirko Albani (ESA, WGISS Chair)
Copyright © 2019 Open Geospatial Consortium
Advanced Geospatial Techniques: Aiding Earth Observation Applications
Big Data, Simulations and HPC Convergence
UNIT 6 RECENT TRENDS.
Presentation transcript:

Location Powers: Big Data Webinar 22 November 2016

Agenda for Today’s Webinar Introduction to Location Powers and Webinar Denise McKenzie, OGC Summary of Sept 20th Workshop George Percivall, OGC Management and Dissemination of Earth Observation Data in a Big Data World Jeff Walter, NASA Exploring Strategies For Optimizing Knowledge Derivation From Imagery Dan Getman, DigitalGlobe Summary and next steps on Big Data

Summary of Sept 20th Workshop Location Powers: Big Data Copyright © 2016 Open Geospatial Consortium

Location Powers:Big Data, 20 Sep 2016

Use Cases for Big Geo Data High Velocity Ingest Geospatial Databases Entity-oriented Spatial-temporal analytics Grid-oriented Spatial-temporal analytics Feature Fusion GeoAnalytics, Machine Learning Remote sensed data processing Machine Learning Spatial Modeling IoT Message Streaming Built environment models Array databases Users and consuming apps Social Media Message Processing Observation Sources NoSQL databases Integrated environmental models ETL Stream processing using RDF Graph databases Modeling and simulation Wide Area Motion Imagery SQL databases

Sessions 1 Opening; 2 Obtaining Big Data

Population Distribution and Dynamics Modeling LandScan Global Ambient Population Distribution (~ 1km) Increase in spatial and temporal resolution Top Down and Bottom Up Approach Ambient and Day/Night Population Distribution (~ 90 m) LandScan HD Rapid Settlement Identification and Characterization Facility Level Population Density Population Density Tables Settlement Mapping Slide: Jibo Sanyal, ORNL

Inputs to Geospatial Big Data NIST Public Big Data Working Group with 5 working groups: Requirements and Use Cases, Definitions and Taxonomies, Reference Architecture, Security and Privacy and Technology Roadmap 30% of uses cases were geospatial 80% of use cases were streaming Follow up activities extending work and building exemplar use cases defined with DevOps so can be used on multiple infrastructures: HPC, Docker, OpenStack, AWS Two Streaming workshops at http://streamingsystems.org/ Many important streaming geospatial use cases NSF SPIDAL (Scalable Parallel Interoperable Data Analytics Library) project developing HPC-ABDS High Performance Computing enhanced Apache Big Data Stack Slide Source: Geoffrey Fox, Indiana Univ.

Session 3. Maintaining Big Data

Big Data: Driving Forces 09/20/2016 Copyright 2016, JCC Consulting, Inc. Big Data: Driving Forces Inexpensive storage of large volumes of data Inexpensive compute power Next Generation Analytics Moving from off-line to in-line embedded analytics Explaining what happened Predicting what will happen Operating on Data at rest – stored someplace Data in motion – streaming Multiple disparate data sources Look at available data and wonder what answers are hidden there Slide Source: Keith W. Hare, JCC Consulting, Inc.

Copyright 2016, JCC Consulting, Inc. 09/20/2016 Copyright 2016, JCC Consulting, Inc. “Big Data” Data Types Traditional Data Types Character Numerical Date/Time/Timestamp Large Objects – LOB/BLOB/CLOB “Big Data” Data Types Multi-dimensional arrays Images/video Documents  Loosely formatted data Objects Spatial Slide Source: Keith W. Hare, JCC Consulting, Inc.

Apache Projects Geospatially Enabled* *not exhaustive Slide Source: Rob Emanuele, Azavea #LPBigData

Slide Source: Rob Emanuele, Azavea #LPBigData

The Land Change Monitoring Assessment and Projection (LCMAP) information system Slide Source: Glenn Guempel, USGS #LPBigData

Where We Want To Be Download as Last Resort Mentality The Land Change Monitoring Assessment and Projection (LCMAP) information system Where We Want To Be Download as Last Resort Mentality Store data in unzipped, optimal formats ready for direct processing by standard services or custom processes. Provide basic visualization, analysis and extraction functions through services on an open platform. The platform additionally provides the potential processing capacity for building unforeseen custom workflows and processes against big data. Analysis Ready Data We believe over years the download mentality will diminish. Storing data in a ready-to-be-used format will allow users to access data without downloading. Service Functions will be available for basic visualization, analysis and extraction of data. Only download what is needed - perhaps the results rather than all the raw data. Virtual Platforms, like current commercial clouds, will mature and provide cost-effective, on-demand capacity to process big data. Custom processes and workflows can be supported by allowing users to spin up large infrastructure components, process the data, and shutdown without ongoing costs. Slide Source: Glenn Guempel, USGS #LPBigData

Session 4. Analyzing Big Data

Earth Server: Datacubes At Your Fingertips Intercontinental initiative: EU + US + AUS started 2011 Agile Analytics on 3D, 4D Earth & Planetary datacubes Rigorously standards: OGC WMS + WCS + WCPS EU rasdaman + US NASA WorldWind 100s of TB sites now, next: 1+ PB Uni Jacobs PML NCI Australia ESA MEEO MWF EC

Science & GIS Tool Interfacing General-purpose scientist tools: Java, C++ python, R (under work) Geo tools: MapServer, GDAL, QGIS, OpenLayers, Leaflet, NASA WorldWind, ... OGC WCS Core & INSPIRE WCS Reference Implementation Can interface to all tools supporting OGC‘s „Big Geo Data“ standards suite

Moving Features Location data (ID of object, latitude, longitude, time) is one of the typical bigdata: Moving Features Most people have cell-phone (smart phone) Vehicle navigation system measures car locations Vessels and aircrafts locations are managed due to security And so on. The demand for Moving Features is very rapidly increasing Many Applications Moving Features Mobile Objects Disaster management Road traffic control

Applications of Moving Features bigdata Many kinds of Moving Features are used…, “Data integration” is a key point to produce more value Disaster management Traffic management Integration of tsunami and evacuation simulation Aircrafts, vessels, vehicles, and pedestrians traffic Indoor pedestrian flow Sports Tracks of visitors to shopping-malls are useful for marketing Tracks of soccer players and a ball are useful for considering tactics

L O C A T E Geo-enrichment Allows a wide variety of datasets to be appended to a data record using a common spatial ID. - What are the property attributes of this insured property? - What demographic group does this customer belong to? - What businesses are in this area of poor network coverage? Analytics Reduce the complexity of billions of transactional records by assigning data to geographic bins and aggregating results. - Is average 4G network coverage in this area better than a competitor? - Is the accumulated exposure at risk of hurricane damage too high? - Is this data point inside or outside of a geofence?

Session 5. Big Data Applications Panel

Emergent Themes from Workshop Loosely-coupled PB archives based on open standards for rapid geospatial information product creation at any scale Analysis Ready Data We live in a download mentality. How do we move to answering questions Focus shifting from understanding what happened last week to being able to predict what will happen next week Take better advantage of developments in Big Data Proper, which is only tangentially interested in Big Geo Data Multiple Applications: Telecommunications, property casualty insurance, financial services, Energy Monitoring and prediction, Population Dynamics, Settlement Mapping Input to Testbed 13, e.g., SpaceNet

Agenda for Today’s Webinar Introduction to Location Powers and Webinar Denise McKenzie, OGC Summary of Sept 20th Workshop George Percivall, OGC Management and Dissemination of Earth Observation Data in a Big Data World Jeff Walter, NASA Exploring Strategies For Optimizing Knowledge Derivation From Imagery Dan Getman, DigitalGlobe Summary and next steps on Big Data

© 2016 Open Geospatial Consortium OGC Actions on Big Data Location Powers: Big Linked Data Workshop Delft, 22 March 2017 Publish “Big Geo Data - White Paper” Apply/extend OGC Standards to Big Data WMS/WMTS, WFS, WCS/WCPS, WPS Moving Features Encoding Discrete Global Grid Systems (DGGS) Conduct OGC Innovation Program Testbeds Engineering Reports into Best Practices OGC Testbed 13 Coordinate: JTC 1 Big Data, Apache, Location Tech OGC Big Data Domain Working Group © 2016 Open Geospatial Consortium 25

OGC Big Data Domain Working Group Public forum for geospatial Big Data interoperability, access, and especially analytics. Encourage collaborative development among participants representing many organizations and communities, Ensure appropriate liaisons to other Big Data relevant working groups, both inside and outside OGC. http://www.opengeospatial.org/projects/groups/bigdatadwg E-mail list - Open to public: https://lists.opengeospatial.org/mailman/listinfo/bigdata.dwg © 2016 Open Geospatial Consortium 26

Agenda for Today’s Webinar Introduction to Location Powers and Webinar Denise McKenzie, OGC Summary of Sept 20th Workshop George Percivall, OGC Management and Dissemination of Earth Observation Data in a Big Data World Jeff Walter, NASA Exploring Strategies For Optimizing Knowledge Derivation From Imagery Dan Getman, DigitalGlobe Summary and next steps on Big Data