National Center for Supercomputing Applications University of Illinois at Urbana–Champaign Great Lakes to Gulf Virtual Observatory Marcus Slavenas September 2nd, 2015
Outline Motivation / Funding Architecture Front End Interface Data – Sources and Parsing Questions
Motivation and Funding Hypoxic Zone Aggregate Sources for: Scientists Decision Makers
Software Architecture Long Term Archive Geospatial Database Mongodb: Raw Data Postgres: Geospatial Cache Clowder: Web service (API) Data Manage Geodashboard: Visualize Search Retrieve Parsed DataRaw Data
Explore the Data Explore Data By: Sources Reaches Watershed
Explore Data Time Scale Binning Nitrate Load Cumulative Load
Calculations Gap Filling Load Calculation
Compare
Search Location Time Source Parameter
Search – River Reaches and Watershed
Raw Data Archive
Raw Files - Provenance Trail
CURRENT DATA SOURCES US Geological Survey Nitrate, Discharge -> Load (More Coming) ~Real Time U.S. Army Corps of Engineers Long Term Monitoring Program Quarterly National Oceanic and Atmospheric Administration Temperature ~Real Time Water Quality Portal (USGS, USEPA, USDA) Mostly for historical data Great Rivers Ecological Observation Network Water Quality and Environmental Data ~Real Time MORE…
Extract Transform Load (ETL) Python Scripting Extract 1.Through Web Service 2.From Files Generalization Transform All data parsed to similar format JSON/GEOJSON Load Through Clowder Web Service
Datapoint ISO 8601 With timezone Open JSON document GeoJSON Geolocation
Data Storage Structure/Format DatapointStreamSensor/Site
Provenance Trail
Questions? Contact Michael Brennan Marcus Slavenas Live Site