National Center for Supercomputing Applications University of Illinois at Urbana–Champaign Data Management for the Mississippi River Luigi Marini September 2nd, 2015
Great Lakes to Gulf (GLTG) Virtual Observatory Build on existing technology stack Live data streaming of 5 GREON buoys on the Mississippi River, and 2 GREON buoys in Illinois watersheds Historical data from the USGS, state agencies, water utilities Collaboration and funding:
Outline Goals Data sources Features Software Architecture Data Ingestion
Scalable Data Management Ability to grow in data size and data types Long Term Archive Geospatial Database Raw Data Geospatial Cache Long Term Archive Visualize Search Retrieve
Historical and Real Time Data 1 timeContinuous
DATA SOURCES CURRENTLY AVAILABLE
Water Quality Portal (USGS, USEPA, USDA) Data accessed via web services Parameters Nitrogen nitrate All other nutrients (coming soon) Other water quality parameters (coming soon)
United States Geological Survey Daily measurement Water discharge Nitrate nitrogen Nitrate Nitrogen Load Nitrate Nitrogen Cumulative Load Other water quality parameters (coming soon)
U.S. Army Corps of Engineers Long term Monitoring program
Great Rivers Ecological Observation Network Parameters water temperature, oxygen, conductivity, turbidity, chlorophyll-a, blue-green algae, nitrate, weather information Data streamed near real time 7 GREON platforms are currently deployed, positioned at La Crosse, Wi (2), Alton, IL (2), Cape Girardeau, MO (1) Carlisle Lake, IL (1), Lake Decatur, IL (1)
National Oceanic and Atmospheric Administration Data streamed near real time Temperature (More water quality parameters coming soon)
Other Data Sources?
FEATURES
Explore the Data
Search
Search Results
Data Archive
Provenance Trail
SOFTWARE ARCHITECTURE
Flexible and Robust Backend Historical Data Time Location Based Space Flexibility Flexible Data Representation RequirementsDimensions Discovery Slice and Dice Across All Dimensions
Flexible and Robust Backend Interact with backend in the most natural way for the browser Do some processing (e.g. averages) in the browser Not require mapping between formats Clear separation between client and server HTTP-based web service JavaScript Object Notation (JSON) GeoJSON! location & flexible attributes What about time? Browser GeoDashboard Server
Datapoint ISO 8601 With timezone Open JSON document GeoJSON Altitude
Datapoint
Data API Datapoint Stream Sensor/Site
Search HTTP GET /geostreams/datapoints? geocode= , ,74.759& since= :00:00& until= :00:00& sources=epa& attributes=NNN, mg/l& attributes=Si, mg/l& attributes=TP, ug/l& format=json/csv
DATA INGESTION
Add Datapoint HTTP POST /geostreams/datapoints In body:
Data Archive and Geotemporal API Imaginations unbound Raw file ER58.cnv Raw file ER63s.cnv Upload Data Archive GREON software (datalogger) Geotemporal API External Catalog Analytics Code Sensor
Data Proxy Geotemporal API ScriptService
Data Proxy Services can be written in any language HTML / JSON The most common standards on the web Caching the data For fast queries Each data point carries with it where it came from
Open source Software
Questions? Contact Michael Brennan Live Site