USU, RENCI, BYU, UNC, UVA, CUAHSI, Tufts, Texas, Purdue, SDSC

Slides:



Advertisements
Similar presentations
HydroServer A Platform for Publishing Space- Time Hydrologic Datasets Support EAR CUAHSI HIS Sharing hydrologic data Jeffery.
Advertisements

This work is funded by the Inland Northwest Research Alliance INRA Constellation of Experimental Watersheds: Cyberinfrastructure to Support Publication.
ICEWATER: INRA Constellation of Experimental Watersheds Cyberinfrastructure to Support Publication of Water Resources Data Jeffery S. Horsburgh, Utah State.
Linking HIS and GIS How to support the objective, transparent and robust calculation and publication of SWSI? Jeffery S. Horsburgh CUAHSI HIS Sharing hydrologic.
CUAHSI HIS Data Services Project David R. Maidment Director, Center for Research in Water Resources University of Texas at Austin (HIS Project Leader)
Components of an Integrated Environmental Observatory Information System Cyberinfrastructure to Support Publication of Water Resources Data Jeffery S.
This work was funded by the U.S. National Science Foundation under grant EAR Any opinions, findings and conclusions or recommendations expressed.
HydroServer A Platform for Publishing Space- Time Hydrologic Datasets Support EAR CUAHSI HIS Sharing hydrologic data Jeffery.
Time Series Analyst An Internet Based Application for Viewing and Analyzing Environmental Time Series Jeffery S. Horsburgh Utah State University David.
Development of a Community Hydrologic Information System Jeffery S. Horsburgh Utah State University David G. Tarboton Utah State University.
Introducing the CUAHSI Hydrologic Information System Desktop Application (HydroDesktop) and Open Development Community Jiří Kadlec, Daniel Ames, Teva Velupillai.
Tools for Publishing Environmental Observations on the Internet Justin Berger, Undergraduate Researcher Jeff Horsburgh, Faculty Mentor David Tarboton,
HydroShare: Advancing Hydrology through Collaborative Data and Model Sharing David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall, Larry.
About CUAHSI The Consortium of Universities for the Advancement of Hydrologic Science, Inc. (CUAHSI) is an organization representing 120+ universities.
Information Requirements for Integrating Spatially Discrete, Feature- Based Earth Observations Jeffery S. Horsburgh Anthony Aufdenkampe, Kerstin Lehnert,
HydroShare: An online, collaborative environment for the sharing of hydrologic data and models David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames,
HydroShare: Advancing Collaboration through Hydrologic Data and Model Sharing David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall, Larry.
Water Web Services David R. Maidment Center for Research in Water Resources University of Texas at Austin Open Waters Symposium Delft, the Netherlands.
HydroShare: An online collaborative environment for the sharing of hydrologic data and models IN11A-1510 We envision that HydroShare will enable more rapid.
Data Interoperability in the Hydrologic Sciences The CUAHSI Hydrologic Information System David Tarboton, David Maidment, Ilya Zaslavsky, Dan Ames, Jon.
Advancing an Information Model for Environmental Observations Jeffery S. Horsburgh Anthony Aufdenkampe, Richard P. Hooper, Kerstin Lehnert, Kim Schreuders,
GIS in Water Resources: Lecture 1 In-class and distance learning Geospatial database of hydrologic features GIS and HIS Curved earth and a flat map.
Water and Catchment Data Services David R. Maidment Center for Research in Water Resources University of Texas at Austin River Science Symposium Swansea,
The CUAHSI Hydrologic Information System Presented by Dr. Tim Whiteaker The University of Texas at Austin 22 February, 2011.
The CUAHSI Community Hydrologic Information System Jeffery S. Horsburgh Utah Water Research Laboratory Utah State University CUAHSI HIS Sharing hydrologic.
GEON2 and OpenEarth Framework (OEF) Bradley Wallet School of Geology and Geophysics, University of Oklahoma
Towards a 3D National Hydrography Dataset Presentation made to the US Geological Survey By David R. Maidment Center for Research in Water Resources University.
HydroShare: Advancing Hydrology through Collaborative Data and Model Sharing David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall, Larry.
Clearing your Desk! Software and Data Services for Collaborative Web Based GIS Analysis David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall,
HydroShare: Advancing Hydrology through Collaborative Data and Model Sharing David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall, Larry.
HydroShare: Advancing Hydrology through Collaborative Data and Model Sharing David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall, Larry.
USU, RENCI, BYU, UNC, UVA, CUAHSI, Tufts, Texas, Purdue, Caktus
CUAHSI HIS: Science Challenges Linking small integrated research sites (
HydroShare: Advancing Hydrology through Collaborative Data and Model Sharing David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall, Larry.
Hydroinformatics Lecture 15: HydroServer and HydroServer Lite The CUAHSI HIS is Supported by NSF Grant# EAR CUAHSI HIS Sharing hydrologic data.
Share and Publish Your Data and Models with HydroShare David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall, Larry Band, Venkatesh Merwade,
Using Python to Retrieve Data from the CUAHSI HIS Web Services Jeffery S. Horsburgh Hydroinformatics Fall 2015 This work was funded by National Science.
Developing a community hydrologic information system David G Tarboton David R. Maidment (PI) Ilya Zaslavsky Michael Piasecki Jon Goodall
Community Approaches to Hydrologic Data and Model Sharing using Cloud Resources David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall,
The CUAHSI Hydrologic Information System Spatial Data Publication Platform David Tarboton, Jeff Horsburgh, David Maidment, Dan Ames, Jon Goodall, Richard.
Hydroinformatics Lecture: HydroServer .NET/PHP
USU, RENCI, BYU, UNC, UVA, CUAHSI, Tufts, Texas, Purdue, SDSC
Under supervision of: David Tarboton
Water Information Sharing and HydroShare
Introduction to HydroShare Workshop CUAHSI Biennial Symposium, July 26
The CUAHSI Community Hydrologic Information System
Sharing models as social objects through HydroShare
Sharing Hydrologic Data with the CUAHSI* Hydrologic Information System
Hydrologic Terrain Processing Using Parallel Computing
Hydrologic Terrain Processing Using Parallel Computing
The CUAHSI Hydrologic Information System and NHD Plus A Services Oriented Architecture for Water Resources Data David G Tarboton David R. Maidment (PI)
INTRODUCTION TO GEOGRAPHICAL INFORMATION SYSTEM
Jeffery S. Horsburgh Utah State University
Lecture 8 Database Implementation
CUAHSI HIS Sharing hydrologic data
Hydroinformatics Lecture 15: HydroServer (and HydroServer Lite)
Water Information Sharing and HydroShare
HydroDesktop: A Key Component of the CUAHSI/CZO HIS for Hydrologic Data Discovery, Visualization, and Analysis Daniel P. Ames, Ph.D. P.E. Idaho State University.
Automation of Input data preparation of TOPNET model using Python
KISTERS TimeSeries HUB
Digital Elevation Model Based Watershed and Stream Network Delineation
Digital Elevation Model Based Watershed and Stream Network Delineation
HydroShare: Advancing Collaboration through Hydrologic Data and Model Sharing David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall, Larry.
David Tarboton, Dan Ames, Jeffery S. Horsburgh, Jon Goodall
Terrain Analysis Using Digital Elevation Models (TauDEM)
Channels, Watersheds, Flow Related Terrain Information
Extending ArcGIS using programming
OAC OAC OAC ACI ACI The HydroShare domain-specific repository for archiving and active management.
Hydrologic Information Systems to discover and combine data from multiple sources for hydrologic analysis David Tarboton Utah State University CUAHSI HIS.
Presentation transcript:

USU, RENCI, BYU, UNC, UVA, CUAHSI, Tufts, Texas, Purdue, SDSC HydroShare: Advancing Hydrology through Collaborative Data and Model Sharing David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall, Larry Band, Venkatesh Merwade, Alva Couch, Jennifer Arrigo, Rick Hooper, David Valentine, David Maidment, Jeff Heard, Pabitra Dash, Tian Gan, Tony Castronova, Stephen Jackson, Cuyler Frisby, Stephanie Mills, Brian Miles USU, RENCI, BYU, UNC, UVA, CUAHSI, Tufts, Texas, Purdue, SDSC http://www.hydroshare.org OCI-1148453 OCI-1148090 2012-2017

Research Areas Hydrologic Information Systems Digital Elevation Model Terrain Analysis Hydrology and Geomorphology Distributed Hydrologic Modeling Snow and glacier melt Non parametric stochastic hydrology Streamflow regimes for stream ecology The Great Salt Lake Data Catalog Desktop Server Metadata Search ODM, WaterML Production Consumption Discovery

Outline Data and computational challenges CUAHSI HIS HydroShare Goals Use cases Resource data model Architecture Web based data, modeling and analysis services Summary Data Analysis Models

The challenge of increasing Digital Elevation Model (DEM) resolution e.g. 50,000 km2 Watershed 1980’s DMA 90 m 102 cells/km2 27 MB 240 MB 1990’s USGS DEM 30 m 103 cells/km2 2000’s NED 10 m 104 cells/km2 2 GB 2010’s LIDAR ~1 m 106 cells/km2 200 GB

Rainfall and Meteorology Water quality Water quantity Data Heterogeneity From dispersed federal agencies From investigators collected for different purposes Different formats Points Lines Polygons Fields Time Series Rainfall and Meteorology Soil water Groundwater The way that data is stored can enhance or inhibit the analysis that can be done We need ways to organize the data we work with Data models GIS

Data intensive models to understand and examine consequences, impacts and effects Strata can be redefined any time From Larry Band

A Digital Divide Data Intensive High Performance Computing Hydrologic Experimentation and Modeling awk grep vi #PBS -l nodes=4:ppn=8 mpiexec chmod #!/bin/bash Do you have the access or know how to take advantage of advanced computing capability? Gateways, Web Interfaces, Software services

HydroShare Goals Enable more rapid advances in hydrologic understanding through collaborative data sharing, analysis and modeling To become a foundational cyberinfrastructure platform for hydrologic research that blends Software as a service Data as a service Models as a service Visualization and analysis services To solve problems of size and scope not otherwise solvable using desktop computing Address community needs

CUAHSI HIS HydroServer – Data Publication HydroCatalog Data Discovery The CUAHSI Hydrologic Information System (HIS) is an internet based system to support the sharing of hydrologic data. It is comprised of hydrologic databases and servers connected through web services as well as software for data publication, discovery and access. HydroServer – Data Publication HydroCatalog Data Discovery Lake Powell Inflow and Storage HydroDesktop – Data Access and Analysis HydroDesktop – Combining multiple data sources

Open Geospatial Consortium Web Service Standards http://www.opengeospatial.org/standards/waterml This document is an OGC® Encoding Standard for the representation of hydrological observations data with a specific focus on time series structures. These standards have been developed over the past 10 years …. by 400 companies and agencies ....

HydroDesktop An open source dotSpatial GIS based desktop client that supports discovery and analysis of hydrologic observations data The service URLs that the HD tool uses are seen in http://hydrodesktop.codeplex.com/SourceControl/changeset/view/475b15a84bf4#Source%2fEPADelineation%2fCallWebService.cs The HD tool uses the Point Indexing Service to find the nearest NHD reach to where the user clicked. This returns a location on that reach. This point location is then used as input to the Navigation Delineation Service to get the watershed, and the Upstream/Downstream Service to get the river lines. The delineation service has two limitations: * It only works up to a certain distance upstream. I think we have it set to 100km. So for large watersheds (those with more than 100km of stream length upstream of where the user clicked), we don't get the most upstream portions of the watershed. * It doesn't delineate exactly to where the user clicked. It delineates to the endpoint of the NHD reach. (Can't remember if it is the clicked reach or the upstream reach -- try it and see.) Uses EPA WATERS Web, Mapping, and Database Services at http://www.epa.gov/waters/geoservices/index.html to delineate Watersheds

Search last 22 years for all data in buffer around watershed

Download and Plot the Data Combining information from multiple sources

Perform an analysis using R At your fingertips the full analysis capability of R data from multiple sources accessed from distributed (cloud) resources. importance of interoperability

But Publishing data using CUAHSI HIS requires access to or setting up a HydroServer Accessing data requires HydroDesktop Generally limited to time series at a point Server Desktop Catalog

Web Services (REST API) HydroShare is a collaborative environment (being developed) for data sharing, analysis and modeling Users Browser Client Django web framework Web Services (REST API) Web Pages iRODS “Network File System” Resource Files User accounts Access control Web based data and software services to overcome Simplify working with large datasets and HPC Overcome platform dependency limitations Avoid software installation limitations Our goal is to make sharing of hydrologic data and models as easy as sharing videos on YouTube or shopping on Amazon.

Collaborative data analysis and publication use case 1 Observers and instruments Analysis Models 2 3 4 5 6 7 Data Publication, Archival, Curation Collaboration Digital Library Observe Analyze Publish (DOI) Store Model Discover and access Collaborate

Collaborative Integrated Modeling x y t Flow Time Flow Time P Pre-processing and model linking Modeling Services (e.g. SWATShare) Data: Links to national and global data sets of essential terrestrial variables (e.g. NASA NEX, HydroTerre) Tools to preprocess and configure inputs Preconfigured models and modeling systems as services Standards for information exchange for interoperability (OpenMI, CSDMS BMI) Tools for Visualization and Analysis Automated reasoning to couple models based on purpose, context, data and resources Data: Links to national and global data sets of essential terrestrial variables (e.g. NASA NEX, HydroTerre) Tools to preprocess and configure inputs (EcoHydroLib, TauDEM, CyberGIS) Preconfigured models and modeling systems as services (SWATShare) Standards for information exchange for interoperability (OpenMI, CSDMS BMI) Tools for visualization and analysis

At its heart, HydroShare is a system for sharing Resources and Collaborating Files and sets of files structured to represent a hydrologic process, model, or element in the hydrologic environment Standard data models enhance interoperability and support functionality “hydro value added” Tools that act on resources to visualize, modify and create new resources Encode standard/best practices Access control and sharing model

Resource Data Model Open Archives Initiative – Object Reuse and Exchange (OAI-ORE) - standards for the description and exchange of aggregations of Web resources BagIt – hierarchical file packaging format designed to support disk-based or network-based storage and transfer of generalized digital content Compatible with DataOne

Types of data to support as resources Resource Types Generic  Geographic Raster  Referenced Time Series (CUAHSI HIS web service link)  Geographic Feature set Multidimensional Space Time dataset River Geometry Sample based observations (ODM2 and CZO) HydroDesktop Project package Scripts Model program Model component Model instance Composite resources x y t

River Channel http://dx.doi.org/10.1016/j.envsoft.2014.12.002

Models Model package Model program Model input Model output Bundled components references existing resources Model program executable entity may consist of submodules and other complex relationships Model input input required by a program files, parameters, etc... Model output outputs produced by a program files, plots, etc...

Model Execution in HydroShare Input and output Hydroshare resources Link input, output, and program resources to create model packages Execution of model package within the HydroShare environment to create "new" resources Output CREATE OUTPUT CREATE PACKAGE EXECUTE Package Package Program Input Program Input Output

Model and Output (.zip, .xls or .txt file) SWAT Models (.zip file) Metadata Model and Output (.zip, .xls or .txt file) Location SWATShare

Demo

Collaborative functionality

Clearing your desk. The trend towards network (cloud) computing. Can we deliver Hydrologic Analysis functionality as a service over the web? Data Sources Server Software as a Service Functions and Tools Users Based on slide from Norm Jones

Terrain Analysis Raw DEM Pit Removal Flow Field Flow Related Terrain Information This slide shows the general model for deriving flow field related derivative surfaces from digital elevation data. The input is a raw digital elevation model, generally elevation values on a grid. This is basic information used to derive further hydrology related spatial fields that enrich the information content of this basic data. The first step is to remove sinks, either by filling, or carving. Then a flow field is defined. This enables the calculation of flow related terrain information. Watersheds are the most basic hydrologic landscape elements

TauDEM http://hydrology.usu.edu/taudem/ 5/8/2018 Stream and watershed delineation Multiple flow direction flow field Calculation of flow based derivative surfaces MPI Parallel Implementation for speed up and large problems Open source platform independent C++ command line executables for each function Deployed as an ArcGIS Toolbox with python scripts that drive command line executables CSDMS Cluster Implementation Open Topography and XSEDE implementation http://hydrology.usu.edu/taudem/

Using TauDEM today requires Expertise in Hydrologic DEM analysis The software ArcGIS licenses (for ArcGIS plugin) The ability to install software TauDEM command functions with MPI installation Compilation for other platforms Sufficient Hardware (RAM and Disk) The data (uncompressed GeoTIFF, projected, consistent grid size and spatial reference)

Moving TauDEM to the cloud CyberGIS http://gateway.cigi.illinois.edu/ Open Topography www.opentopography.org

Parallel TauDEM Functions MPI, distributed memory paradigm Row oriented slices Each process includes one buffer row on either side Each process does not change buffer row Improved runtime efficiency Capability to run larger problems

XSEDE Extended Collaboration Support Services (ECSS) improvements Reconfiguration of multiple file header reads to be broadcast from single node Reconfiguration of output files to avoid spanning processors Execution time of the three most costly TauDEM functions on a 36GB DEM dataset. I/O Time Comparison (before / after; in seconds) for 2 GB DEM StreamNet DinfFlowdir D8Flowdir #cores Compute Header Read Data Read Data Write 32 42.7 / 42.8 193.5 / 3.8 0.4 / 0.4 153.5 / 3.5 64 35.3 / 34.8 605.5 / 3.9 1.5 / 1.1 160.2 / 2.3 128 33.7 / 33.0 615.2 / 2.6 0.9 / 1.0 173.2 / 2.3 256 37.5 / 38.0 831.7 / 2.3 0.5 / 0.9 391.3 / 1.6

TauDEM Wetness index from Open Topography Eel River ln(a/S) a in meters

TauDEM in CyberGIS http://gateway.cigi.illinois.edu/

Select the products you want The wizard configures the sequence of functions to run to get the result

Results displayed in browser

Computation via Python Client calling API Input Python session on desktop but data and analysis on server Result Assumptions Research hydrologic modelers should be comfortable using a scientific programming language like Python or R. Hydrologic modelers are not expert in HPC systems and learning this is a barrier to the use of HPC. Hydrologic modeling is data intensive (large datasets from a range of sources)

Architecture Django web application framework (Python) iRODS “Network File System” iCAT Zone User Accounts Authentication and Authorization Infrastructure (AAI) for Access Control Web Services (REST API) Django web application framework (Python) Web Pages iRODS Native REST API Clients (3rd party), interoperable systems and web tools HydroDesktop SEAD Tethys CyberGIS BiG CZ SSI SESYNC GI Venture CI-WATER RENCI Utah www.hydroshare.org tools.hydroshare.org Distributed Resource Servers Files Science Metadata Discovery (using e.g. ElasticSearch via MSVC plugin) Others

Summary A new, web-based system for advancing model and data sharing Sharing features to HydroDesktop Access multiple types of hydrologic data using standards compliant data formats and interfaces Flexible discovery functionality Model sharing and execution Facilitate and ease access to use of high performance computing Social media and collaboration functionality Links to other data and modeling systems

Thanks to the HydroShare team! USU RENCI/UNC CUAHSI BYU Tufts UVA Texas Purdue SDSC The HydroShare project is part of a broad effort in CUAHSI in the area of Hydrologic Information Systems. We have a team of developers and domain scientists from eight universities working on HydroShare. This is part of the even broader focus in NSF on data management, Cyberinfrastructure and sustainable software. http://hydroshare.cuahsi.org OCI-1148453 OCI-1148090 2012-2017