Water Information Sharing and HydroShare

Slides:



Advertisements
Similar presentations
HydroServer A Platform for Publishing Space- Time Hydrologic Datasets Support EAR CUAHSI HIS Sharing hydrologic data Jeffery.
Advertisements

Sharing Hydrologic Data with the CUAHSI Hydrologic Information System Support EAR CUAHSI HIS Sharing hydrologic data David.
ICEWATER: INRA Constellation of Experimental Watersheds Cyberinfrastructure to Support Publication of Water Resources Data Jeffery S. Horsburgh, Utah State.
Linking HIS and GIS How to support the objective, transparent and robust calculation and publication of SWSI? Jeffery S. Horsburgh CUAHSI HIS Sharing hydrologic.
CUAHSI HIS Data Services Project David R. Maidment Director, Center for Research in Water Resources University of Texas at Austin (HIS Project Leader)
Components of an Integrated Environmental Observatory Information System Cyberinfrastructure to Support Publication of Water Resources Data Jeffery S.
This work was funded by the U.S. National Science Foundation under grant EAR Any opinions, findings and conclusions or recommendations expressed.
Crossing the Digital Divide
HydroServer A Platform for Publishing Space- Time Hydrologic Datasets Support EAR CUAHSI HIS Sharing hydrologic data Jeffery.
Development of a Community Hydrologic Information System Jeffery S. Horsburgh Utah State University David G. Tarboton Utah State University.
Introducing the CUAHSI Hydrologic Information System Desktop Application (HydroDesktop) and Open Development Community Jiří Kadlec, Daniel Ames, Teva Velupillai.
Deployment and Evaluation of an Observations Data Model Jeffery S Horsburgh David G Tarboton Ilya Zaslavsky David R. Maidment David Valentine
HIS Team and Collaborators University of Texas at Austin – David Maidment, Tim Whiteaker, Ernest To, Bryan Enslein, Kate Marney San Diego Supercomputer.
An End-to-End System for Publishing Environmental Observations Data Jeffery S. Horsburgh David K. Stevens, David G. Tarboton, Nancy O. Mesner, Amber Spackman.
Tools for Publishing Environmental Observations on the Internet Justin Berger, Undergraduate Researcher Jeff Horsburgh, Faculty Mentor David Tarboton,
HydroShare: Advancing Hydrology through Collaborative Data and Model Sharing David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall, Larry.
About CUAHSI The Consortium of Universities for the Advancement of Hydrologic Science, Inc. (CUAHSI) is an organization representing 120+ universities.
Ocean Sciences What is CUAHSI? CUAHSI – Consortium of Universities for the Advancement of Hydrologic Science, Inc Formed in 2001 as a legal entity Program.
Crossing the Digital Divide Presented by: Fernando R. Salas David Maidment, Enrico Boldrini, Stefano Nativi, Ben Domenico OGC Technical Meeting – Met/Occean.
HydroShare: An online, collaborative environment for the sharing of hydrologic data and models David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames,
HydroShare: Advancing Collaboration through Hydrologic Data and Model Sharing David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall, Larry.
, Increasing Discoverability and Accessibility of NASA Atmospheric Science Data Center (ASDC) Data Products with GIS Technology ASDC Introduction The Atmospheric.
, Implementing GIS for Expanded Data Accessibility and Discoverability ASDC Introduction The Atmospheric Science Data Center (ASDC) at NASA Langley Research.
Water Web Services David R. Maidment Center for Research in Water Resources University of Texas at Austin Open Waters Symposium Delft, the Netherlands.
HydroShare: An online collaborative environment for the sharing of hydrologic data and models IN11A-1510 We envision that HydroShare will enable more rapid.
Data Interoperability in the Hydrologic Sciences The CUAHSI Hydrologic Information System David Tarboton, David Maidment, Ilya Zaslavsky, Dan Ames, Jon.
Advancing an Information Model for Environmental Observations Jeffery S. Horsburgh Anthony Aufdenkampe, Richard P. Hooper, Kerstin Lehnert, Kim Schreuders,
Water and Catchment Data Services David R. Maidment Center for Research in Water Resources University of Texas at Austin River Science Symposium Swansea,
EarthCube Building Block for Integrating Discrete and Continuous Data (DisConBB) David Maidment, University of Texas at Austin (Lead PI) Alva Couch, Tufts.
The CUAHSI Community Hydrologic Information System Jeffery S. Horsburgh Utah Water Research Laboratory Utah State University CUAHSI HIS Sharing hydrologic.
Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks”
HydroShare: Advancing Hydrology through Collaborative Data and Model Sharing David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall, Larry.
Clearing your Desk! Software and Data Services for Collaborative Web Based GIS Analysis David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall,
HydroShare: Advancing Hydrology through Collaborative Data and Model Sharing David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall, Larry.
Lecture 4 Data Models Jeffery S. Horsburgh Hydroinformatics Fall 2012 This work was funded by National Science Foundation Grant EPS
The CUAHSI Observations Data Model Jeff Horsburgh David Maidment, David Tarboton, Ilya Zaslavsky, Michael Piasecki, Jon Goodall, David Valentine,
HydroShare: Advancing Hydrology through Collaborative Data and Model Sharing David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall, Larry.
USU, RENCI, BYU, UNC, UVA, CUAHSI, Tufts, Texas, Purdue, Caktus
From Prototype to Service: A CUAHSI Datacenter for Hydroinformatics Richard Hooper Consortium of Universities for the Advancement of Hydrologic Science,
HydroShare: Advancing Hydrology through Collaborative Data and Model Sharing David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall, Larry.
Services-Oriented Architecture for Water Data David R. Maidment Fall 2009.
Hydroinformatics Lecture 15: HydroServer and HydroServer Lite The CUAHSI HIS is Supported by NSF Grant# EAR CUAHSI HIS Sharing hydrologic data.
Share and Publish Your Data and Models with HydroShare David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall, Larry Band, Venkatesh Merwade,
Developing a community hydrologic information system David G Tarboton David R. Maidment (PI) Ilya Zaslavsky Michael Piasecki Jon Goodall
Community Approaches to Hydrologic Data and Model Sharing using Cloud Resources David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall,
The CUAHSI Hydrologic Information System Spatial Data Publication Platform David Tarboton, Jeff Horsburgh, David Maidment, Dan Ames, Jon Goodall, Richard.
Hydroinformatics Lecture: HydroServer .NET/PHP
USU, RENCI, BYU, UNC, UVA, CUAHSI, Tufts, Texas, Purdue, SDSC
Using GIS in Creating an End-to-End System for Publishing Environmental Observations Data Jeffery S. Horsburgh David G. Tarboton, David R. Maidment, Ilya.
Water Information Sharing and HydroShare
Introduction to HydroShare Workshop CUAHSI Biennial Symposium, July 26
The CUAHSI Community Hydrologic Information System
Sharing models as social objects through HydroShare
Developing a Community Hydrologic Information System
Sharing Hydrologic Data with the CUAHSI* Hydrologic Information System
USU, RENCI, BYU, UNC, UVA, CUAHSI, Tufts, Texas, Purdue, SDSC
The CUAHSI Hydrologic Information System and NHD Plus A Services Oriented Architecture for Water Resources Data David G Tarboton David R. Maidment (PI)
Jeffery S. Horsburgh Utah State University
Lecture 8 Database Implementation
CUAHSI HIS Sharing hydrologic data
Hydroinformatics Lecture 15: HydroServer (and HydroServer Lite)
Model Sharing and Collaboration using HydroShare
Services-Oriented Architecture for Water Data
HydroDesktop: A Key Component of the CUAHSI/CZO HIS for Hydrologic Data Discovery, Visualization, and Analysis Daniel P. Ames, Ph.D. P.E. Idaho State University.
HydroShare: Advancing Collaboration through Hydrologic Data and Model Sharing David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall, Larry.
David Tarboton, Dan Ames, Jeffery S. Horsburgh, Jon Goodall
OAC OAC OAC ACI ACI Water Information Sharing and HydroShare Advancing Hydrology through Collaborative.
Space, Time and Variables in Hydrology
OAC OAC OAC ACI ACI The HydroShare domain-specific repository for archiving and active management.
Presentation transcript:

Water Information Sharing and HydroShare Advancing Hydrology through Collaborative Data and Model Sharing using HydroShare David G Tarboton, Ray Idaszak, Jeffery S Horsburgh, Daniel P Ames, Jonathan L Goodall, Alva Couch, Lawrence E Band, Venkatesh Merwade, Richard P Hooper, David R Maidment, Pabitra K Dash, Michael Stealey, Hong Yi, Christopher Calloway, Tian Gan, Anthony M Castronova, Zhiyu Li, Mohamed M Morsy, Shawn Crawley, Mauriel Ramirez, Jeffrey Sadler, Zhaokun Xue, Martyn Clark, Shaowen Wang, Bart Nijssen and Christina Bandaragoda Utah State University, RENCI - University of North Carolina, CyberGIS Center – University of Illinois, Consortium of Universities for the Advancement of Hydrologic Sciences Inc., Brigham Young University, University of Virginia, Tufts University, National Center for Atmospheric Research, University of Washington, University of Texas, Purdue University, Caktus Group. OAC-1664061 OAC-1664018 OAC-1664119 2017-2021 http://www.hydroshare.org

Motivation: Hydrologic research is a team sport Advancing Hydrologic Understanding requires integration of information from multiple sources is data and computationally intensive requires collaboration and working as a team/community Grand challenge (NRC 2001): Better hydrologic forecasting that quantifies effects and consequences of land surface change on hydrologic processes and conditions Floods and Droughts

CyberInfrastructure Challenges Data Analysis Models The data deluge Large datasets, data heterogeneity, Inadequate metadata Data Organization and Model Input preparation Reproducibility Software installation and configuration Platform dependencies, Library dependencies, Licensing Computational resources Memory, disk and processing

You need some infrastructure to manage and share the data. The Data Deluge One day = 96 observations One year = 35,040 observations So far (~1.5 years) = 55,000+ observations One month = 2880 observations One week = 672 observations So, let’s talk about the volume of data being generated in the network. Times 14 Aquatic Sites with ~26 Variables Times 14 Climate Sites with ~74 Variables Plus different versions of the data (raw versus checked) = 43,400,000+ observations You need some infrastructure to manage and share the data. From Jeff Horsburgh

The challenge of data heterogeneity Water quality Water quantity The challenge of data heterogeneity From dispersed federal agencies From investigators collected for different purposes Different formats Points Lines Polygons Fields Time Series Rainfall and Meteorology Soil water Groundwater The way that data is stored can enhance or inhibit the analysis that can be done We need ways to organize the data we work with Data models GIS

Geographic Data Model “All geographic information systems are built using formal models that describe how things are located in space. A formal model is an abstract and well-defined system of concepts. A geographic data model defines the vocabulary for describing and reasoning about the things that are located on the earth. Geographic data models serve as the foundation on which all geographic information systems are built.” Scott Morehouse, Preface to “Modeling our World”, First Edition. He was the chief software engineer at ESRI Or, more simply: the way that data is organized can enhance or inhibit the analysis that can be done

I have your information right here … The way that data is organized can enhance or inhibit the analysis that can be done I have your information right here … Picture from: http://initsspace.com/

Hydrologic Science It is as important to represent hydrologic environments precisely with data as it is to represent hydrologic processes with equations Physical laws and principles (Mass, momentum, energy, chemistry) Hydrologic Process Science (Equations, simulation models, prediction) Hydrologic conditions (Fluxes, flows, concentrations) Hydrologic Information Science (Observations, data models, visualization Hydrologic environment (Physical earth) From David Maidment

Data models organize the diversity of natural systems ArcHydro – A model for Discrete Space-Time Data NetCDF (Unidata) - A model for Continuous Space-Time data Space, L Time, T Variables, V D Coordinate dimensions {X} Variable dimensions {Y} Space, FeatureID Time, TSDateTime Variables, TSTypeID TSValue CUAHSI Observations Data Model: What are the basic attributes to be associated with each single data value and how can these best be organized? Terrain Flow Data Model used to enrich the information content of a digital elevation model

How do people share other content now YouTube Facebook Instagram Drop Box Google Drive ArcGIS Online Hydrologic data ? From Jeff Horsburgh

CUAHSI Consortium of Universities for the Advancement of Hydrologic Science, Inc. CUAHSI is made up of more than 130 U.S. universities and international water science-related organizations. CUAHSI receives support from the U.S. National Science Foundation (NSF) to develop infrastructure and services for the advancement of water science in the United States. http://www.cuahsi.org/

BYU CEE Graduate Seminar 9/29/2011 What is HIS? HIS = Hydrologic Information System The CUAHSI* Hydrologic Information System (HIS) provides web services, tools, standards and procedures that enhance access to more and better data for hydrologic analysis. https://www.cuahsi.org/data-models/ HIS helps you discover and access hydrologic data.

Major Challenges Addressed by CUAHSI HIS How do I find and access hydrologic observations for use in my work? How can I store, manage, and share hydrologic observations in a way that makes my life easier and in a way that others could use them?

Finding Information on the Internet Catalogs Catalog Find Publish/Register Access Server Desktop Servers Desktops, Clients, Browsers

CUAHSI Hydrologic Information System HydroCatalog Data Discovery and Integration Metadata Services Search Services WaterML, Other OGC Standards HydroServer Data Publication Data Services Data Analysis and Synthesis HydroDesktop Information Model and Community Support Infrastructure

What are the basic attributes to be associated with each single data value and how can these best be organized? DateTime Interval (support) Space, S Time, T Variables, V s t Vi vi (s,t) “Where” “What” “When” A data value Variable Method Quality Control Level Sample Medium Value Type Data Type Source/Organization Location Feature of interest Latitude Longitude Site identifiers Units Accuracy Censoring Qualifying comments

Observations Data Model (ODM) Soil moisture data Streamflow Flux tower data Groundwater levels Water Quality Precipitation & Climate A relational data model at the single observation level Metadata for unambiguous interpretation Traceable heritage from raw measurements to usable information Promote syntactic and semantic consistency Cross dimension retrieval and analysis Horsburgh, J. S., D. G. Tarboton, D. R. Maidment, and I. Zaslavsky (2008), A relational model for environmental and water resources data, Water Resources Research, 44, W05406, doi:10.1029/2007WR006392.

CUAHSI HIS Enabling Water Data Discovery From Jeff Horsburgh

Data and models used by hydrologists are diverse… Time series Geographic rasters Geographic features Multidimensional space/time Model programs Model instances … http://www.usgs.gov But, the data we use are diverse, and we don’t currently have a lot of options for sharing models or model instances. We’ve nailed time series down, but we still don’t have great cyberinfrastructure embraced by our community that supports the broad spectrum of data and models we use. http://www.unidata.ucar.edu http://www.esri.com From Jeff Horsburgh

HydroShare is a collaborative environment for data sharing, analysis and modeling Share your data and models with colleagues Manage who has access to the content that you share Share, access, visualize and manipulate a broad set of hydrologic data types Sharing and execution of models Access to and use of high performance computing Publication of data and models with a DOI Our goal is to make sharing of hydrologic data and models as easy as sharing videos on YouTube or shopping on Amazon.

Data and models used by hydrologists are diverse… Time series Geographic rasters Geographic features Multidimensional space/time Model programs Model instances … http://www.usgs.gov But, the data we use are diverse, and we don’t currently have a lot of options for sharing models or model instances. We’ve nailed time series down, but we still don’t have great cyberinfrastructure embraced by our community that supports the broad spectrum of data and models we use. http://www.unidata.ucar.edu http://www.esri.com HydroShare can hold data in a wide variety of formats, and data in any format as “generic” From Jeff Horsburgh

Value that HydroShare provides Integration of information from multiple sources to enhance research Re-use of data beyond the purpose for which it was originally collected, extending the value of measurement, monitoring and research investments A platform for data management to support mandates for open data and access to the data that supports research findings Enhanced trust in research findings and management decisions through transparency and support for reproducibility Primary audience is US Hydrologic Research community (NSF funding) but open to international use and use by water resource professionals, educators and citizen scientists

The Steven Hall Story From Jeff Horsburgh Steven verified his data and metadata were correct but kept the data private Steven submitted his paper for publication and responded to reviews Steven published his data in HydroShare and received a DOI With a little help, Steven deposited his dataset in the online HydroShare repository Steven collected his data in the field and transformed into a sharable format Steven published his paper and cited published data in HydroShare The Steven Hall Story From Jeff Horsburgh

How HydroShare Works HydroShare CyberGIS Data Store Data Store HydroShare Apps Django website iRODS “Network File System” API OAuth Web software to operate on content you have access to (Apps) Extensibility Organize and annotate your content Manage access Resource exploration Actions on Resources Anyone can set up a server/app platform (software service) to operate on HydroShare resources through iRODS and API Distributed file storage HydroShare Data Store CyberGIS Data Store SWATShare (Hubzero) CyberGIS Unidata - THREDDS, Hyrax Landlab

Demo

Collaborative data sharing Add content to HydroShare to share with your colleagues or formally publish to document result reproducibility

Resources (data and models) in HydroShare are objects of collaboration (social objects) For each resource you can Manage who has access To edit To view Comment or rate Get unique identifier Describe with metadata Organize into collections Formally publish Version Open with compatible web app

Formal data publication ... Resources formally published receive a citable digital object identifier (DOI) and are made immutable to changes

Automatic and natural metadata gathering eases some of the pain of metadata entry For geographic raster WGS 84 Coverage information automatically harvested from GeoTIFF coordinate system information For multidimensional netCDF data with CF convention metadata the HydroShare metadata can be fully and automatically completed

Model Program and Model Instance Resource Types Software (source code) Documentation Model Instance: Inputs Outputs generated from inputs (optional)

Apps act on resources to support web based visualization and analysis http://apps.hydroshare.org

Hyrax OPeNDAP Server for public Multidimensional resources in netCDF format Snapshot in time in visualized in Panoply

TauDEM in CyberGIS with Data in HydroShare http://gateway.cigi.illinois.edu/ Submitted Running Results data created Results Ready Visualizations

Jupyter Python Notebook App Write and execute python code acting on content of HydroShare resources and saving results back to HydroShare

HydroShare 2.0 Hydrologic modeling Social Collaboration Systematic hypothesis testing (SUMMA) Transform National Water Model into community water model platform Social Collaboration User driven design Incentives, metrics, group collaboration Storage Agility Use 3rd party storage (Dropbox, Box, Google, University libraries and digital commons) App Nursery Make it easier to write apps Develop and deploy new apps

HydroShare 2.0 (CI for fully web based hydrologic innovation environment)

Summary A new, web-based system for advancing model and data sharing Access multiple types of hydrologic data using standards compliant data formats and interfaces Flexible discovery functionality Model sharing and execution Facilitate and ease access to use of high performance computing Social media and collaboration functionality Links to other data and modeling systems Enable more rapid advances in hydrologic understanding through collaborative data sharing, analysis and modeling Sustained operation by CUAHSI A growing community of users

To learn more www.hydroshare.org

Thanks to the HydroShare team! USU RENCI/UNC CUAHSI BYU Tufts UVA Texas Purdue SDSC Caktus The HydroShare project is part of a broad effort in CUAHSI in the area of Hydrologic Information Systems. We have a team of developers and domain scientists from eight universities working on HydroShare. This is part of the even broader focus in NSF on data management, Cyberinfrastructure and sustainable software. http://www.hydroshare.org ACI-1148453 ACI-1148090 2012-2017