HydroShare: Advancing Collaboration through Hydrologic Data and Model Sharing David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall, Larry.

Slides:



Advertisements
Similar presentations
ASIAES Project Overview Satellite Image Network for Natural Hazard Management in ASEAN+3 region Pakorn Apaphant Geo-Informatics and Space Technology Development.
Advertisements

HydroServer A Platform for Publishing Space- Time Hydrologic Datasets Support EAR CUAHSI HIS Sharing hydrologic data Jeffery.
ICEWATER: INRA Constellation of Experimental Watersheds Cyberinfrastructure to Support Publication of Water Resources Data Jeffery S. Horsburgh, Utah State.
Task WA-01 GEO Work Plan Symposium 2014 Managing and Sharing Data WA-01 R. Lawford and M. Schlummer based on contributions from D. Arctur, D. Maidment,
Venkatesh Merwade, Purdue University
Connect. Communicate. Collaborate Click to edit Master title style MODULE 1: perfSONAR TECHNICAL OVERVIEW.
Linking HIS and GIS How to support the objective, transparent and robust calculation and publication of SWSI? Jeffery S. Horsburgh CUAHSI HIS Sharing hydrologic.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
Development of a Community Hydrologic Information System Jeffery S. Horsburgh Utah State University David G. Tarboton Utah State University.
Two NSF Data Services Projects Rick Hooper, President Consortium of Universities for the Advancement of Hydrologic Science, Inc.
Deployment and Evaluation of an Observations Data Model Jeffery S Horsburgh David G Tarboton Ilya Zaslavsky David R. Maidment David Valentine
Tools for Publishing Environmental Observations on the Internet Justin Berger, Undergraduate Researcher Jeff Horsburgh, Faculty Mentor David Tarboton,
A Guide to Getting Started
HydroShare: Advancing Hydrology through Collaborative Data and Model Sharing David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall, Larry.
About CUAHSI The Consortium of Universities for the Advancement of Hydrologic Science, Inc. (CUAHSI) is an organization representing 120+ universities.
Information Requirements for Integrating Spatially Discrete, Feature- Based Earth Observations Jeffery S. Horsburgh Anthony Aufdenkampe, Kerstin Lehnert,
HydroShare: An online, collaborative environment for the sharing of hydrologic data and models David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames,
HydroShare: Advancing Collaboration through Hydrologic Data and Model Sharing David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall, Larry.
Water Web Services David R. Maidment Center for Research in Water Resources University of Texas at Austin Open Waters Symposium Delft, the Netherlands.
HydroShare: An online collaborative environment for the sharing of hydrologic data and models IN11A-1510 We envision that HydroShare will enable more rapid.
Data Interoperability in the Hydrologic Sciences The CUAHSI Hydrologic Information System David Tarboton, David Maidment, Ilya Zaslavsky, Dan Ames, Jon.
Water and Catchment Data Services David R. Maidment Center for Research in Water Resources University of Texas at Austin River Science Symposium Swansea,
Getting the most out of ArcGIS Web Application Templates
August 2003 At A Glance VMOC-CE is an application framework that facilitates real- time, remote cooperative work among geographically dispersed mission.
Towards a 3D National Hydrography Dataset Presentation made to the US Geological Survey By David R. Maidment Center for Research in Water Resources University.
HydroShare: Advancing Hydrology through Collaborative Data and Model Sharing David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall, Larry.
Clearing your Desk! Software and Data Services for Collaborative Web Based GIS Analysis David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall,
HydroShare: Advancing Hydrology through Collaborative Data and Model Sharing David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall, Larry.
The CUAHSI Observations Data Model Jeff Horsburgh David Maidment, David Tarboton, Ilya Zaslavsky, Michael Piasecki, Jon Goodall, David Valentine,
HydroShare: Advancing Hydrology through Collaborative Data and Model Sharing David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall, Larry.
USU, RENCI, BYU, UNC, UVA, CUAHSI, Tufts, Texas, Purdue, Caktus
CUAHSI HIS: Science Challenges Linking small integrated research sites (
HydroShare: Advancing Hydrology through Collaborative Data and Model Sharing David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall, Larry.
Hydroinformatics Lecture 15: HydroServer and HydroServer Lite The CUAHSI HIS is Supported by NSF Grant# EAR CUAHSI HIS Sharing hydrologic data.
Share and Publish Your Data and Models with HydroShare David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall, Larry Band, Venkatesh Merwade,
Reference Management Module I: Introduction By Rehema Chande-Mallya(PhD)
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
TRIG: Truckee River Info Gateway Dave Waetjen Graduate Student in Geography Information Center for the Environement (ICE) University of California, Davis.
Developing a community hydrologic information system David G Tarboton David R. Maidment (PI) Ilya Zaslavsky Michael Piasecki Jon Goodall
Community Approaches to Hydrologic Data and Model Sharing using Cloud Resources David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall,
The CUAHSI Hydrologic Information System Spatial Data Publication Platform David Tarboton, Jeff Horsburgh, David Maidment, Dan Ames, Jon Goodall, Richard.
USU, RENCI, BYU, UNC, UVA, CUAHSI, Tufts, Texas, Purdue, SDSC
SharePoint 101 – An Overview of SharePoint 2010, 2013 and Office 365
Water Information Sharing and HydroShare
Introduction to HydroShare Workshop CUAHSI Biennial Symposium, July 26
Sharing models as social objects through HydroShare
Sharing Hydrologic Data with the CUAHSI* Hydrologic Information System
USU, RENCI, BYU, UNC, UVA, CUAHSI, Tufts, Texas, Purdue, SDSC
Discovering Computers 2010: Living in a Digital World Chapter 14
RDA US Science workshop Arlington VA, Aug 2014 Cees de Laat with many slides from Ed Seidel/Rob Pennington.
Cloud Computing I hear this question often. It is not easy to explain, because it means different things depending on who you talk to. Today’s Webinar.
DataNet Collaboration
CUAHSI HIS Sharing hydrologic data
Chapter 18 MobileApp Design
Water Information Sharing and HydroShare
Jay Bhatt Drexel University Libraries
HydroDesktop: A Key Component of the CUAHSI/CZO HIS for Hydrologic Data Discovery, Visualization, and Analysis Daniel P. Ames, Ph.D. P.E. Idaho State University.
KISTERS TimeSeries HUB
Overview of Social Computing in Microsoft SharePoint 2010
David Tarboton, Dan Ames, Jeffery S. Horsburgh, Jon Goodall
Code Analysis, Repository and Modelling for e-Neuroscience
Social media for global scientific community – Mendeley project
Bird of Feather Session
ArcGIS Online – The Road Ahead
Code Analysis, Repository and Modelling for e-Neuroscience
OAC OAC OAC ACI ACI The HydroShare domain-specific repository for archiving and active management.
Mark Quirk Head of Technology Developer & Platform Group
ArcGIS Online Steps for Success A best practices approach
Hydrologic Information Systems to discover and combine data from multiple sources for hydrologic analysis David Tarboton Utah State University CUAHSI HIS.
Presentation transcript:

HydroShare: Advancing Collaboration through Hydrologic Data and Model Sharing David Tarboton, Ray Idaszak, Jeffery Horsburgh, Dan Ames, Jon Goodall, Larry Band, Venkatesh Merwade, Alva Couch, Jennifer Arrigo, Rick Hooper, David Valentine http://www.hydroshare.org OCI-1148453 OCI-1148090

CUAHSI HIS Challenges Publishing data requires access to or setting up a HydroServer Accessing data requires HydroDesktop Generally limited to time series at a point Server Desktop Catalog

A digital divide Big Data and HPC Researchers Experimentalists Modelers awk grep vi #PBS -l nodes=4:ppn=8 mpiexec chmod #!/bin/bash How can we best structure data and computer models to enable the use of high-performance and data-intensive computing by discipline scientists coming to this problem without extensive computational knowledge and algorithmic experience? Gateways, Web Interfaces, CyberGIS

Can sharing data and models be as easy as sharing photos on Facebook or videos on YouTube? In short, we would like to see if sharing hydrologic data and models can be as easy as sharing photos on Facebook or videos on YouTube.

Can finding data and models be as easy as shopping on Amazon? Possible Filters Available Formats Items Or whether finding data and models can be as easy as shopping on Amazon, and perhaps not as heavy on your credit card. List of items also includes other providers, aka not shipped by amazon. List of data shows ‘facets’ , Items show formats recommendations. Add-in shows pricetracking Recommendations Prices (perhaps usage)

Cloud Computing Applications Models Storage Services Computation Wikipedia: Cloud computing is the use of computing resources (hardware and software) that are delivered as a service over a network (typically the Internet) Google, Amazon, Microsoft, Apple, DropBox XSEDE, Condor, BOINC

HydroShare is a web based collaborative system to support analysis, modeling and data publication Collaboration Observers and instruments Analysis Data Models HydroShare will be a collaborative environment for sharing hydrologic data and models aimed at giving hydrologists the technology infrastructure they need to address critical issues related to water quantity, quality, accessibility, and management. HydroShare will expand the data sharing capability of the CUAHSI Hydrologic Information System by broadening the classes of data accommodated, expanding capability to include the sharing of models and model components, and taking advantage of emerging social media functionality to enhance information about and collaboration around hydrologic data and models. Functionality will include A web portal for model and data sharing Sharing features added to HydroDesktop client software Access to more types of hydrologic data using standards compliant data formats and interfaces Enhanced catalog functionality that broadens discovery functionality to different data types and models New model sharing and discovery functionality Enhanced easy to use access to high performance computing Social media and collaboration functionality Linkages to other data and modeling systems such as USGS and CUAHSI data services, NASA earth exchange and HPC resources e.g. at CSDMS Publication, Archival, Curation

Currently in beta testing http://beta.hydroshare.org Currently in beta testing

HydroShare Functionality to be Developed A new, web-based system for advancing model and data sharing Sharing features to HydroDesktop Access more types of hydrologic data using standards compliant data formats and interfaces Enhance catalog functionality that broadens discovery functionality to different data types New model sharing and discovery functionality Facilitate and ease access to use of high performance computing New social media and collaboration functionality Links to other data and modeling systems

Upload

Support additional types of data Resource Types Time Series Geographic feature set Other Referenced HIS time series Geographic Raster Multidimensional Space Time dataset River geometry Sample based observations (ODM2 and CZO) Documents Tabular objects HydroDesktop Project package Scripts Models Model Components Referenced data sets from other (non HIS sources). Tools Uploaders to facilitate loading of resources Viewers to visualize the resource Exporters to download the resource Best practice tools for hydrologic data preprocessing and analysis Requires a Resource Data Model Documented resource content specification that dictates how the resource is stored in HydroShare

Imagine the Possibilities… Observe Publish and Catalog Discover and Analyze/Model (in Desktop or Cloud) Collaboration 3 Observers and instruments Analysis HydroServer (ODM) Data Models 1 2 Publication, Archival, Curation HydroShare to support integrated collaborative analysis, modeling and data publication

Imagine the Possibilities… Share the results (Data and Models) Collaboration Observers and instruments 4 Analysis HydroShare resource store Data Models Publication, Archival, Curation HydroShare to support integrated collaborative analysis, modeling and data publication

Imagine the Possibilities… Group Collaboration using HydroShare Preparation of a paper Collaboration 5 Observers and instruments Analysis 6 Data Models Publication, Archival, Curation HydroShare to support integrated collaborative analysis, modeling and data publication

Imagine the Possibilities… Submittal of paper, review, archival of electronic paper with data, methods and workflow Collaboration Observers and instruments Analysis 7 Data Models Publication, Archival, Curation HydroShare to support integrated collaborative analysis, modeling and data publication DataOne, EarthCube, …

HydroShare Modeling Flow Time x y t Data: Links to national and global data sets of essential terrestrial variables (e.g. NASA NEX, HydroTerre) Tools to preprocess and configure inputs Preconfigured models and modeling systems as services Standards for information exchange for interoperability (OpenMI, CSDMS BMI) Tools for Visualization and Analysis Automated reasoning to couple models based on purpose, context, data and resources Automated reasoning to couple models based on purpose, context, data and resources (Aaron Byrd) Standards for information exchange for interoperability (OpenMI, CSDMS BMI) Data: Links to national and global data sets of essential terrestrial variables (e.g. NASA NEX, HydroTerre) Tools to preprocess and configure inputs (TauDEM + CyberGIS) Preconfigured models and modeling systems as services (CI-WATER) Tools for visualization and analysis

A specific example Big snow year Will my city flood? Click to delineate watershed (model domain) Generate model package from Essential Terrestrial Variables Generate suite of input scenarios Execute model and view results P Time Flow Time

But there is more… What if I could express my decision needs to the system and have it reason and deduce which models need to run, then configure and run them based on the inputs available, precision needs and resources and time available.

Resource Repository Centric Paradigm for Modeling and Analysis Analysis Tools Visualization Tools Data Loaders Data Discovery Tools Models Resource Repository Enable multiple models to use common “best practice” tools

E.g. SWATShare A web based tool for publishing, sharing, and accessing Soil Water Assessment Tool (SWAT) You need to log-in to use the functions of SWAT share. On this slide, you see a map the watershed in red are the one for which SWAT models are available. Once you click a on one of the watersheds (highlighted in yellow), then you see then you see the associated metadata with the model. If the model is only published (and not shared), one cannot download or run the model. However, if the model is published and shared, any user can download, modify, and run the model on the SWATShare. www.water-hub.org/swat-tool

Model pre and post processing workflow Analysis Tools Visualization Tools Data Loaders Data Discovery Tools Models Resource Repository Input Files Output Files Pre-Processing Post -Processing Resource Repository Each model interacts with information in the common data store The modeler does not need to be concerned with and can take advantage of standardized analysis, visualization loading and discovery tools

Architecture and Development

Drupal – Content Management System Extensible Open Source Content Management Framework for Publication written in PHP Over 14,000 user contributed modules Themed and Styled Presentation of HydroShare Resources with in page visualization Off the shelf modules provide a Social Experience surrounding Hydrologic Data: Comments, Ratings, Group Behavior Custom module development supports HydroShare Data Model, GeoAnalytics and iRODS Integration

Enterprise iRODS Distributed Data Grid Middleware: E-iRODS in HydroShare Storage of HydroShare Resources Replicated across multiple institutions Access to Computation Access to Indexing for Discovery Rule Engine MSVC R. Server … Client Users iCAT Distributed Data Grid Middleware: Metadata Catalog holding virtual file system information and associated metadata Extensible number of ‘Resource Servers’ which may provide connectivity to storage resources Integrated Rule Engine for Policy Driven Data Management triggered by Data Management Activities Extensibility via Microservices (MSVC) – Plugins providing functionality to the Rule Engine

Informatics Standing Committee A community project http://www.cuahsi.org 109 US University members 7 affiliate members 20 international affiliate members 3 corporate members (as of January 2013) Informatics Standing Committee Users Committee

Implementation (Agile) Community / User Requirements Community Governance CUAHSI Board Standing Committee on Informatics HydroShare Executive Committee CUAHSI User Community Development Team Implementation (Agile) Hydrologic Information System (HIS) Integrated Rule-Oriented Data System (iRODS) Drupal Evaluation Metrics End-user involvement Quantitative and qualitative measurement Sustainability Prioritization Decision Making Oversight Released Software Community / User Requirements Surveys Conferences Workshops Embed UI with “Help us make our software better” Specification Requests Prototype

HydroShare project team USU RENCI/UNC CUAHSI BYU Tufts UVA Texas Purdue SDSC The HydroShare project is part of a broad effort in CUAHSI in the area of Hydrologic Information Systems. We have a team of developers and domain scientists from eight universities working on HydroShare. This is part of the even broader focus in NSF on data management, Cyberinfrastructure and sustainable software. OCI-1148453 OCI-1148090 2012-2017

User driven use cases Annotate uploaded hydrology models using an ontology Register a Package with HydroShare Add data resource for a model Notify Me When Related Resources Are Registered Register a Resource with HydroShare Evaluate Load Reduction Scenarios Suggest a Resource Related to the Current Resource Building an Intelligent Digital Watershed (IDW) Contribute to a Community Dataset Define Relationships between Resources Discover a Community Dataset to which I Can Contribute Execute a Model in HydroShare Register a Workflow with HydroShare Register a Community Dataset Download a Model, Execute It, and Share the Model and Results Define a Composite Resource Crowd sourcing modeling tasks Automated Visualization (thumbnails) User displays HydroShare Gallery Existing User Logs into HydroShare New User Creates a HydroShare User Account User Sets Personal Preferences User is provided a personal Dashboard User Chooses to “Follow” Another User User Chooses to “Follow” a Group User Views His/Her Personal Content User Uploads a Resource User Deletes a Resource User Shares a Resource in HydroShare User Publishes a Resource to DataONE User Publishes a Resource to the CUAHSI Water Data Center User Exports a Resource to their Local Machine User Searches / Filters / Sorts their Personal Resources User Views Details Page for a Resource User Groups Resources into a “Folder” or “Collection” User “Opens” a Resource User Edits Metadata Description for a Resource User Adds a Comment to a Resource User Rates / Reviews a Resource User Derives a New Resource from an Existing Resource User Executes a Resource User Explores / Searches Available HydroShare Resources User “Pins” a Discovered Resource to a “Resource Collection” User Filters Discovered Resources User Imports Data from Externally Hosted Resources User Searches For Collaboration Groups User Views Group Details User Creates a Collaboration Group User Requests Group Membership User Creates a Comment on a Collaboration Group User Creates a Discussion in a Collaboration Group Discussion Forum User Edits a Collaboration Group’s Description User Searches / Filters / Sorts a Group’s Resources User Views Documentation and Gets Support User Views / Subscribes to the HydroShare Blog User Exports a HydroShare Resource Citation into Mendeley or Zotero User Transfers Ownership of a Resource to Another User User Receives HydroShare Social Media Notifications via Mobile Device User Views Access / Download Statistics for a Resource User Views HydroShare Resources via Mobile Devices Searching and/or browsing HydroShare Translate data automatically for HydroShare operations. Translate data automatically for export. Publish translated data. Translate replicated data. Registration of a new HydroShare Tool Editing a Published (with DOI) resource User Creates New “Model Package” Resource User Transfers Ownership of a Group to Another User User Develops a Client for HydroShare Summarize hydrologic model input parameters for a user defined region Discover specialist/ Promote specialized services Visualize Time Series Upload a Model

Metrics Metric Number Number of registered users 35 Number of host institutions 15 Github HydroShare code repository owners and members Use Metric Number of active users Number of resources stored Number of resources downloaded Size of resources stored (GB) CPU hours of compute resources used Number of compute jobs run Number of logons Average duration of session Total use   Use by user type University Faculty Post-Doctoral Fellow …. Use by Geographic Location State Country Use by resource type Time Series Geographic Feature Set User Types: University Faculty, University Professional or Research Staff, Post-Doctoral Fellow, University Graduate Student, University Undergraduate Student, Commercial/Professional, Government Official, School Student Kindergarten to 12th Grade, School Teacher Kindergarten to 12th Grade, Other, Unspecified Resource Types: Time Series, Geographic Feature Set, Geographic Raster, Multidimensional Space Time Array, River Geometry, Model, Workflow, Other, …

Collaborative Open Development http://github.com/organizations/hydroshare http://hydrodesktop.codeplex.com

Summary A collaborative website for the sharing of hydrologic data and models To expand data sharing capability of CUAHSI HIS Additional data classes Models, scripts, tools and workflows Community Participation Interoperability Standards Open Development To boldly go where no one has gone before

Thanks to a lot of people USU RENCI/UNC CUAHSI BYU Tufts USC Texas Purdue SDSC The HydroShare project is part of a broad effort in CUAHSI in the area of Hydrologic Information Systems. We have a team of developers and domain scientists from eight universities working on HydroShare. This is part of the even broader focus in NSF on data management, Cyberinfrastructure and sustainable software. HydroShare team: Dave Tarboton, Ray Idaszak, Dan Ames, Jeff Horsburgh, Jon Goodall, Larry Band, Venkatesh Merwade, Jeff Heard, Carol Song, Alva Couch, David Valentine, Rick Hooper, Jennifer Arrigo, David Maidment, Tim Whiteaker, Alex Bedig, Laura Christopherson, Pabitra Dash, Tian Gan, Tony Castronova, Karl Gustafson, Stephen Jackson, Cuyler Frisby, Stephanie Mills, Brian Miles, Jon Pollak, Stephanie Reeder, Ash Semien, Yaping Xiao, Lan Zhao http://www.cuahsi.org/hydroshare.aspx OCI-1148453 OCI-1148090

Next Class

Representing River Geometry in HydroShare LiDAR Cross Sections Attached to River Network Cross Sections Hydraulic Calculations

Modular design, linking river geometry, catchment geometry, network topology, and time series observations Data is linked by common reference points along the river, which can be represented as point or cross section shapefiles and shown on a map. Based on OGC HY_Features Model