Lecture 4 Data Models Jeffery S. Horsburgh Hydroinformatics Fall 2012 This work was funded by National Science Foundation Grant EPS 1135482.

Objectives Identify and describe important entities and relationships to model data Describe important data models used in Hydrology such as the Observations Data Model (ODM), ArcHydro, and NetCDF

What is a Data Model? Abstract model that documents and organizes data Explicitly provides the definition of and determines the structure of data Used as a plan and structure for developing applications that use the data

Data Models Define the “entity” types within a domain Methods (how) Sites (where) Values Data Sources (who)

Entities Associated with Observations Variables – the things you measure or observe Observers – who made the observation Samples – a bottle of water, a sediment core Offsets – distance below ground, below surface, etc. Versions – raw data, processed data, simulations Qualifiers – limitations to data use

Data Models Define the attributes of entities
Entity = Site
Attribute: Value
– Site Name: Little Bear River near Wellsville
– Site Code: USU-LBR-Wellsville
– Latitude:
– Longitude:
– Elevation: 1365 m
– State: Utah
– County: Cache
– Description: Attached to SR101 bridge.
– Site Type: Stream

Data Models Define the relationships among entities
Example: water temperature values in degrees Celsius measured in the Little Bear River at Mendon Road using a Hydrolab MS5 multiparameter sonde by Utah State University
Entities related in this example: Site, Variable and Method, Source, Values
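The entities and relationships in the water temperature example can be sketched in code. This is only an illustration under stated assumptions: the class names, field names, site code, timestamp, and the numeric value are hypothetical and do not come from the lecture or from ODM itself.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class Site:
    code: str  # hypothetical identifier
    name: str


@dataclass(frozen=True)
class Variable:
    name: str
    units: str


@dataclass(frozen=True)
class Method:
    description: str


@dataclass(frozen=True)
class Source:
    organization: str


@dataclass(frozen=True)
class DataValue:
    """One observation, related to exactly one of each other entity."""
    value: float
    timestamp: datetime
    site: Site
    variable: Variable
    method: Method
    source: Source

    def describe(self) -> str:
        return (
            f"{self.variable.name} of {self.value} {self.variable.units} "
            f"at {self.site.name}, measured using {self.method.description} "
            f"by {self.source.organization}"
        )


# Names below come from the slide; the code, value, and time are made up.
site = Site(code="EXAMPLE-SITE", name="Little Bear River at Mendon Road")
variable = Variable(name="Water temperature", units="degrees Celsius")
method = Method(description="Hydrolab MS5 multiparameter sonde")
source = Source(organization="Utah State University")

obs = DataValue(12.3, datetime(2012, 9, 1, 12, 0), site, variable, method, source)
print(obs.describe())
```

Because each `DataValue` holds references to the other entities rather than copies of their text, the description of a site or method is stored once and shared by every value that points to it.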

Data Models Define the “business rules” for data – Observations are recorded at one and only one site – One or more variables are measured at a site – A site must have a name – A variable name must be chosen from a controlled vocabulary
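As a rough sketch, business rules like these can be expressed as checks that run before data are accepted. The dictionary layout, the function name `validate_observation`, and the three vocabulary terms are assumptions made for illustration; a real system would typically enforce such rules with database constraints.

```python
from typing import List

# Hypothetical subset of a controlled vocabulary for variable names.
CONTROLLED_VARIABLE_NAMES = {"Temperature", "Discharge", "Dissolved oxygen"}


def validate_observation(obs: dict) -> List[str]:
    """Return a list of rule violations (an empty list means valid)."""
    errors = []

    # Rule: observations are recorded at one and only one site.
    site = obs.get("site")
    if not isinstance(site, dict):
        errors.append("observation must reference exactly one site")
    # Rule: a site must have a name.
    elif not (site.get("name") or "").strip():
        errors.append("site must have a name")

    # Rule: a variable name must be chosen from a controlled vocabulary.
    if obs.get("variable_name") not in CONTROLLED_VARIABLE_NAMES:
        errors.append("variable name must come from the controlled vocabulary")

    return errors


good = {"site": {"name": "Little Bear River near Wellsville"},
        "variable_name": "Temperature"}
bad = {"site": {"name": ""}, "variable_name": "Temp"}

print(validate_observation(good))
print(validate_observation(bad))
```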

Types of Data Models Relational data models – e.g., relational databases [Diagram: tables linked by one-to-many (1 to *) relationships]

Relational Data Models Great for data with many transactions Great in a multiple-user environment Powerful query language – Structured Query Language (SQL) Robust database servers and software tools available
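A small example may help show what a relational model and an SQL query look like in practice. The sketch below uses Python's built-in `sqlite3` module with an in-memory database. The table and column names are loosely inspired by the entities in this lecture but are simplified assumptions, not the actual ODM schema, and the data values are made up.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # enforce relationships in SQLite

conn.executescript("""
CREATE TABLE Sites (
    SiteID   INTEGER PRIMARY KEY,
    SiteCode TEXT NOT NULL UNIQUE,
    SiteName TEXT NOT NULL              -- rule: a site must have a name
);
CREATE TABLE Variables (
    VariableID   INTEGER PRIMARY KEY,
    VariableName TEXT NOT NULL,
    Units        TEXT NOT NULL
);
CREATE TABLE DataValues (
    ValueID       INTEGER PRIMARY KEY,
    DataValue     REAL NOT NULL,
    LocalDateTime TEXT NOT NULL,
    SiteID        INTEGER NOT NULL REFERENCES Sites(SiteID),
    VariableID    INTEGER NOT NULL REFERENCES Variables(VariableID)
);
""")

conn.execute("INSERT INTO Sites VALUES (1, 'USU-LBR-Wellsville', "
             "'Little Bear River near Wellsville')")
conn.execute("INSERT INTO Variables VALUES (1, 'Temperature', 'degrees Celsius')")
conn.executemany(
    "INSERT INTO DataValues (DataValue, LocalDateTime, SiteID, VariableID) "
    "VALUES (?, ?, 1, 1)",
    [(10.0, "2012-09-01T00:00"), (11.0, "2012-09-01T01:00"),
     (12.0, "2012-09-01T02:00")],
)

# One site row and one variable row are shared by many data values (1 to *).
rows = conn.execute("""
    SELECT s.SiteName, v.VariableName, COUNT(*), AVG(d.DataValue)
    FROM DataValues AS d
    JOIN Sites AS s ON s.SiteID = d.SiteID
    JOIN Variables AS v ON v.VariableID = d.VariableID
    GROUP BY s.SiteName, v.VariableName
""").fetchall()
print(rows)
```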

Types of Data Models File based data models – ESRI File Geodatabase – NetCDF Structured file or set of files that store data

File Based Data Models Usually tied to a tool or set of tools for reading, writing, etc. Can be portable across platforms Can be optimized for performance or compression (e.g., custom binary files)

Types of Data Models Extensible Markup Language (XML) schemas

XML Schemas Great for transporting data in a machine readable format Platform and programming language independent Special form of file based data model
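To illustrate machine-readable transport, the sketch below writes one observation to XML and reads it back with Python's standard `xml.etree.ElementTree` module. The element and attribute names are invented for this example; they do not follow WaterML or any published schema, and the snippet produces an XML document rather than a schema definition (XSD). The value and timestamp are made up.

```python
import xml.etree.ElementTree as ET


def observation_to_xml(site_code, variable, units, timestamp, value):
    """Serialize one observation using a hypothetical element layout."""
    root = ET.Element("observation")
    ET.SubElement(root, "site", code=site_code)
    ET.SubElement(root, "variable", units=units).text = variable
    ET.SubElement(root, "value", dateTime=timestamp).text = str(value)
    return ET.tostring(root, encoding="unicode")


def observation_from_xml(text):
    """Parse the XML produced above back into a plain dictionary."""
    root = ET.fromstring(text)
    return {
        "site_code": root.find("site").get("code"),
        "variable": root.find("variable").text,
        "units": root.find("variable").get("units"),
        "timestamp": root.find("value").get("dateTime"),
        "value": float(root.find("value").text),
    }


xml_text = observation_to_xml(
    "USU-LBR-Wellsville", "Temperature", "degrees Celsius",
    "2012-09-01T12:00:00", 12.3,
)
parsed = observation_from_xml(xml_text)
print(xml_text)
print(parsed)
```

Any program that can parse XML, on any platform, could read the same text, which is the portability the slide refers to.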

Types of Data Models Object models

Object Models A collection of objects or classes through which a computer program can manipulate data Objects have “properties” and “methods” Container that wraps data within a set of functions – Ensure that the data are used appropriately – Provide standardized, reusable functionality

Object Model Class/Object Properties Methods
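A minimal sketch of the idea of properties and methods wrapping data follows. The `TimeSeries` class, its rule that timestamps must increase, and its `mean` method are hypothetical choices made only to show how an object can guard its data and offer reusable functionality.

```python
from datetime import datetime


class TimeSeries:
    """Container that wraps values and controls how they are modified."""

    def __init__(self, variable, units):
        self._variable = variable
        self._units = units
        self._values = []  # list of (datetime, float) pairs

    # Properties: read-only views of the object's state.
    @property
    def variable(self):
        return self._variable

    @property
    def units(self):
        return self._units

    @property
    def count(self):
        return len(self._values)

    # Methods: the only supported ways to change or summarize the data.
    def add_value(self, timestamp, value):
        if self._values and timestamp <= self._values[-1][0]:
            raise ValueError("timestamps must be strictly increasing")
        self._values.append((timestamp, float(value)))

    def mean(self):
        if not self._values:
            raise ValueError("time series is empty")
        return sum(v for _, v in self._values) / len(self._values)


ts = TimeSeries("Temperature", "degrees Celsius")
ts.add_value(datetime(2012, 9, 1, 0, 0), 10)
ts.add_value(datetime(2012, 9, 1, 1, 0), 12)
print(ts.count, ts.mean())
```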

Some Data Models Commonly Used in Hydrology CUAHSI Observations Data Model (ODM) Arc Hydro Arc Hydro Groundwater NetCDF

Observations Data Model (ODM)
Example data types: soil moisture data, streamflow, flux tower data, groundwater levels, water quality, precipitation and climate
– A relational database at the single observation level
– Metadata for unambiguous interpretation
– Traceable heritage from raw measurements to usable information
– Promote syntactic and semantic consistency
– Cross dimension retrieval and analysis
Horsburgh, J. S., D. G. Tarboton, D. R. Maidment, and I. Zaslavsky (2008), A relational model for environmental and water resources data, Water Resources Research, 44, W05406, doi: /2007WR

What are the basic attributes to be associated with each single data value and how can these best be organized?
[Figure: a data value $v_i(s,t)$ located on three axes: Space, S (“Where”), Time, T (“When”), and Variables, V (“What”)]
– A data value: Source/Organization, Units, Accuracy, Censoring, Qualifying comments
– “What”: Variable, Method, Quality Control Level, Sample Medium, Value Type, Data Type
– “Where”: Location, Feature of interest
– “When”: DateTime, Interval (support)

Data Series – A Time Series of Hydrologic Observations
[Figure: a data series located on Space, Time, and Variables axes by Variable $V_i$, Site $S_j$, Begin Date Time $t_1$, End Date Time $t_2$, and Count $C$]
There are $C$ measurements of Variable $V_i$ at Site $S_j$ from time $t_1$ to time $t_2$
Defined by unique combinations of:
– Site
– Variable
– Method
– Source
– Quality Control Level
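The definition of a data series can be sketched as a grouping operation: records that share the same Site, Variable, Method, Source, and Quality Control Level belong to one series, and each series is summarized by its begin time, end time, and count. The record layout and the sample values below are assumptions for illustration only.

```python
from collections import defaultdict
from datetime import datetime


def summarize_series(records):
    """Group records into data series and report begin, end, and count."""
    groups = defaultdict(list)
    for r in records:
        key = (r["site"], r["variable"], r["method"], r["source"], r["qc_level"])
        groups[key].append(r["datetime"])
    return {
        key: {"begin": min(times), "end": max(times), "count": len(times)}
        for key, times in groups.items()
    }


def make(hour, qc_level):
    # Helper that builds a hypothetical record; only time and QC level vary.
    return {"site": "USU-LBR-Wellsville", "variable": "Temperature",
            "method": "sonde", "source": "USU", "qc_level": qc_level,
            "datetime": datetime(2012, 9, 1, hour)}


records = [make(0, 0), make(1, 0), make(2, 0), make(0, 1)]
series = summarize_series(records)

raw_key = ("USU-LBR-Wellsville", "Temperature", "sonde", "USU", 0)
print(len(series), series[raw_key])
```

In this made-up data, the raw values (quality control level 0) and the single checked value (level 1) form two separate series even though the site and variable are the same.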

ODM Sites (where) Variables (what) Methods (how) Sources (who) Quality Control Levels Values + (when)

Controlled Vocabularies

Controlled Vocabularies Reducing Semantic Heterogeneity
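One way to picture how a controlled vocabulary reduces semantic heterogeneity is a lookup that maps the many names people use for the same thing onto a single approved term. The terms and synonyms below are invented for this sketch and are not taken from the ODM controlled vocabularies.

```python
# Hypothetical controlled terms, each with informal names seen in raw files.
SYNONYMS = {
    "Temperature": ["temp", "water temp", "water temperature", "temperature"],
    "Discharge": ["flow", "streamflow", "discharge", "q"],
}

# Invert to a lookup from normalized informal name to controlled term.
LOOKUP = {
    alias: term
    for term, aliases in SYNONYMS.items()
    for alias in aliases
}


def to_controlled_term(raw_name):
    """Return the controlled term for a raw name, or None if unknown."""
    normalized = " ".join(raw_name.lower().split())
    return LOOKUP.get(normalized)


for name in ["Water Temp", "  STREAMFLOW ", "turbidity"]:
    print(repr(name), "->", to_controlled_term(name))
```

Names that return `None` would need a person to either map them to an existing term or propose a new one, which is how a vocabulary stays controlled.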

Implementing ODM Relational database schemas exist for: – Microsoft SQL Server – MySQL

ODM Example: Water Quality from a Profile in a Lake

Linking Point Observations to Hydrologic Features

Arc Hydro: GIS for Water Resources Arc Hydro – An ArcGIS data model for water resources – Arc Hydro toolset for implementation – Framework for linking hydrologic simulation models The Arc Hydro data model and application tools are in the public domain Published in 2002, now in revision for Arc Hydro II

Real World Hydrologic Features

What are some important entities in a data model for surface water hydrology?

Arc Hydro Framework Input Data: Streams, Watersheds, Waterbody, Hydro Points

Arc Hydro Framework Data Model

What Can I Do with ArcHydro? ArcHydro defines flow lines and junctions and encodes flow directions ArcHydro encodes relationships among watersheds, streams, and junctions Establishes hydrologic connectivity between catchments (polygons), stream reaches (lines), and junctions (points)

What Can I Do with ArcHydro? Network Tracing Select all streams above a point Select the downstream path for a point
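Network tracing can be illustrated without any GIS software. The sketch below assumes a simple tree-shaped stream network in which every reach drains to at most one downstream reach. The reach names and the dictionary representation are hypothetical, and this is not how the Arc Hydro tools are implemented.

```python
# Each reach maps to the reach it flows into; None marks the outlet.
DOWNSTREAM = {"A": "C", "B": "C", "C": "E", "D": "E", "E": None}


def trace_downstream(reach, downstream):
    """Follow flow direction from a reach to the outlet."""
    path = []
    current = downstream[reach]
    while current is not None:
        path.append(current)
        current = downstream[current]
    return path


def trace_upstream(reach, downstream):
    """Collect every reach that eventually drains into the given reach."""
    upstream = {}
    for src, dst in downstream.items():
        if dst is not None:
            upstream.setdefault(dst, []).append(src)

    found, stack = [], [reach]
    while stack:
        node = stack.pop()
        for parent in upstream.get(node, []):
            found.append(parent)
            stack.append(parent)
    return sorted(found)


print(trace_downstream("A", DOWNSTREAM))
print(trace_upstream("C", DOWNSTREAM))
print(trace_upstream("E", DOWNSTREAM))
```

Both traces depend on flow direction being encoded in the data, which is why the data model stores connectivity rather than only geometry.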

Arc Hydro Tools for ArcGIS Terrain analysis: preparing DEM derivatives Watershed processing: watershed delineation from DEMs Attribute tools: computing and populating attributes and identifiers Network tools: creating the hydro network Focus: getting data into Arc Hydro and working with it once it is there.

Arc Hydro Time Series
– Variable: string describing what is being measured or calculated
– Units: string describing units
– IsRegular: boolean indicating if the data are regularly spaced
– TSInterval: controlled vocabulary for time intervals
– DataType: statistic for value measured over interval
– Origin: indication of whether the values are measured or calculated
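The IsRegular attribute describes a property that can be checked directly from the timestamps. The function below is one plausible way to do so, assuming "regular" means that every pair of consecutive timestamps is separated by the same interval; the lecture does not say how Arc Hydro itself determines this flag.

```python
from datetime import datetime, timedelta


def is_regular(timestamps):
    """True if consecutive timestamps are all separated by one interval."""
    steps = {later - earlier
             for earlier, later in zip(timestamps, timestamps[1:])}
    return len(steps) <= 1


start = datetime(2012, 9, 1)
hourly = [start + timedelta(hours=h) for h in range(4)]
with_gap = hourly[:2] + [start + timedelta(hours=5)]

print(is_regular(hourly))
print(is_regular(with_gap))
```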

Arc Hydro Groundwater Data model and tools for managing groundwater data in ArcGIS

What are important entities in a groundwater data model?

Arc Hydro GW Data Model

Arc Hydro GW Tools Groundwater Analyst Subsurface Analyst MODFLOW Analyst

NetCDF A platform independent format for representing multi-dimensional, array-oriented scientific data Continuous space-time data model – Both time and space are varying Especially useful for time-varying grids – Time varying precipitation fields (e.g., radar rainfall data) Used extensively in the weather and climate domains

NetCDF Characteristics
NetCDF (network Common Data Form)
– Self Describing: a netCDF file includes information about the data it contains
– Direct Access: a small subset of a large dataset may be accessed efficiently, without first reading through all the preceding data
– Sharable: one writer and multiple readers may simultaneously access the same netCDF file

Multidimensional Data Time = 1 Time = 2 Time = 3

Multidimensional Data – Space and Time
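The idea behind direct access to one time step of a space-time grid can be sketched with simple index arithmetic. The example assumes values are stored in row-major order with time as the slowest-varying dimension. It ignores headers, record variables, and everything else in the real NetCDF file layout, so it illustrates the concept only, and the grid shape is made up.

```python
def flat_index(shape, index):
    """Row-major position of a multidimensional index in a flat array."""
    if len(shape) != len(index):
        raise ValueError("index must have one entry per dimension")
    offset = 0
    for size, i in zip(shape, index):
        if not 0 <= i < size:
            raise IndexError("index out of range")
        offset = offset * size + i
    return offset


# Hypothetical grid: 3 time steps, 2 rows, 4 columns, stored as one flat list.
shape = (3, 2, 4)
data = list(range(3 * 2 * 4))

# Jump straight to the grid for the second time step (time index 1)
# without visiting the values stored before it.
cells_per_step = shape[1] * shape[2]
start = flat_index(shape, (1, 0, 0))
second_step = data[start:start + cells_per_step]

print(start, second_step)
```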

The NetCDF File
NetCDF is a binary file. A NetCDF file consists of:
– Global Attributes: describe the contents of the file
– Dimensions: define the structure of the data (e.g., Time, Depth, Latitude, Longitude)
– Variables: hold the data in arrays shaped by Dimensions
– Variable Attributes: describe the contents of each variable
A CDL (network Common Data form Language) description takes the following form:
    netCDF name {
        dimensions: ...
        variables: ...
        data: ...
    }
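To make the parts of a NetCDF file more concrete, the sketch below builds a CDL-style text header from plain Python dictionaries. It does not read or write real NetCDF files and uses no NetCDF library. The function name `to_cdl`, the dataset name, and all dimensions, variables, and attributes are hypothetical, only string attributes are handled, and the `data:` section is omitted.

```python
def to_cdl(name, dimensions, variables, global_attrs):
    """Render a header-only, CDL-style description as text."""
    lines = [f"netcdf {name} {{", "dimensions:"]
    for dim, size in dimensions.items():
        lines.append(f"    {dim} = {size} ;")

    lines.append("variables:")
    for var_name, spec in variables.items():
        dims = ", ".join(spec["dims"])
        lines.append(f"    {spec['type']} {var_name}({dims}) ;")
        # Variable attributes describe the contents of each variable.
        for attr, val in spec.get("attrs", {}).items():
            lines.append(f'        {var_name}:{attr} = "{val}" ;')

    # Global attributes describe the contents of the whole file.
    lines.append("")
    lines.append("// global attributes:")
    for attr, val in global_attrs.items():
        lines.append(f'        :{attr} = "{val}" ;')
    lines.append("}")
    return "\n".join(lines)


cdl = to_cdl(
    "precip_example",
    dimensions={"time": 3, "lat": 2, "lon": 4},
    variables={
        "precip": {"type": "float", "dims": ["time", "lat", "lon"],
                   "attrs": {"units": "mm"}},
    },
    global_attrs={"title": "Hypothetical time-varying precipitation grid"},
)
print(cdl)
```

The variable `precip` is shaped by the three dimensions, which is the relationship between Dimensions and Variables that the slide describes.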

Considerations in Modeling Data Is there an existing data model that will work for my data? What are the top 20 queries or analyses I need to do with the data? What software do I want to use? How will I want to share the data?

Advantages of Formal Data Models Provide a high degree of structure to data Generally implemented in software that has robust querying, manipulation, and visualization capabilities (e.g., RDBMS or GIS) Facilitate software development Can help in capturing the semantics of data

Disadvantages Can be stiff and difficult to change Difficult to anticipate needs in the design stages Can be incompatible across organizations Can become complex

Summary (1) A data model provides a definition of a formal structure for data There are several flavors of data models, each with different strengths, weaknesses, and appropriate uses Data models can facilitate software development

Summary (2) Common data models used in hydrology – The CUAHSI Observations Data Model (ODM) provides an organizational structure for hydrologic time series data – Arc Hydro is a geographic data model for surface hydrologic features – ArcHydro Groundwater adds subsurface hydrologic features, geology, borehole data, and hydrostratigraphy – NetCDF combines both geospatial and temporal domains into a continuous space-time data model

References and Credits
– Horsburgh, J.S., D.G. Tarboton (2012). CUAHSI Community Observations Data Model (ODM) Version Design Specifications, CUAHSI, Washington, D.C.
– Horsburgh, J. S., D. G. Tarboton, D. R. Maidment, and I. Zaslavsky (2008), A relational model for environmental and water resources data, Water Resources Research, 44, W05406.
– Maidment, D.R. (ed.) (2002). Arc Hydro GIS for Water Resources, ESRI Press, Redlands, CA, 203 p.
– Strassberg, G., N.L. Jones, D.R. Maidment (2011). Arc Hydro Groundwater GIS for Hydrogeology, ESRI Press, Redlands, CA, 160 p.
Credits: Arc Hydro slides used with permission from David Maidment, University of Texas at Austin. ArcHydro Groundwater slides used with permission from Norm Jones, Brigham Young University/Aquaveo.