Logical consistency February 24, 2006 Geog 458: Map Sources and Errors.

Slides:



Advertisements
Similar presentations
Three-Step Database Design
Advertisements

Chapter 10: Designing Databases
Graphic Data Clean-Up Issues. Graphic Data Clean-up Very Important if you plan to use existing CAD or GIS data from another agency, department, or private.
WFM 6202: Remote Sensing and GIS in Water Management © Dr. Akm Saiful IslamDr. Akm Saiful Islam WFM 6202: Remote Sensing and GIS in Water Management Akm.
ESRM 250 & CFR 520: Introduction to GIS © Phil Hurvitz, KEEP THIS TEXT BOX this slide includes some ESRI fonts. when you save this presentation,
Vector-Based GIS Data Processing Chapter 6. Vector Data Model Feature Classes points lines polygons Layers limited to one class of data Figure p. 186.
Lecture 4: Intro to the Vector Data Model and to Map Layout
Lecture 4: Intro to the Vector Data Model and to Map Layout
Geog 458: Map Sources and Errors January 20, 2006 Data Storage and Editing.
@2007 Austin Troy Lecture 4: An Introduction to the Vector Data Model and Map Layout Techniques Introduction to GIS By Brian Voigt University of Vermont.
The Relational Database Model:
Completeness February 27, 2006 Geog 458: Map Sources and Errors.
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 3 The Relational Database Model.
The Relational Model Codd (1970): based on set theory Relational model: represents the database as a collection of relations (a table of values --> file)
Dr. David Liu Objectives  Understand what a GIS is  Understand how a GIS functions  Spatial data representation  GIS application.
Geographical Information System GIS By: Yahia Dahash.
Rebecca Boger Earth and Environmental Sciences Brooklyn College.
@2007 Austin Troy Lecture 4: An Introduction to the Vector Data Model and Map Layout Techniques Introduction to GIS By Brian Voigt University of Vermont.
Data Quality Data quality Related terms:
Advanced Editing: Rules-Based Topology in ArcEditor.
Entity-Relationship Design
Esri UC 2014 | Technical Workshop | Data Alignment and Management in ArcMap Lisa Stanners, Sean Jones.
Spatial data models (types)
Spatial Data Editing Chapter 5. Introduction All digitizing involves errors In the real world, revisions are required Keeping data up-to-date is part.
Introduction to ArcGIS for Environmental Sciences Day 2 – Fundamentals Module 8 Creating & Editing Data Creating Metadata.
NR 422: Topology Jim Graham Fall 2010 See: odatabase-topology.pdf.
Parcel Data Models for the Geodatabase
Applied Cartography and Introduction to GIS GEOG 2017 EL
Geographic Information System GIS This project is implemented through the CENTRAL EUROPE Programme co-financed by the ERDF GIS Geographic Inf o rmation.
Applied Cartography and Introduction to GIS GEOG 2017 EL Lecture-2 Chapters 3 and 4.
Major parts of ArcGIS ArcView -Basic mapping, editing and Analysis tools ArcEditor -all of ArcView plus Adds ability to deal with topological and network.
Data input 1: - Online data sources -Map scanning and digitizing GIS 4103 Spring 06 Adina Racoviteanu.
Lecture 7 Integrity & Veracity UFCE8K-15-M: Data Management.
8. Geographic Data Modeling. Outline Definitions Data models / modeling GIS data models – Topology.
How do we represent the world in a GIS database?
CHAPTER 3 VECTOR DATA MODEL.
1 Data models Vector data model Raster data model.
URBDP 422 Urban and Regional Geo-Spatial Analysis Lecture 2: Spatial Data Models and Structures Lab Exercise 2: Topology January 9, 2014.
Data Storage and Editing (17/MAY/2010) Dr. Ahmad BinTouq URL:
Database Systems, 9th Edition 1.  In this chapter, students will learn: That the relational database model offers a logical view of data About the relational.
Map overlays & Geoprocessing Learn about spatial analysis functions overlays, clipping & buffering Use overlays to analyze multiple spatial criteria Understand.
Query and Reasoning. Types of Queries Most GIS queries will select spatial features Query by Attribute (Select by Attribute) –Structured Query Language.
Database Systems: Design, Implementation, and Management Tenth Edition Chapter 3 The Relational Database Model.
Database Systems: Design, Implementation, and Management Ninth Edition Chapter 3 The Relational Database Model.
Advanced Editing: Rules-Based Topology in ArcEditor
GIS Data Types. GIS technology utilizes two basic types of data 1. Spatial Data Describes the absolute and relative location of geographic features.
Geographic Data in GIS. Components of geographic data Three general components to geographic information Three general components to geographic information.
Information Access Mgt09/12/971 Entity-Relationship Design Information Level Design.
Topology Relationships between features: Supposed to prevent:
Copyright © 2006 by Maribeth H. Price 13-1 Chapter 13 Working with Geodatabases.
Data Alignment and Management in ArcMap
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 3 The Relational Database Model.
Chapter 3 The Relational Database Model. Database Systems, 10th Edition 2 * Relational model * View data logically rather than physically * Table * Structural.
What is GIS? “A powerful set of tools for collecting, storing, retrieving, transforming and displaying spatial data”
Database Systems: Design, Implementation, and Management Eighth Edition Chapter 3 The Relational Database Model.
GTECH 361 Lecture 09 Features in the Geodatabase.
Data Storage & Editing GEOG370 Instructor: Christine Erlien.
IT 5433 LM3 Relational Data Model. Learning Objectives: List the 5 properties of relations List the properties of a candidate key, primary key and foreign.
TUGIS March 15, 2016 Next Generation 911 Data Management TUGIS 2016.
Chapter 13 Editing and Topology.
Physical Structure of GDB
Physical Structure of GDB
Chapter 4 Relational Model Characteristics
Lecture 2 The Relational Model
Geography 413/613 Lecturer: John Masich
Database solutions Chosen aspects of the relational model Marzena Nowakowska Faculty of Management and Computer Modelling Kielce University of Technology.
Analysis models and design models
The Arc-Node Data Model
Chapter 17 Designing Databases
ArcCatalog and Geodatabases
Presentation transcript:

Logical consistency February 24, 2006 Geog 458: Map Sources and Errors

Outlines What is logical consistency? Testing spatial consistency –Attribute consistency –Temporal consistency –Spatial consistency Documenting logical consistency in metadata

Logical consistency Lack of contradiction in a database Fidelity of relationships encoded in the data structure of the digital cartographic data (SDTS) Ensures logical consistency of operations performed on data Deals with logical rules applied to space, time, and attribute Attribute consistency draws upon database integrity rules (database consistency) Spatial consistency refers to the conformance to topological rules based on graph theory (topological consistency)

Attribute consistency Key constraints –Primary Key (PK: attributes of a class that uniquely identify an instance of the class) should be unique I.e. should not be repeated –PK should be Not Null (if it’s missing, how would you identify an instance uniquely?) Referential consistency constraints –Foreign Key (FK: attributes of a class that establish relationships among tables) should correspond to PK in related tables, or ensure the existence of related tables –No value should be inserted in a table as FK without a corresponding column in the related tables (i.e. they should be updated simultaneous whenever update occurs)

Attribute consistency Domain consistency rules (consistency within column values) –Attribute values must fall within certain ranges or may assume only certain pre-defined values E.g. the value for the day may only be in the range 1 and 31 E.g. area or length cannot be negative E.g. a location in geographical coordinates can only in a certain range –Domains can be composite E.g. a location in Degree Minute Second E.g. a day in Day Month Year

Attribute consistency Consistency between column values –A given attribute value must comply with other attributes when its value is derived from the attributes 5 digit FIPS code for county should be consistent with values for state and county –A given attribute value must also comply with the value that can be derived from the metric characteristics of spatial objects The area of a parcel stored in a DB must be consistent with the computed are based on the stored geometry –Whenever column values are changed, whose values are derived from that column, should be updated accordingly

Attribute consistency Broadly defined, conformance to logical rules applied to attributes –E.g. If the study area is the conterminous U.S., longitude should be negative (since it’s in the western hemisphere) Usually, database management system enforces integrity rules automatically –Automatic update, NotNull enforcement If GIS is not implemented as DBMS, you should check them manually –Use of summarize tool for checking key/domain constraints –Compute metric values; Compare metric values with stored values –Table join with master table; compare the master value to stored values

Temporal consistency No violation of temporal topological rule Mostly temporal information is treated as attributes in common DB system For example, –Individual travel survey data: one person can exist at one point in time and at one point in space –Traffic accident data: accident time should occur before dispatch time

Possible temporal relationships between entities Process: a series of changes with some unifying principle Event: countable occurrence located in point in space –Can be point-based (at some point in time) –Can be line-based (at some interval in time) Possible qualitative relations between two intervals You can use this interval-logic for testing temporal consistency: e.g. session A should be during the conference

Spatial consistency Mostly refers to the conformance to topological rules Can arise at various stages of data handling –Digitizing/ updating error –Error propagation through processing –Miscoding of topological relationships Topological rules can be checked on topological vector data model (link-node model); can be checked within a single layer or between layers

Topological rules within a layer Missing node makes a correct topological description impossible (e.g. road intersection will make it connected to other links) Pseudo nodes (nodes where only two edges meet) can lengthen computation time Undershoots and overshoots (edges that end in only one node) may be fictitious line due to digitizing errors Duplicate lines: comes from manual digitizing, when two data sets are to be merged; can create slivers Label points (reference points or centroids): used to link polygon to attributes to place labels –Missing label points: can cause the inconsistency between geometric data description and topological data description –Multiple label points: # polygons =/= actual # polygons Read SDTS data quality (or reading in the course package)

Some encodings are not necessarily errors What ‘s encoded in the database? What it is indeed?Is it consistent?How to check errors? Dangling nodeDead endYes (no error)Check dangling node: too short one? How do I know if it’s a dangling node? Dangling nodeThere’s no such road segment No (error) Intersecting nodeRoad intersectionYes (no error)Check intersecting node: overlay with overpass layer if any Intersecting nodeOverpassNo (error)

Possible spatial relationships between entities Metric –Distance, direction Topological –Qualitative spatial relation between regions Disjoint, meet, within, covered by, cover, contain, equal, overlap You can compare this relational info. encoded in the database to the actual relational info. (e.g. King County should be within Washington State)

Topological rules between layers For example, the street center lines should fall inside the pavement area; rivers should be inside their floodplain; zoning boundaries should follow certain parcel boundaries (most of the time); hierarchical relationships should nest (cities inside counties inside states - except Bothell)... too many potential relationships to delineate them all

Detecting topological errors Functionalities available in GIS –Nodeerrors: check whether there are node errors (pseudo node, dangling node) –Labelerrors: check whether there are missing/multiple label points of polygon –Display node by types –Trace: check connectivity Your own method –Missing node: build/clean the data and compare the test data with the cleaned data…

Editing topological errors: automated vs manual Automated methods –Use “BUILD” and “CLEAN” command given a threshold; it can remove duplicate lines, sliver, undershoot, and overshoot given threshold in batch –But care should be exercised since topology building is dependent on threshold Interactive methods –Manual inspection with some visual aids

Editing topological errors: depending GISystem The Arc/INFO CLEAN and BUILD do topology building/cleaning. This is now hidden in the ArcToolbox, and invokes in hidden ways with less ability to control it. Since the coverage model is not as central anymore, An ArcMap shapefile can contain overlapping shapes. There is no easy way to get all the slivers and gaps edited away. SDE (Spatial Database Engine) does have advanced topological error checking even though it uses something equivalent to a shapefile (and quite different in storage details). SDE can find all shared segments and correct them.

Logical consistency and object- oriented database You can embed any rules (as much as you can come up with) in the database Spatial consistency by defining topological rules Temporal consistency by defining topological rules Thematic consistency by defining domain values It can also embed the relationships with other entities, in addition to internal consistency Quality of database can be ensured better as it checks the possible logical inconsistency internally based on the rules

Logical consistency test: general procedures Divide database into space, time, and attribute components (whenever necessary) Choose samples in space, time, and attribute Come up with logical rules to be followed for each component Check if the encoding in the database conforms to the rules Report on % violation of rules out of total cases tested for each component

Documenting logical consistency in the metadata State what elements are tested State the number of features that were checked State the number of inconsistencies encountered –Describe the methods used for detecting errors State a detailed specification of the nature of the problem and possible solutions –Mention whether it is corrected or not If possible, provide a graphical display of the error condition