Data Storage and Editing (17/MAY/2010) Dr. Ahmad BinTouq URL:

Slides:



Advertisements
Similar presentations
Center for Modeling & Simulation.  A Map is the most effective shorthand to show locations of objects with attributes, which can be physical or cultural.
Advertisements

GIS UPDATE? ARE YOU TAKING NOTES? How will you remember what you did if you do not take notes. Lab 9 this week: Music Festival 3: Vector Analysis.
From portions of Chapter 8, 9, 10, &11. Real world is complex. GIS is used model reality. The GIS models then enable us to ask questions of the data by.
Managing Error, Accuracy, and Precision In GIS. Importance of Understanding Error *Until recently, most people involved with GIS paid little attention.
Vector-Based GIS Data Processing Chapter 6. Vector Data Model Feature Classes points lines polygons Layers limited to one class of data Figure p. 186.
Introduction to Cartography GEOG 2016 E
Maps as Numbers Getting Started with GIS Chapter 3.
Logical consistency February 24, 2006 Geog 458: Map Sources and Errors.
CS 128/ES Lecture 5b1 Vector Based Data. CS 128/ES Lecture 5b2 Spatial data models 1.Raster 2.Vector 3.Object-oriented Spatial data formats:
Lecture 4: Intro to the Vector Data Model and to Map Layout
Geog 458: Map Sources and Errors January 20, 2006 Data Storage and Editing.
@2007 Austin Troy Lecture 4: An Introduction to the Vector Data Model and Map Layout Techniques Introduction to GIS By Brian Voigt University of Vermont.
Introduction to Spatial Analysis
Maps as Numbers Lecture 3 Introduction to GISs Geography 176A Department of Geography, UCSB Summer 06, Session B.
Data Storage and Editing
Lecture 16: Data input 1: Digitizing and Geocoding By Austin Troy University of Vermont Using GIS-- Introduction to GIS.
Fundamentals of GIS Materials by Austin Troy © 2008 Lecture 18: Data Input: Geocoding and Digitizing By Austin Troy University of Vermont NR 143.
WFM 6202: Remote Sensing and GIS in Water Management
©2005 Austin Troy. All rights reserved Lecture 3: Introduction to GIS Part 1. Understanding Spatial Data Structures by Austin Troy, University of Vermont.
1 CIS / Introduction to Business GIS Winter 2005 Lecture 2 Dr. David Gadish.
Data Input How do I transfer the paper map data and attribute data to a format that is usable by the GIS software? Data input involves both locational.
Digitizing There are three primary methods for digitizing spatial information: Manual Methods include: Tablet Digitizing Heads-up Digitizing An Automated.
Dr. David Liu Objectives  Understand what a GIS is  Understand how a GIS functions  Spatial data representation  GIS application.
9. GIS Data Collection.
Data Acquisition Lecture 8. Data Sources  Data Transfer  Getting data from the internet and importing  Data Collection  One of the most expensive.
Lecture 23: Brief Introduction to Data quality By Austin Troy Using GIS-- Introduction to GIS.
Raster and Vector 2 Major GIS Data Models. Raster and Vector 2 Major GIS Data Models.
@2007 Austin Troy Lecture 4: An Introduction to the Vector Data Model and Map Layout Techniques Introduction to GIS By Brian Voigt University of Vermont.
Data Quality Data quality Related terms:
Spatial data models (types)
Spatial Data Editing Chapter 5. Introduction All digitizing involves errors In the real world, revisions are required Keeping data up-to-date is part.
NR 422: Topology Jim Graham Fall 2010 See: odatabase-topology.pdf.
Spatial (coordinate) data model Relational (tabular) data model
Preparing Data for Analysis and Analyzing Spatial Data/ Geoprocessing Class 11 GISG 110.
Applied Cartography and Introduction to GIS GEOG 2017 EL
Spatial Analysis (Vector I) Reading Assignment: Bolstad Chapter 9 (pp )
Map Scale, Resolution and Data Models. Components of a GIS Map Maps can be displayed at various scales –Scale - the relationship between the size of features.
Chapter 3 Sections 3.5 – 3.7. Vector Data Representation object-based “discrete objects”
Applied Cartography and Introduction to GIS GEOG 2017 EL
Fundamentals of GIS Materials by Austin Troy © 2008 Lecture 18: Data Input: Geocoding and Digitizing By Austin Troy University of Vermont.
Scale, Resolution and Accuracy in GIS
Maps as Numbers Getting Started with GIS Chapter 3.
GIS Data Quality.
Data input 1: - Online data sources -Map scanning and digitizing GIS 4103 Spring 06 Adina Racoviteanu.
8. Geographic Data Modeling. Outline Definitions Data models / modeling GIS data models – Topology.
How do we represent the world in a GIS database?
Cartographic and GIS Data Structures Dr. Ahmad BinTouq URL:
1 Data models Vector data model Raster data model.
URBDP 422 Urban and Regional Geo-Spatial Analysis Lecture 2: Spatial Data Models and Structures Lab Exercise 2: Topology January 9, 2014.
Vector data input and editing Scanning (scan digitizing) –drum scanner scanner scans in map image –specialized software ‘extracts’ point, line, and area.
OUTLINE:  geocoding  digitizing terms and methods  scanning methods  adding attributes OUTLINE:  geocoding  digitizing terms and methods  scanning.
GIS Data Models Vector Data Models Vector File Formats Raster Data Models Raster File Formats.
1 Spatial Data Models and Structure. 2 Part 1: Basic Geographic Concepts Real world -> Digital Environment –GIS data represent a simplified view of physical.
Map overlays & Geoprocessing Learn about spatial analysis functions overlays, clipping & buffering Use overlays to analyze multiple spatial criteria Understand.
Topology Relationships between features: Supposed to prevent:
Introduction to Geographic Information Systems
GIS September 27, Announcements Next lecture is on October 18th (read chapters 9 and 10) Next lecture is on October 18th (read chapters 9 and 10)
Raster Data Models: Data Compression Why? –Save disk space by reducing information content –Methods Run-length codes Raster chain codes Block codes Quadtrees.
Spatial Data Models Geography is concerned with many aspects of our environment. From a GIS perspective, we can identify two aspects which are of particular.
WFM 6202: Remote Sensing and GIS in Water Management © Dr. Akm Saiful IslamDr. Akm Saiful Islam WFM 6202: Remote Sensing and GIS in Water Management Dr.
CENTENNIAL COLLEGE SCHOOL OF ENGINEERING & APPLIED SCIENCE VS 361 Introduction to GIS ERROR, ACCURACY & PRECISION COURSE NOTES 1.
Data Storage & Editing GEOG370 Instructor: Christine Erlien.
Geocoding Chapter 16 GISV431 &GEN405 Dr W Britz. Georeferencing, Transformations and Geocoding Georeferencing is the aligning of geographic data to a.
Geocoding Chapter 16 GISV431 &GEN405 Dr W Britz. Georeferencing, Transformations and Geocoding Georeferencing is the aligning of geographic data to a.
Chapter 13 Editing and Topology.
Data Queries Raster & Vector Data Models
GIS Analysis Functions
Geography 413/613 Lecturer: John Masich
URBDP 422 URBAN AND REGIONAL GEO-SPATIAL ANALYSIS
Spatial Data Entry via Digitizing
Presentation transcript:

Data Storage and Editing (17/MAY/2010) Dr. Ahmad BinTouq URL: GEO 321: GIS 1

Overview 1.Accuracy vs. precision 2.Error propagation and cascading of error in GIS 3.Types of errors in vector GIS 4.Types of errors in attribute data 5.The concept of topology - what is it, what types of Relationships are stored for point, line, and poly features, why do we need it in GIS?

Introduction Any analysis performs must be based on good data, correctly organized and in the proper format. cyVsPrecision.pdfhttp://gif.berkeley.edu/documents/Accura cyVsPrecision.pdf ft/notes/error/error.htmlhttp:// ft/notes/error/error.html

Accuracy The degree to which information on a map or in a digital database matches true or accepted values An issue pertaining to the quality of data and the number of errors contained in a data set or map It is possible to consider horizontal and vertical accuracy with respect to geographic position Attribute accuracy - conceptual, and logical accuracy Level of accuracy required for particular applications varies greatly. Highly accurate data can be very difficult and costly to produce and compile e.g., mapping standards employed by the United States Geological Survey (USGS): "requirements for meeting horizontal accuracy as 90 per cent of all measurable points must be within 1/30th of an inch for maps at a scale of 1:20,000 or larger, and 1/50th of an inch for maps at scales smaller than 1:20,000."

Accuracy Standards for Various Scale Maps 1:1,200 ± 3.33 feet 1:2,400 ± 6.67 feet 1:4,800 ± feet 1:10,000 ± feet 1:12,000 ± feet 1:24,000 ± feet 1:63,360 ± feet 1:100,000 ± feet

Accuracy Standards for Various Scale Maps 1:1,200 ± 3.33 feet 1:2,400 ± 6.67 feet 1:4,800 ± feet 1:10,000 ± feet 1:12,000 ± feet 1:24,000 ± feet 1:63,360 ± feet 1:100,000 ± feet

Precision Refers to the level of measurement and exactness of description in a GIS database (e.g., number of decimal places) Precise locational data may measure position to a fraction of a unit e.g. to the millimeter Precise attribute information may specify the characteristics of features in great detail Important to realize, however, that precise data--no matter how carefully measured--may be inaccurate Level of precision required for particular applications varies greatly. Engineering projects such as road construction require very precise information measured to the millimeter. Demographic analyses of marketing or electoral trends can often make do with less, say to the closest zip code or precinct boundary

Why be concerned about error? - The Problems of Propagation and Cascading Discussion focused to this point on errors that may be present in single sets of data ”Doing" GIS usually involves comparisons of many sets of data. If errors exist in one or all of the data layers, the solution to the GIS problem generated from them may itself be erroneous Inaccuracy, imprecision, and error may be compounded in GIS that employ many data sources

DIGITIZATION- continue Tic Geographic features

Error Propagation and Cascading Occurs when one error leads to another Means that erroneous, imprecise, and inaccurate information will skew a GIS solution when information is combined DeMers - "error prone data will lead to error prone analysis" e.g., if a map registration point has been mis-digitized in one coverage and is then used to register a second coverage Result = the second coverage will propagate the first mistake In this way, a single error may lead to others and spread until it corrupts data throughout the entire GIS project

Entity Errors: Vector Six categories identified by DeMers/ESRI 1. All entities that should have been entered are present 2. No extra entities have been entered 3.Entities are in the right place and 4.Entities are in the correct shape and size 5. Entities that are supposed to be connected to each other are all polygons have a single label point which identifies them 6. All entities are within the outside boundary identified

Nodes and Vertices Specific types of entity errors in vector GIS –can involve points, lines, polygons, nodes, vertices, label points –nodes - denote ends of lines or point where polygon closes on itself – vertices - denote change or direction within a line – points -> lines -> polys Nodes are used to show specific topological relationships, e.g.: – intersection of roads or streams – intersection between stream and lake – node errors include pseudo nodes and dangle nodes

Pseudo nodes Occur where lines connect with itself or other line A line connects with itself to form a polygon, a.k.a. island pseudo node (fig. 6.1a, p. 161) Also occur where two lines intersect (rather than crossing) (fig. 6.1b) Pseudo nodes are not necessarily errors, but indicate the potential location of errors e.g., pseudo node in the middle of a line representing a node can be used to separate road into two different speed limit zones Others may indicate error, (pushed wrong button when digitizing, placed cursor at wrong location)

Digitization errors- Pseudo node (Diamond) Pseudo node Not representing a serious errors Pseudo node connects two and only two arcs Error Pseudo node

Dangle nodes A single node connected to a single line Again, not necessarily and error, but may be Can result from three possible mistakes: (fig. 6.2, p. 162) –Failure to close a polygon –Undershoot –Overshoot –Sometimes result from incorrect placement, sometimes from fuzzy tolerance and snapping distance One method of general error detection is comparing digitized to original document at equivalent scales good for broad scale obvious errors, not for finer scale errors

DIGITIZATION For linear features such as rivers, roads, railways it is important to digitize each section separately (start node and end node at a specified section) or use Route latter Node

Digitization errors - Dangle Error (square) Dangle error Closed polygon Natural feature Road Overshoot Undershoo t Acceptable dangle node e.g. end of roads

Label point and sliver errors Polygon label point errors ( points -> lines -> polys) Label point is used to associate a polygon with attributes If label point is missing, or there are more than one, indicates error e.g., fig. 6.4, p. 163 Sliver polygon errors Commonly result from incorrect practice of double digitizing Can also result from overlay or merging operations which join coverages from different sources Can be removed manually or by dissolving polygons less than a certain area and/comparing intended number of polys with actual number (Fig. 6.5, p.164)

Digitization errors-Labels  Missing labels or too many labels missing labels too many labels

Sliver polygon errors

Topology Topology is the process of projecting complex surfaces to a simple ones Topology is a procedure for explicitly defining spatial relationships connecting adjacent features (e.g., arcs, nodes, polygons, and points). Different types of spatial relationships are expressed as lists of features e.g. An area is defined by the arcs comprising its border An arc is defined by set of points (X,Y)

Topology-Main Concepts The three major topological concepts are: Connectivity: Arcs connected to each other at nodes Contiguity/Adjacency: Arcs have direction and left and right sides Area Definition:: Arcs connected to surround an area define a polygon (area)

Spatial Relationships (Topology) Area Definition Adjacency Connectivity

PolygonTopology

Advantages of Topology  Check for digitization errors ( overshoot, undershoot, unclosed polygon, missing labels, too many labels)  Store data more efficiently ( eliminate data redundancy-normalization)  Make spatial analysis more faster

Topology Topological data structures dominate GIS software. Topology allows automated error detection and elimination. Rarely are maps topologically clean when digitized or imported. A GIS has to be able to build topology from unconnected arcs. Nodes that are close together are snapped. Slivers due to double digitizing and overlay are eliminated.

Attribute Errors: Raster and Vector Attribute errors generally more difficult to detect Types include: Missing attributes perhaps only kind of attribute error traceable without comparison to source material e.g., plot all polygons and color them according to a certain attribute, if color is missing, attribute is missing Incorrect attribute values or text more difficult to detect one method is to plot all polygons and color them according to a certain attribute, if only one polygon has a certain attribute and there should be other, it may stick out, in general, involves direct comparison with source material)

Dealing With Projection Changes Often times, regardless of input method, separate GIS data input for a project will be based on different projection systems Necessary to transform all data to common system before use in integrated modeling examples in ArcMAP Joining Adjacent Coverages: Edge Matching (Union) Joining two adjacent coverages (usually of the same theme) together to produce a single data set that covers a broader region edge matching also done in raster systems

Conflation and Rubber Sheeting Conflation and Rubber Sheeting : Refers to the registration (georeferencing) of two maps (vector or raster) in a non-linear way (Ovelay two maps) Used to make maps of different sources spatially correspond with one another. Most often used in raster data using ground control points (GCPs). Conflation and rubber sheeting are synonymous terms according to DeMers (Figure 6.1, p. 174) The need to geo-reference internal objects themselves not just the map corners ( Rubber Sheeting) Templating : " cookie cutting" If you have multiple coverage of different extents, the template is used to "cookie cut" them all to the same extent

Exercise 1.Error propagation and cascading of error in GIS 2.Types of errors in vector GIS 3.Types of errors in attribute data 4.The concept of topology - what is it, what types of Relationships are stored for point, line, and poly features, why do we need it in GIS?