Data Quality Issues-Chapter 10

Slides:



Advertisements
Similar presentations
Visualizing maps on the web. What is a Map? A map is a drawing that is the representation, on a certain scale, of a terrain.
Advertisements

Lecture 6 Data entry. Getting the Map into the Computer Get data in finished form Analog-to-Digital maps Digitizing Data Entry Editing and validation.
Managing Error, Accuracy, and Precision In GIS. Importance of Understanding Error *Until recently, most people involved with GIS paid little attention.
MANAGING A GIS PROJECT. Starting Points for GIS: Do your homework: GIS, RS, GPS Get familiar with the terminology Gain general knowledge of spatial analysis:
GIS: The Grand Unifying Technology. Introduction to GIS  What is GIS?  Why GIS?  Contributing Disciplines  Applications of GIS  GIS functions  Information.
1 CPSC 695 Data Quality Issues M. L. Gavrilova. 2 Decisions…
Introduction to Cartography GEOG 2016 E
TERMS, CONCEPTS and DATA TYPES IN GIS Orhan Gündüz.
Maps as Numbers Getting Started with GIS Chapter 3.
Geographic Information Systems and Science SECOND EDITION Paul A. Longley, Michael F. Goodchild, David J. Maguire, David W. Rhind © 2005 John Wiley and.
Beginning the Research Design
You have just been given an aerial photograph that is not registered to real world coordinates. How do you display the aerial with other data layers that.
@2007 Austin Troy Lecture 4: An Introduction to the Vector Data Model and Map Layout Techniques Introduction to GIS By Brian Voigt University of Vermont.
Where we are going today… GPS GPS GIS GIS Hey, there are exams next week. Oct. 4 th and 6 th. Powerpoints now online. Hey, there.
Data Input How do I transfer the paper map data and attribute data to a format that is usable by the GIS software? Data input involves both locational.
Spatial Data: Elements, Levels and Types. Spatial Data: What GIS Uses Bigfoot Sightings: Spatial Data.
Data vs. Information  Data: raw facts or measurements  Information: collection of facts organized/processed in such a way that they have value beyond.
Spatial data quality February 10, 2006 Geog 458: Map Sources and Errors.
GIS Development: Step5 - DB Planning and Design Step6 - Database Construction Step7 - Pilot/Benchmark (Source: GIS AsiaPacific, June/July & August/September.
GIS DATA AND SOURCES. Building Topography Land use Utility Soil Type Roads District Land Parcels Nature of Geography Objects.
Copyright, © Qiming Zhou GEOG1150. Cartography Quality Control and Error Assessment.
GI Systems and Science January 23, Points to Cover  What is spatial data modeling?  Entity definition  Topology  Spatial data models Raster.
9. GIS Data Collection.
Data Acquisition Lecture 8. Data Sources  Data Transfer  Getting data from the internet and importing  Data Collection  One of the most expensive.
Data Quality Data quality Related terms:
EG1106: GI: a primer Field & Survey data collection 19 th November 2004.
KGA172 Space, Place and Nature Accuracy in Mapping Dr Christopher Watson.
Data source for Google earth
GROUP 4 FATIN NUR HAFIZAH MULLAI J.DHANNIYA FARAH AN-NUR MOHAMAD AZUWAN LAU WAN YEE.
Objectives: * 1. Define significant digits. * 2. Explain how to determine which digits in measurement are significant. * 3. Convert measurements in to.
Map Scale, Resolution and Data Models. Components of a GIS Map Maps can be displayed at various scales –Scale - the relationship between the size of features.
Chapter 3 Sections 3.5 – 3.7. Vector Data Representation object-based “discrete objects”
OVERVIEW- What is GIS? A geographic information system (GIS) integrates hardware, software, and data for capturing, managing, analyzing, and displaying.
Geographic Information System GIS This project is implemented through the CENTRAL EUROPE Programme co-financed by the ERDF GIS Geographic Inf o rmation.
Basic Geographic Concepts GEOG 370 Instructor: Christine Erlien.
GIS Data Quality.
GIS Data Structure: an Introduction
Data input 1: - Online data sources -Map scanning and digitizing GIS 4103 Spring 06 Adina Racoviteanu.
Support the spread of “good practice” in generating, managing, analysing and communicating spatial information Introduction to GIS for the Purpose of Practising.
Data Sources Sources, integration, quality, error, uncertainty.
Chapter 3 Digital Representation of Geographic Data.
How do we represent the world in a GIS database?
Support the spread of “good practice” in generating, managing, analysing and communicating spatial information Introduction to GIS for the Purpose of Practising.
May 4 th (4:00pm) Multiple choice (50 points) Short answer (50 points)
Support the spread of “good practice” in generating, managing, analysing and communicating spatial information Introduction to GIS for the Purpose of Practising.
URBDP 422 Urban and Regional Geo-Spatial Analysis Lecture 2: Spatial Data Models and Structures Lab Exercise 2: Topology January 9, 2014.
Copyright 2010, The World Bank Group. All Rights Reserved. Managing Mapping Operations Section B 1.
Accuracy Assessment Having produced a map with classification is only 50% of the work, we need to quantify how good the map is. This step is called the.
DATA QUALITY AND ERROR  Terminology, types and sources  Importance  Handling error and uncertainty.
Mapping in Surveys Uses of maps: Plan operations Facilitate data collection Presentation and analysis of results There are two main categories of maps:
GIS Data Structures How do we represent the world in a GIS database?
Exploring GIS concepts. Introduction to ArcGIS I (for ArcView 8, ArcEditor 8, and ArcInfo 8) Copyright © 2000–2003 ESRI. All rights reserved. 2-2 Organizing.
AN INTRODUCTION TO GIS SYSTEMS TAKEN AND MODIFIED FROM TEXT BY David J. Buckley Corporate GIS Solutions Manager Pacific Meridian Resources, Inc.
INTRODUCTION TO GIS  Used to describe computer facilities which are used to handle data referenced to the spatial domain.  Has the ability to inter-
Chapter 10.  Data collection workflow  Primary geographic data capture  Secondary geographic data capture  Obtaining data from external sources 
GIS September 27, Announcements Next lecture is on October 18th (read chapters 9 and 10) Next lecture is on October 18th (read chapters 9 and 10)
Definition: What is data? Data is anything in the form of 1.Charts 2.Tables 3.Text 4.Maps 5.Photos 6.Imageries, etc.
Data Entry Getting coordinates and attributes into our GIS.
Objectives: * 1. Define significant digits. * 2. Explain how to determine which digits in measurement are significant. * 3. Convert measurements in to.
Content Standards for Digital Geospatial Metadata Mandatory Legend Identification Information Data Quality Information Spatial Data Organization Information.
What is GIS? “A powerful set of tools for collecting, storing, retrieving, transforming and displaying spatial data”
1 Basic Geographic Concepts Real World  Digital Environment Data in a GIS represent a simplified view of physical entities or phenomena 1. Spatial location.
Spatial Data Models Geography is concerned with many aspects of our environment. From a GIS perspective, we can identify two aspects which are of particular.
Chapter 11: Measurement and data processing Objectives: 11.1 Uncertainty and error in measurement 11.2 Uncertainties in calculated results 11.3 Graphical.
UNIT 3 – MODULE 5: Data Input & Editing. INTRODUCTION Putting data into a computer (called data coding) is a fundamental process for virtually all GIS.
Czech Technical University in Prague Faculty of Transportation Sciences Department of Transport Telematics Pavel Hrubeš Geographical Information Systems.
Data Quality Data quality Related terms:
INTRODUCTION TO GEOGRAPHICAL INFORMATION SYSTEM
URBDP 422 URBAN AND REGIONAL GEO-SPATIAL ANALYSIS
Geographic Information Systems
Presentation transcript:

Data Quality Issues-Chapter 10 GiGo: garbage in, garbage out Quality Issues Terminology Sources, propagation, and management What is Data Quality? Overall fitness or suitability of data for a specific purpose

Errors, Accuracy, Precision, & Bias Difference between real world and GIS Could be one error or the whole thing is off Accuracy Extent in which an estimated value approaches a true value Can never get 100% accurate Precision Recorded level of detail

Errors, Accuracy, Precision, & Bias Consistent error throughout data set Human, equipment Difficult to spot The usefulness of measurement is enhanced by knowledge of its level of certainty.  Multiple measurements of the same property are like multiple shots at the same target.  The pattern of the shots tells you something about the measurement and its ability to describe the 'true' value of the property being sought.   The patterns above depict possible outcomes of different experiments to measure the same property.  Expt IV is of course the best, because it give very reproducible results (precise) and also results that are very close to the true value or bulls eye (accurate).  Experiment III is precise but not accurate.  It exhibits systematic error, which is insidiously difficult to estimate at times.

Resolution Smallest feature or data that can be displayed RasterCell size Vector-point size, line widths

Generalization Process of simplifying

Completeness & Consistency Are all instances of a feature the GIS/map claims to include, in fact, there? Simply put, how much data is missing? Logical Consistency The presence of contradictory relationships in the database Some crimes recorded at place of occurrence, others at place where report taken Data for one country is for 2000, for another its for 2001 Annual data series not taken on same day/month etc. (sometimes called lineage error) Data uses different source or estimation technique for different years (again, lineage)

Compatibility Compatibility Slope Overlay maps different scales Can not be combined Combining nominal and ratio Nominal scales distinguish one item from another, but they do not rank or quantify data. Soil Name, City Name, Polygon Identification Number Ordinal scales identify the relative magnitudes, but they do not quantify exact differences between values. Income = ( low , medium , or high) Slope = ( A , B ); where A = 0-4%, and B = 5-9% Crop

Applicability Applicability Suitability of data for commands, operations or analysis Using your GIS data collected points for a parcel fabric

Sources of Error in GIS Survey Data surveyor or instrument error choice of spheroid and datum Data encoding and entry E.g. keying or digitizing errors Remotely Sensed Data or Aerial Photography Mistakes in classification Change in time

Manual Digitizing Errors Cleaning and editing always required

Vector to Raster or Raster to Vector

Errors in Data Processing and Analysis is this data suitable for analysis? Is in a suitable format? Different datum's? Are the data sets compatible? Incompatible units? Widely different scales? Will the output mean anything?

Classification Errors

EVALUATING CURRENT DATA Most of the information captured in a GIS generally exists somewhere in the office that requires the application. Some additional data may be purchased or obtained by data sharing with other agencies. The source, accuracy, reliability, condition and scale for each document or record must be evaluated.

SOURCE The data may be in paper or map form, or it may exist in computer files on another system. Where did that information come from? What is the source of the source? Do you know how the map was compiled? Do you know who compiled the map or record? Have you spoken with the author to learn as much as possible about the data? What are the strong & weak points about the data?

Data Accuracy & Reliability There are different types of accuracy. Absolute positional accuracy refers to the measurement of map location as it relates to a real world location (For example; a GPS coordinate point). Relative positional accuracy is a measure of the relationships between the different features on the map. Relative accuracy compares the scaled distance between features measured from the map data with distances measured between the same features on the ground. The other type of accuracy deals with the content of the information in the GIS database. Are there errors or missing data? A road may have positional accuracy but have the wrong road name associated to the feature. We think of this as Reliability. Another very important aspect of reliability is how current the data sources are. If the map or record has not been properly maintained some method of bringing the document up to date must be instituted.

Data Accuracy & Reliability

MAINTENANCE OF DATA Many of the answers needed to insure proper data maintenance are flushed out in a preliminary needs and data analysis. Specifically, maintaining data involves knowing Frequency of change Quantity of change Sources of change It must be re-iterated: If data is not going to be maintained DO NOT PUT IT IN YOUR GIS.

Condition The condition of the source documents, especially maps, will determine how difficult the conversion will be. Clear mylar and ink drawings will be easier to digitize (no matter what the method) than maps of poor legibility.