Fundamentals of GIS Materials by Austin Troy © 2008 Lecture 23: Data Quality and Documentation By Austin Troy University of Vermont NR 143.

Slides:



Advertisements
Similar presentations
More Input Methods and Data Quality and Documentation
Advertisements

Metadata Workshop Frank Roberts, Coeur d’Alene Tribe GIS Bruce Godfrey, U of I, Inside Idaho Project (Funded by FGDC)
Center for Modeling & Simulation.  A Map is the most effective shorthand to show locations of objects with attributes, which can be physical or cultural.
Enhancing Data Quality of Distributive Trade Statistics Workshop for African countries on the Implementation of International Recommendations for Distributive.
Return to Outline Copyright © 2009 by Maribeth H. Price 2-1 Chapter 2 Mapping GIS Data.
Managing Error, Accuracy, and Precision In GIS. Importance of Understanding Error *Until recently, most people involved with GIS paid little attention.
Lecture 24: More on Data Quality and Metadata By Austin Troy Using GIS-- Introduction to GIS.
Geographic Information Systems
GIS Geographic Information System
Geog 458: Map Sources and Errors January 20, 2006 Data Storage and Editing.
Lecture 16: Data input 1: Digitizing and Geocoding By Austin Troy University of Vermont Using GIS-- Introduction to GIS.
Lecture 23: Data quality and documentation By Austin Troy Using GIS-- Introduction to GIS.
Fundamentals of GIS Materials by Austin Troy © 2008 Lecture 18: Data Input: Geocoding and Digitizing By Austin Troy University of Vermont NR 143.
Data Input How do I transfer the paper map data and attribute data to a format that is usable by the GIS software? Data input involves both locational.
Completeness February 27, 2006 Geog 458: Map Sources and Errors.
GIS Tutorial 1 Lecture 6 Digitizing.
Digitizing There are three primary methods for digitizing spatial information: Manual Methods include: Tablet Digitizing Heads-up Digitizing An Automated.
Spatial data quality February 10, 2006 Geog 458: Map Sources and Errors.
©2005 by Austin Troy. All rights reserved Lecture 5: Introduction to GIS Legend Visualization Lecture by Austin Troy, University of Vermont.
CSDGM: Content Standard for Digital Geospatial Metadata February 7, 2006 Geog 458: Map Sources and Errors.
Specific Types and Coordinate Systems
@ 2007 Austin Troy. Geoprocessing Introduction to GIS Geoprocessing is the processing of geographic information. – Creating new polygon features through.
Data Acquisition Lecture 8. Data Sources  Data Transfer  Getting data from the internet and importing  Data Collection  One of the most expensive.
@ 2007 Austin Troy. Geoprocessing Introduction to GIS Geoprocessing is the processing of geographic information. Perform spatial analysis and modeling.
Lecture 23: Brief Introduction to Data quality By Austin Troy Using GIS-- Introduction to GIS.
Rebecca Boger Earth and Environmental Sciences Brooklyn College.
Introduction to ArcGIS for Environmental Scientists Module 2 – GIS Fundamentals Lecture 5 – Coordinate Systems and Map Projections.
Data Quality Data quality Related terms:
Introduction to Geospatial Metadata – ISO 191** Metadata National Coastal Data Development Center A division of the National Oceanographic Data Center.
Map Scale, Resolution and Data Models. Components of a GIS Map Maps can be displayed at various scales –Scale - the relationship between the size of features.
Chapter 3 Sections 3.5 – 3.7. Vector Data Representation object-based “discrete objects”
Fundamentals of GIS Materials by Austin Troy © 2008 Lecture 18: Data Input: Geocoding and Digitizing By Austin Troy University of Vermont.
1 Introduction to Geographical Data Kris Ray Confederated Tribes of the Colville Reservation.
GIS Data Quality.
2008 EPA and Partners Metadata Training Program: 2008 CAP Project Geospatial Metadata: Introduction Module 2: FGDC CSDGM Metadata Compliancy.
Metadata and Geographical Information Systems Adrian Moss KINDS project, Manchester Metropolitan University, UK
How do we represent the world in a GIS database?
Fundamentals of GIS Materials by Austin Troy © 2006 Lecture 9: Data Quality and Input Methods By Austin Troy University of Vermont.
May 4 th (4:00pm) Multiple choice (50 points) Short answer (50 points)
URBDP 422 Urban and Regional Geo-Spatial Analysis Lecture 2: Spatial Data Models and Structures Lab Exercise 2: Topology January 9, 2014.
Fundamentals of GIS Materials by Austin Troy © 2008 Lecture 9: More Input Methods and Data Quality and Documentation By Austin Troy University of Vermont.
Data Storage and Editing (17/MAY/2010) Dr. Ahmad BinTouq URL:
Rupa Tiwari, CSci5980 Fall  Course Material Classification  GIS Encyclopedia Articles  Classification Diagram  Course – Encyclopedia Mapping.
Transitioning from FGDC CSDGM Metadata to ISO 191** Metadata
Data Creation and Editing Based in part on notes by Prof. Joseph Ferreira and Michael Flaxman Lulu Xue | Nov. 3, :A Workshop on Geographical.
2008 EPA and Partners Metadata Training Program: 2008 CAP Project Geospatial Metadata: Introduction Module 1: Introduction & Overview of the FGDC CSDGM.
NR 143 Study Overview: part 2 By Austin Troy University of Vermont Using GIS-- Introduction to GIS.
URBDP 422 URBAN AND REGIONAL GEO-SPATIAL ANALYSIS Lecture 3: Building a GeoDatabase; Projections Lab Session: Exercise 3: vector analysis Jan 14, 2014.
NR 143 Study Overview: part 1 By Austin Troy University of Vermont Using GIS-- Introduction to GIS.
1 Overview Finding and importing data sets –Searching for data –Importing data_.
The FGDC and Metadata. To maintain an organization's internal investment in geospatial data To provide information about an organization's data holdings.
INTRODUCTION TO GIS  Used to describe computer facilities which are used to handle data referenced to the spatial domain.  Has the ability to inter-
When you begin a project, a reference data layer is placed on the map first. This initial layer(s) is called the base map. There are different types of.
GIS September 27, Announcements Next lecture is on October 18th (read chapters 9 and 10) Next lecture is on October 18th (read chapters 9 and 10)
ESRI Education User Conference – July 6-8, 2001 ESRI Education User Conference – July 6-8, 2001 Introducing ArcCatalog: Tools for Metadata and Data Management.
ISO 191** Overview A “Family” of Standards. Resources ISO Standards Web Page – Technical.
Getting Familiar with Metadata Laurie Porth Rocky Mountain Research Station Audience: Scientists/researchers who have heard of metadata and now need to.
Content Standards for Digital Geospatial Metadata Mandatory Legend Identification Information Data Quality Information Spatial Data Organization Information.
Spatial Data Models Geography is concerned with many aspects of our environment. From a GIS perspective, we can identify two aspects which are of particular.
CENTENNIAL COLLEGE SCHOOL OF ENGINEERING & APPLIED SCIENCE VS 361 Introduction to GIS ERROR, ACCURACY & PRECISION COURSE NOTES 1.
1 National Standard for Spatial Data Accuracy Missoula GIS Coffee Talk and MT GPS Users Group Julie Binder Maitra March 17, 2006.
Demystifying Metadata Examining and Understanding the Elements of the Content Standards for Geospatial Metadata.
Geog. 377: Introduction to GIS - Lecture 16 Overheads 1 5. Metadata 6. Summary of Database Creation 7. Data Standards 8. NSDI Topics Lecture 16: GIS Database.
Geocoding Chapter 16 GISV431 &GEN405 Dr W Britz. Georeferencing, Transformations and Geocoding Georeferencing is the aligning of geographic data to a.
Geocoding Chapter 16 GISV431 &GEN405 Dr W Britz. Georeferencing, Transformations and Geocoding Georeferencing is the aligning of geographic data to a.
Data Quality Data quality Related terms:
GEOGRAPHICAL INFORMATION SYSTEM
Presented by Sharon Shin, FGDC Developed by Lynda Wayne, GeoMaxim-FGDC
Data Queries Raster & Vector Data Models
URBDP 422 URBAN AND REGIONAL GEO-SPATIAL ANALYSIS
Presentation transcript:

Fundamentals of GIS Materials by Austin Troy © 2008 Lecture 23: Data Quality and Documentation By Austin Troy University of Vermont NR 143

Fundamentals of GIS Materials by Austin Troy © 2008 Data Quality Accuracy + Precision = Quality Error = fn(accuracy, precision) Cost vs. quality tradeoff

Fundamentals of GIS Scale? Too little detail Too much detail? Materials by Austin Troy © 2008 Image source: Esri

Fundamentals of GIS Materials by Austin Troy © 2008 Accuracy “the degree to which information on a map or in a digital database matches true or accepted values.” From Kenneth E. Foote and Donald J. Huebner Reflection of how close a measurement represent the actual quantity measured and of the number and severity of errors in a dataset or map. Image source:

Fundamentals of GIS Materials by Austin Troy © 2008 Precision Hence, a measurement down to a fraction of a cm is more precise than a measurement to a cm However, data with a high level of precision can still be inaccurate—this is due to errors Each application requires a different level of precision Intensity or level of preciseness, or exactitude in measurements. The more precise a measurement is, the smaller the unit which you intend to measure

Fundamentals of GIS Materials by Austin Troy © 2008 Random and Systematic error Error can be systematic or random Systematic error can be rectified if discovered, because its source is understood Image source:

Fundamentals of GIS Materials by Austin Troy © 2008 Random and Systematic error Systematic errors affect accuracy, but are usually independent of precision; data can use highly precise methods but still be inaccurate due to systematic error Accurate and precise: no systematic, little random error inaccurate and precise: little random error but significant systematic error Accurate and imprecise: no systematic, but considerable random error inaccurate and imprecise: both types of error

Fundamentals of GIS Materials by Austin Troy © 2008 Measurement of Accuracy Positional accuracy is often stated as a confidence interval: e.g cm +/-.01 means true value lies between and

Fundamentals of GIS Materials by Austin Troy © 2008 Measurement of Accuracy One of the key measurements of positional accuracy is root mean squared error (MSE); equals squared difference between observed and expected value for observation i divided by total number of observations, summed across each observation i This is just a standardized measure of error—how close the predicted measure is to observed

Fundamentals of GIS Materials by Austin Troy © 2008 Positional Accuracy Positional accuracy standards specify that acceptable positional error varies with scale Data can have high level of precision but still be positionally inaccurate Positional error is inversely related to precision and to amount of processing positional error precision positional error processing effort

Fundamentals of GIS Materials by Austin Troy © 2008 Accuracy is tied to scale

Fundamentals of GIS Materials by Austin Troy © 2008 Positional Error Standards Different agencies have different standards for positional error Example: USGS horizontal positional requirements state that 90% of all points must be within 1/30th of an inch for maps at a scale of 1:20,000 or larger, and 1/50th of an inch for maps at scales smaller than 1:20,000

Fundamentals of GIS Materials by Austin Troy © 2008 Positional Error Standards USGS Accuracy standards on the ground: 1:4,800 ± feet 1:10,000 ± feet 1:12,000 ± feet 1:24,000 ± feet 1:63,360 ± feet 1:100,000 ± feet Hence, a point on a map represents the center of a spatial probability distribution of its possible locationsprobability distribution Thanks to Kenneth E. Foote and Donald J. Huebner, The Geographer's Craft Project, Department of Geography, The University of Colorado at Boulder

Fundamentals of GIS Materials by Austin Troy © 2008 Attribute Accuracy Attribute accuracy and precision refer to quality of non-spatial, attribute data Precision for numeric data means lots of digits Quantitative measurement errors: e.g. truncation A common error is to measure a phenomenon in only one phase of a temporal cycle:  Bird counts  River flows  Average weather metrics  Soil moisture $ vs. $1200

Fundamentals of GIS Materials by Austin Troy © 2008 Categorical Attributes Accuracy refers to amount of misclassification of categorical data The chance for misclassification grows as number of possible classes increases; accuracy is a function of precision, or number of classes If just classifying as “land and water”, that is not very precise, and not likely to result in an error

Fundamentals of GIS Materials by Austin Troy © 2008 Other measures of data quality Logical consistency Completeness Data currency/timeliness Accessibility These apply to both attribute and positional data Image source:

Fundamentals of GIS Materials by Austin Troy © 2008 Example of Currency and Timeliness

Fundamentals of GIS Shelburne Road: 1937Shelburne Road: 2003

Fundamentals of GIS New North End and Colchester: 2003 New North End and Colchester: Historic

Fundamentals of GIS Materials by Austin Troy © 2008 Some common sources of error Numerical processing (math operations, data type, rounding, etc) Geocoding (e.g. rural address matching and street interpolation) Topological errors from digitizing (overshoots, dangling nodes, slivers, etc) Automated classification steps, like unsupervised or supervised land cover classification in remote sensing, can result in processing errors

Fundamentals of GIS Materials by Austin Troy © 2008 Error propagation and cascading Propagation: where one error leads to another (single step) Cascading: Refers to when errors are allowed to propagate unchecked from one layer to the next and on to the final set of products or recommendations (multi-step) Cascading error can be managed to a certain extent by conducting “sensitivity analysis” Image source:

Fundamentals of GIS Materials by Austin Troy © 2008 Conflation When one layer is better in one way and another is better in another and you wish to get the best of both Way of reconciling best geometric and attribute features from two layers into a new one Very commonly used for case where one layer has better attribute accuracy or completeness and another has better geometric accuracy or resolution Also used where newer layer is produced for some theme but is has lower resolution than older one

Fundamentals of GIS Materials by Austin Troy © 2008 Two general types of Conflation Attribute conflation: transferring attributes from an attribute rich layer to features in an attribute poor layer Feature conflation: improvement of features in one layer based on coordinates and shapes in another, often called rubber sheeting. User either transforms all features or specifies certain features to be kept fixed

Fundamentals of GIS Materials by Austin Troy © 2008 Conflation example Source: Stanley Dalal, GIS cafe

Fundamentals of GIS Materials by Austin Troy © 2008 Documentation and Metadata To avoid many of these errors, good documentation of source data is needed Metadata is data documentation, or “data about data” Ideally, the metadata describes the data according to federally recognized standards of accuracystandards of accuracy Almost all state, local and federal agencies are required to provide metadata with geodata they make

Fundamentals of GIS Materials by Austin Troy © 2008 Documentation and Metadata The federal geographic data committee (FGDC) is a federal entity that developed a “Content Standard for Digital Geospatial Metadata” in 1998, which is a model for all spatial data users to followFGDC Purpose is: “to provide a common set of terminology and definitions for the documentation of digital geospatial data.” All federal agencies are required to use these standards

Fundamentals of GIS Materials by Austin Troy © 2008 Documentation and Metadata Some roles/purposes of metadata: 1.Information retrieval, cataloguing, querying and searching for data electronically. 2.Describing fitness for use (applicability) and documenting the usability and quality of data. 3.Describing how to transfer, access or process data 4.Documenting all relevant characteristics of data needed to use it 5.Data permanence; creates institutional memory; advertises an organization’s research (generate partnerships)

Fundamentals of GIS Materials by Austin Troy © 2008 Documentation and Metadata Metadata usually include sections similar to these

Fundamentals of GIS Materials by Austin Troy © 2008 Documentation and Metadata Critical components usually break down into: 1. Dataset identification, overview 2. Data quality 3. Spatial reference information 4. Data definition 5. Administrative information (distribution) 6. Meta-metadata

Fundamentals of GIS Materials by Austin Troy © 2008 Documentation and Metadata 1. Data identification, overview & administrative info: General info: name and brief ID of dataset and owner organization, geographic domain, general description/summary of content, data model used to represent spatial features, intent of production, language used, reference to more detailed documents, if applicable Constraints on access and use This is usually where info on currency is found

Fundamentals of GIS Materials by Austin Troy © 2008 Documentation and Metadata 2. Data quality should address: Positional accuracy Attribute accuracy Logical consistency Completeness Lineage Processing steps

Fundamentals of GIS Materials by Austin Troy © 2008 Documentation and Metadata 3. Spatial reference should include: horizontal coordinate system (e.g. State Plane) Includes projection used, scale factors, longitude of central meridian, latitude of projection origin, distance units Geodetic model (e.g. NAD 83), ellipsoid, semi-major axis

Fundamentals of GIS Materials by Austin Troy © 2008 Documentation and Metadata 4. Data definition, also known as “Entity and Attribute Information,” should include: Entity types (e.g. point, polygon, raster) Information about each attribute, including label, definition, domain of values Sometimes will include a data dictionary, or description of attribute codes, while sometimes it will reference a document with those codes if they are too long and complex

Fundamentals of GIS Materials by Austin Troy © 2008 Documentation and Metadata 5. Data distribution info usually includes: Name, address, phone, of contact person and organization Liability information Ordering information, including online and ordering by other media; usually includes fees

Fundamentals of GIS Materials by Austin Troy © 2008 Documentation and Metadata 6. Metadata reference, or meta-metadata This is data about the metadata Contains information on When metadata updated Who made it What standard was used What constraints apply to the metadata

Fundamentals of GIS Materials by Austin Troy © 2008 Metadata in ArcGIS Arc GIS allows you to display, import and export metadata in and to a variety of Metadata formats: It defaults to FGDC ESRI which looks like:

Fundamentals of GIS Materials by Austin Troy © 2008 Metadata in ArcGIS XML is the most flexible form because its tag structure allows it to be used in programming; tags can be called as variables or can be created through form interfaces; allows for compatibility across platforms and programs

Fundamentals of GIS Materials by Austin Troy © 2008 Metadata in ArcGIS In the past, complete metadata was only available as text; you had to create most embedded metadata tags yourself. Today many state and nationwide datasets come with complete embedded metadata including full attribute codes E.g. NEDs, NLCD, all VCGI data

Fundamentals of GIS Materials by Austin Troy © 2008 Metadata in ArcGIS Can edit, import, and export metadata in multiple formats allowing help with proper sharing of data. Can also make templates to save time in repeat documentation of big data sets See NPS metadata extension for cool utilities