UNSD-CELADE Regional Workshop on Census Cartography for the 2010 Latin America’s census round Constructing an EA-level Database for the Census.

Presentation transcript:

UNSD-CELADE Regional Workshop on Census Cartography for the 2010 Latin America’s census round Constructing an EA-level Database for the Census

Overview
- Stages in the geographic database development
  - Sources of geographic information
  - Data conversion
  - Data integration
- Implementation of the database
- Conclusion

Stages in the geographic database development
- Geographic data sources for EA delineation
  - Inventory of existing data sources
  - Additional geographic data collection
- Geographic data conversion
  - Digitizing/scanning + raster-to-vector conversion
  - Editing geographic features
  - Constructing and maintaining topology for geographic features
- Data integration
  - Georeferencing/coding
  - Combining and integrating/additional delineation of EA boundaries
- Parallel activity
  - Develop geographic attribute database
  - Metadata development

Sources of geographic information
- Identify existing data sources
- Additional geographic data collection
- Paper maps, existing printed air photos and satellite imagery
- Field mapping products such as sketch maps
- Digital air photos and satellite images
- GPS coordinate collection
- Existing digital maps

Why data inventory?
- Geographic data: labor intensive, tedious and error-prone
- Up to 70% of GIS projects
- Identify existing data sources

Geographic data conversion
- Data conversion (also called data automation) is the process of converting features that are visible on a hardcopy map into digital point, line, polygon and attribute information.
- The best strategy for data conversion depends on many factors, including data availability and time and resource constraints.

Data conversion
- Paper maps, existing printed air photos and satellite imagery
- Field mapping products such as sketch maps
- Digital air photos and satellite images
- Digitizing
- Scanning
- Raster-to-vector conversion

Geographic data conversion
- Two main approaches for converting information on hardcopy maps to digital data:
  - Scanning
  - Digitizing

Scanning
- Scanning has arguably overtaken digitizing as the main method of spatial data input, mainly because large-format feed scanners and interactive vectorization software can automate some of the tedious data-input steps.
- The scanning process produces a raster image of the original map, which can be stored in a standard image format such as GIF or TIFF.
- After georeferencing, the image can be displayed in GIS packages as a backdrop to existing vector data.

Advantages and disadvantages of scanning
Advantages
- Scanned maps can be used as image backdrops for vector information.
- Clear base maps or original color separations can be vectorized relatively easily using raster-to-vector conversion software.
- Small-format scanners are relatively inexpensive and provide quick data capture.
Disadvantages
- Converting large maps with small-format scanners requires tedious re-assembly of the individual parts.
- Scanning large volumes of hard-copy maps will present challenges for file storage on many desktop computer systems.
- Despite recent advances in vectorization software, considerable manual editing and attribute labeling may still be required.

Raster-to-vector conversion
- Since the end result of the conversion process is a digital geographic database of points and lines, the scanned information contained in the raster images needs to be converted into coordinate information.
(Diagram labels: Digital air photos and satellite images; Scanning; Raster-to-vector conversion)

Digitizing
- Manual digitizing
  - Digitizing is often tedious and tiring for the operators
- Heads-up digitizing (old and new method)
  - Old method: the operator traced map features onto a transparency and attached it to the computer screen
  - New method: a scanned map image is displayed on screen and its outlines are traced into a GIS layer

Heads-up digitizing II
- The operator uses a raster-scanned image on the computer screen (a scanned map, air photo or satellite image) as a backdrop.
- The operator follows lines on-screen in vector mode.

Advantages and disadvantages of digitizing
Advantages
- Digitizing is easy to learn and thus does not require expensive skilled labor.
- Attribute information can be added during the digitizing process.
- High accuracy can be achieved through manual digitizing; i.e., there is usually no loss of accuracy compared to the source map.
Disadvantages
- Digitizing is tedious, possibly leading to operator fatigue and resulting quality problems which may require considerable post-processing.
- Manual digitizing is quite slow.
- In contrast to primary data collection using GPS or aerial photography, the accuracy of digitized maps is limited by the quality of the source material.

Editing and building topology
- Paper maps, existing printed air photos and satellite imagery
- Field mapping products such as sketch maps
- Digital air photos and satellite images
- GPS coordinate collection
- Existing digital maps
- Digitizing
- Scanning
- Raster-to-vector conversion
- Editing geographic features
- Construct topology for geographic features
- Generate lines and polygons

Editing
- Manual digitizing is error-prone.
- The objective is to produce an accurate representation of the original map data.
- This means that all lines that connect on the map must also connect in the digital database.
- There should be no missing features and no duplicate lines.
- Fix the most common types of errors: reconnect disconnected line segments, etc.

Fixing errors
- Some of the common digitizing errors shown in the figure can be avoided by using the digitizing software's user-defined snap tolerances.
- For example, the user might specify that any endpoint of a line that is closer than 1 mm to another line will automatically be connected (snapped) to that line.
- Small sliver polygons that are created when a line is digitized twice can also be removed automatically.
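As a rough illustration of the snap tolerance idea, the Python sketch below moves a digitized endpoint onto a nearby line segment when it lies closer than the tolerance. The function name `snap_to_segment` and the sample coordinates are assumptions made for this example; real digitizing packages implement snapping in their own ways.

```python
import math


def snap_to_segment(point, seg_start, seg_end, tolerance):
    """Return the nearest point on the segment if `point` is closer than
    `tolerance` to it; otherwise return `point` unchanged."""
    px, py = point
    ax, ay = seg_start
    bx, by = seg_end
    dx, dy = bx - ax, by - ay
    length_sq = dx * dx + dy * dy
    if length_sq == 0:
        # Degenerate segment: both ends coincide.
        nearest = (float(ax), float(ay))
    else:
        # Project the point onto the segment and clamp to its two ends.
        t = ((px - ax) * dx + (py - ay) * dy) / length_sq
        t = max(0.0, min(1.0, t))
        nearest = (ax + t * dx, ay + t * dy)
    if math.dist(point, nearest) < tolerance:
        return nearest
    return point


# Hypothetical values in map millimetres: an undershoot of 0.5 mm is closed,
# while an endpoint 2 mm away is left alone.
snapped = snap_to_segment((5.0, 0.5), (0.0, 0.0), (10.0, 0.0), tolerance=1.0)
untouched = snap_to_segment((5.0, 2.0), (0.0, 0.0), (10.0, 0.0), tolerance=1.0)
```

A tolerance that is too large would merge features that are genuinely separate, so the value is normally chosen with the map scale in mind.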

Topology
- A data structure in which each point, line and piece or whole of a polygon:
  - “knows” where it is
  - “knows” what is around it
  - “understands” its environment
  - “knows” how to get around
- Helps answer the question: what is where?

Example of “spaghetti” data structure

Poly  Coordinates
A     (1,4), (1,6), (6,6), (6,4), (4,4), (1,4)
B     (1,4), (4,4), (4,1), (1,1), (1,4)
C     (4,4), (6,4), (6,1), (4,1), (4,4)

(Figure: polygons A, B and C)
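The spaghetti structure can be sketched in Python as a set of independent coordinate lists. The coordinates are those of the slide; the helper names `edges` and `shared_edges` are assumptions for illustration. The sketch shows that every boundary between two polygons is stored twice, once in each polygon.

```python
# Polygons stored as independent closed rings, copied from the slide.
polygons = {
    "A": [(1, 4), (1, 6), (6, 6), (6, 4), (4, 4), (1, 4)],
    "B": [(1, 4), (4, 4), (4, 1), (1, 1), (1, 4)],
    "C": [(4, 4), (6, 4), (6, 1), (4, 1), (4, 4)],
}


def edges(ring):
    """Return the undirected edges of a closed ring as frozensets of endpoints."""
    return {frozenset(pair) for pair in zip(ring, ring[1:])}


def shared_edges(polys):
    """Find edges that appear in more than one polygon's coordinate list."""
    names = sorted(polys)
    result = {}
    for i, first in enumerate(names):
        for second in names[i + 1:]:
            common = edges(polys[first]) & edges(polys[second])
            if common:
                result[(first, second)] = common
    return result


duplicates = shared_edges(polygons)
for pair, common in sorted(duplicates.items()):
    print(pair, [sorted(edge) for edge in common])
```

The comparison only works here because the shared boundaries use identical vertices in both polygons. If one copy were digitized slightly differently, the two boundaries would no longer match, which is how gaps and slivers arise in this kind of structure.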

Example of topological data structure

Node  X  Y  Lines
I     1  4  1,2,4
II    4  4  4,5,6
III   6  4  1,3,5
IV    4  1  2,3,6

Poly  Lines
A     1,4,5
B     2,4,6
C     3,5,6

Line  From node  To node  Left poly  Right poly
1     I          III      O          A
2     I          IV       B          O
3     III        IV       O          C
4     I          II       A          B
5     II         III      A          C
6     II         IV       C          B

O = “outside” polygon

(Figure: polygons A, B and C with nodes I to IV)
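A minimal Python sketch of the line table shows how neighbourhood questions can be answered from topology alone, without touching any coordinates. The table values come from the slide; the function names `adjacency` and `polygon_lines` are assumptions for this example.

```python
# line id -> (from node, to node, left polygon, right polygon); "O" = outside
lines = {
    1: ("I", "III", "O", "A"),
    2: ("I", "IV", "B", "O"),
    3: ("III", "IV", "O", "C"),
    4: ("I", "II", "A", "B"),
    5: ("II", "III", "A", "C"),
    6: ("II", "IV", "C", "B"),
}


def adjacency(line_table):
    """Two polygons are neighbours if some line has one on each side."""
    neighbours = {}
    for _from, _to, left, right in line_table.values():
        if "O" in (left, right):
            continue  # outer boundary, no neighbouring polygon
        neighbours.setdefault(left, set()).add(right)
        neighbours.setdefault(right, set()).add(left)
    return neighbours


def polygon_lines(line_table):
    """Rebuild the polygon-to-lines table from the left/right columns."""
    result = {}
    for line_id, (_from, _to, left, right) in line_table.items():
        for poly in (left, right):
            if poly != "O":
                result.setdefault(poly, []).append(line_id)
    return result


print(adjacency(lines))
print(polygon_lines(lines))
```

Because the polygon table can be rebuilt from the line table, a mismatch between the two is one simple signal that the topology is inconsistent.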

Constructing and maintaining topology (cont.)
- Storing the topological information facilitates analysis, since many GIS operations do not actually require coordinate information but are based only on topology.
- The user typically does not have to worry about how the GIS stores topological information. How this is actually done is software-specific.
- Building topology thus also acts as a test of database integrity.

Digital data integration
- Construct topology for geographic features
- Existing digital maps
- Georeferencing (coordinate transformation and projection change)
- Coding (labeling) of digital geographic features
- Combine and integrate attribute data

Integrating data
- Georeferencing
  - Converting map coordinates to the real-world coordinates of the source map's cartographic projection (if this was not already done at the digitizing stage)
- Coding
  - Attaching codes to the digitized features
- Integrating attribute data
  - Spreadsheets, links to external databases
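One common way to express the georeferencing step for a scanned map is a six-parameter affine transformation from image (column, row) positions to projected coordinates. The sketch below assumes such a transformation is already known; the function name `pixel_to_world` and all parameter values are hypothetical, and in practice the parameters would be estimated from control points.

```python
def pixel_to_world(col, row, a, b, c, d, e, f):
    """Apply an affine transformation:
        x = a * col + b * row + c
        y = d * col + e * row + f
    where (c, f) is the real-world position of the image origin."""
    x = a * col + b * row + c
    y = d * col + e * row + f
    return x, y


# Hypothetical north-up image: 2 m pixels, no rotation (b = d = 0), and a
# negative e because image rows increase downwards while northings increase
# upwards. The origin values are illustrative projected coordinates in metres.
params = dict(a=2.0, b=0.0, c=450_000.0, d=0.0, e=-2.0, f=4_500_000.0)

easting, northing = pixel_to_world(10, 5, **params)
print(easting, northing)
```

A projection change (for example between two national grids) is a separate, non-linear step that is normally delegated to a projection library rather than written by hand.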

Integrating attribute data
- After the completed digital database has been verified to be error-free, the final step is to add additional attributes.
- These can be linked to the database permanently, or the additional information about each database feature can be stored in separate files which are linked to the geographic database as needed.
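Linking a separately stored attribute file to the geographic features amounts to a join on a shared code. The Python sketch below illustrates this with invented records; the field name `ea_code`, the sample values and the function `link_attributes` are all assumptions for the example.

```python
# Boundary features carry only a geographic code and a geometry reference.
boundary_features = [
    {"ea_code": "010101", "geometry": "polygon-1"},
    {"ea_code": "010102", "geometry": "polygon-2"},
    {"ea_code": "010103", "geometry": "polygon-3"},
]

# Attributes kept in a separate table, e.g. exported from a spreadsheet.
attribute_rows = [
    {"ea_code": "010101", "households": 120},
    {"ea_code": "010102", "households": 95},
]


def link_attributes(features, rows, key="ea_code"):
    """Join attribute rows to features on `key`; report codes with no match."""
    lookup = {row[key]: row for row in rows}
    linked, unmatched = [], []
    for feature in features:
        row = lookup.get(feature[key])
        if row is None:
            unmatched.append(feature[key])
        else:
            linked.append({**feature, **row})
    return linked, unmatched


linked, unmatched = link_attributes(boundary_features, attribute_rows)
print(len(linked), "features linked; unmatched codes:", unmatched)
```

Reporting the unmatched codes is a cheap consistency check: every feature without attributes, or attribute row without a feature, points to a coding error.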

Implementation of an EA database
- All large operational GISs are built on geodatabases.
- The geodatabase is arguably the most important part of the GIS.
- Geodatabases form the basis for all queries, analysis and decision-making.
- A database management system (DBMS) is the software used to store and manage databases.

Definition of database content (data modeling)
- Once the scope of census geographic activities has been determined, the census office needs to define and document the structure of the geographic databases in more detail.
- This process is sometimes termed data modeling and involves the definition of the geographic features to be included in the database, their attributes and their relationships to other features.
- The resulting output is a detailed data dictionary that guides the database development process and also serves as documentation in later stages.

Several types of data organization
- Varieties of relational database and geodatabase structure
- Database management systems (DBMSs) can be divided into various types, including:
  - Relational
  - Object
  - Object-relational

Example: the relational database model
- The relational database model is used to store, retrieve and manipulate tables of data that refer to the geographic features in the coordinate database.
- It is based on the entity-relationship model.
- In a geographic context, an entity can be an administrative or census unit, or any other spatial feature for which characteristics will be compiled.

Entity-relationship
- Example: the EA entity can be linked to the entity crew leader area. The table for this entity could have attributes such as the name of the crew leader, the regional office responsible, contact information, and the crew leader code (CL code) as primary key, which is also present in the EA entity.
(Diagram: Crew leader area [CL-code, Name, RO responsible], 1-N; relationship R; EA [EA-code, Area, Pop.], 1-1)
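The crew leader area and EA entities can be sketched as two relational tables using Python's built-in `sqlite3` module. Table names, column names and the sample rows are assumptions for illustration; the slide only names the attributes. The CL code is the primary key of the crew leader table and appears in the EA table as a foreign key, giving the one-to-many link.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite does not enforce FKs by default
conn.executescript(
    """
    CREATE TABLE crew_leader_area (
        cl_code        TEXT PRIMARY KEY,
        name           TEXT NOT NULL,
        ro_responsible TEXT
    );
    CREATE TABLE ea (
        ea_code    TEXT PRIMARY KEY,
        area_km2   REAL,
        population INTEGER,
        cl_code    TEXT NOT NULL REFERENCES crew_leader_area (cl_code)
    );
    """
)

# Invented sample data.
conn.executemany(
    "INSERT INTO crew_leader_area VALUES (?, ?, ?)",
    [("CL01", "A. Example", "Regional office 1"),
     ("CL02", "B. Example", "Regional office 1")],
)
conn.executemany(
    "INSERT INTO ea VALUES (?, ?, ?, ?)",
    [("0001", 1.2, 500, "CL01"),
     ("0002", 0.8, 650, "CL01"),
     ("0003", 2.5, 420, "CL02")],
)

# One crew leader area covers many EAs: count EAs and sum population per area.
summary = conn.execute(
    """
    SELECT c.name, COUNT(*), SUM(e.population)
    FROM crew_leader_area AS c
    JOIN ea AS e ON e.cl_code = c.cl_code
    GROUP BY c.cl_code, c.name
    ORDER BY c.cl_code
    """
).fetchall()
print(summary)
```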

Implementation of an EA database: example of an entity table (enumeration area)

Example: census GIS database
- Basic elements
  - Entity: administrative or census units (enumeration areas)
  - Entity type / relations
- Components of a digital spatial census database:
  - Boundary database
  - Geographic attribute tables
  - Census data tables

Components of a digital spatial census database

Data dictionary
- Definition: a data catalog that describes the contents of a database. Information is listed about each field in the attribute table and about the format, definitions and structures of the attribute tables.
- A data dictionary is an essential component of metadata information.
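A data dictionary can itself be kept in machine-readable form so that records are checked against it. The Python sketch below is one possible shape; the field names, descriptions and the `validate` helper are hypothetical and not taken from any census office's actual dictionary.

```python
# One entry per field of a (hypothetical) EA attribute table.
data_dictionary = {
    "ea_code": {
        "type": str,
        "description": "Unique enumeration area code",
        "required": True,
    },
    "area_km2": {
        "type": float,
        "description": "Area of the EA in square kilometres",
        "required": True,
    },
    "population": {
        "type": int,
        "description": "Estimated population of the EA",
        "required": False,
    },
}


def validate(record, dictionary):
    """Return a list of problems found when checking a record against the dictionary."""
    problems = []
    for field, spec in dictionary.items():
        if field not in record:
            if spec["required"]:
                problems.append(f"missing field: {field}")
            continue
        if not isinstance(record[field], spec["type"]):
            problems.append(f"wrong type for {field}")
    for field in record:
        if field not in dictionary:
            problems.append(f"undocumented field: {field}")
    return problems


good = validate({"ea_code": "010101", "area_km2": 2.5, "population": 480}, data_dictionary)
bad = validate({"area_km2": "2.5", "pop": 1}, data_dictionary)
print(good, bad)
```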

Spatial analysis: query
- Select features by their attributes: “find all districts with literacy rates < 60%”
- Select features by geographic relationships: “find all family planning clinics within this district”
- Combined attribute/geographic queries: “find all villages within 10 km of a health facility that have high child mortality”
- Query operations are based on the SQL (Structured Query Language) concept.
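The attribute query from the first bullet can be written directly in SQL. The sketch below runs it with Python's built-in `sqlite3` module against an invented `districts` table; the table, its columns and the literacy figures are assumptions for illustration only.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE districts (name TEXT PRIMARY KEY, literacy_rate REAL)")
conn.executemany(
    "INSERT INTO districts VALUES (?, ?)",
    [("North", 72.5), ("South", 55.0), ("East", 59.9), ("West", 60.0)],
)

# "Find all districts with literacy rates < 60%"
threshold = 60.0
low_literacy = [
    name
    for (name,) in conn.execute(
        "SELECT name FROM districts WHERE literacy_rate < ? ORDER BY name",
        (threshold,),
    )
]
print(low_literacy)
```

Plain SQLite has no geographic predicates, so the second and third kinds of query need either a spatial database extension or geometric tests done in the application.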

Spatial analysis (cont.)
- Buffer: find all settlements that are more than 10 km from a health clinic
- Point-in-polygon operations: identify the vegetation zone into which each village falls
- Polygon overlay: combine administrative records with health district data
- Network operations: find the shortest route from village to hospital
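Two of these operations are simple enough to sketch in plain Python: a point-in-polygon test (here the standard ray-casting method) and a buffer-style distance check. All names and coordinates are invented, and coordinates are assumed to be projected and in metres so that straight-line distances are meaningful.

```python
import math


def point_in_polygon(x, y, polygon):
    """Ray casting: count how often a ray going right from (x, y) crosses the
    polygon boundary; an odd count means the point is inside."""
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        if (y1 > y) != (y2 > y):  # edge spans the ray's height
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def beyond_buffer(settlements, clinics, distance_m):
    """Names of settlements farther than `distance_m` from every clinic."""
    return [
        name
        for name, position in settlements.items()
        if all(math.dist(position, clinic) > distance_m for clinic in clinics)
    ]


# Hypothetical data.
zone = [(0, 0), (4, 0), (4, 4), (0, 4)]
clinics = [(0, 0)]
settlements = {"Alpha": (3000, 4000), "Beta": (12000, 0)}

print(point_in_polygon(2, 2, zone), point_in_polygon(5, 2, zone))
print(beyond_buffer(settlements, clinics, 10_000))
```

Points lying exactly on a boundary are not handled specially in this sketch; GIS packages define explicit rules for that case.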

Illustration

Summary
- Data conversion
  - Conversion of hard-copy maps to digital maps
  - Digitizing
  - Scanning
  - Editing
  - Building topology
- Data integration
  - Georeferencing
  - Projection change
  - Coding
  - Integration of attribute data

Thank you!

An example of land parcels

The E/R diagram for land parcels
Entities
- STREET (name)
- PARCEL (number)
- SEGMENT (number)
- POINT (number; x,y)
- LANDOWNER (name, date of birth)
Relationships
- A: streets have edges (segments)
- B: parcels have boundaries (segments)
- C: segments have two endpoints
- D: parcels have owners, and people own land
(Cardinality labels in the diagram: 2-N, 3-N, 2-N, 1-N)

Data tables

Inventory of existing sources
- National mapping agency (often the lead agency in the country)
- Military mapping services
- Province, district and municipal governments (transportation, social services, utility services and planning-relevant information)
- Various government and private organizations dealing with spatial data: geological or hydrological survey, environmental protection authority, utility and communication sector companies
- Donor activities

Implementation of an EA database
- Geographic databases (hereafter referred to as geodatabases) are more than spreadsheets.
- Entity types can be defined as having specific properties that govern behavior in the real world.
- The EA as a geographic unit is a kind of object whose function is to delineate territory for the census canvassing operation.
- Morphologically, the EA is contiguous, it nests within administrative units, and it is composed of population-based units.

Definition of database content (data modeling)
- Many national and international agencies have already been active in developing generic data models for spatial information as part of a national spatial data infrastructure (NSDI).
- Often, a census office will be able to simply adapt an NSDI standard to the specific needs of statistical data collection.
- In cases where such information is unavailable, a data model needs to be developed in-house.
- Templates from mapping or statistical agencies in other countries will provide a useful reference for that purpose.