Data Input and GIS refer to Chapter 4 Data input, verification, storage, and output Text Book Burrough, P. A. and R. A. McDonnell, 1998. Principles.

Slides:



Advertisements
Similar presentations
Starting from Text Files. User Interface Graphic user interface –Mouse, buttons, windows, menus Commands-driven interface –Commands, parameters.
Advertisements

Copyright, © Qiming Zhou GEOG1150. Cartography Data Models for Computer Cartography.
Lecture 6 Data entry. Getting the Map into the Computer Get data in finished form Analog-to-Digital maps Digitizing Data Entry Editing and validation.
From portions of Chapter 8, 9, 10, &11. Real world is complex. GIS is used model reality. The GIS models then enable us to ask questions of the data by.
Raster Based GIS Analysis
Vector-Based GIS Data Processing Chapter 6. Vector Data Model Feature Classes points lines polygons Layers limited to one class of data Figure p. 186.
Maps as Numbers Getting Started with GIS Chapter 3.
Cartographic and GIS Data Structures
Geographic Information Systems and Science SECOND EDITION Paul A. Longley, Michael F. Goodchild, David J. Maguire, David W. Rhind © 2005 John Wiley and.
GIS Geographic Information System
CS 128/ES Lecture 5a1 Raster Formats (II). CS 128/ES Lecture 5a2 Spatial modeling in raster format  Basic entity is the cell  Region represented.
Geog 458: Map Sources and Errors January 20, 2006 Data Storage and Editing.
Lecture 16: Data input 1: Digitizing and Geocoding By Austin Troy University of Vermont Using GIS-- Introduction to GIS.
Fundamentals of GIS Materials by Austin Troy © 2008 Lecture 18: Data Input: Geocoding and Digitizing By Austin Troy University of Vermont NR 143.
Data Input How do I transfer the paper map data and attribute data to a format that is usable by the GIS software? Data input involves both locational.
GIS Tutorial 1 Lecture 6 Digitizing.
Digitizing There are three primary methods for digitizing spatial information: Manual Methods include: Tablet Digitizing Heads-up Digitizing An Automated.
Geography 241 – GIS I Dr. Patrick McHaffie Associate Professor Department of Geography Cook County, % population < 5.
GI Systems and Science January 23, Points to Cover  What is spatial data modeling?  Entity definition  Topology  Spatial data models Raster.
9. GIS Data Collection.
Data Acquisition Lecture 8. Data Sources  Data Transfer  Getting data from the internet and importing  Data Collection  One of the most expensive.
Database Design IST 7-10 Presented by Miss Egan and Miss Richards.
Digitizing and Scanning. Primary Data Sources Measurements Field Lab Remotely sensed data already secondary? Creating geometries Definitely in the realm.
Applied Cartography and Introduction to GIS GEOG 2017 EL Lecture-3 Chapters 5 and 6.
Lecture 4 Data. Why GIS? Ask questions Solve a problem Support a decision Make Maps Involve others, share data, procedures, ideas.
Data Quality Issues-Chapter 10
GROUP 4 FATIN NUR HAFIZAH MULLAI J.DHANNIYA FARAH AN-NUR MOHAMAD AZUWAN LAU WAN YEE.
GSP 270 Digitizing with an Introduction to Uncertainty and Metadata
Ref: Geographic Information System and Science, By Hoeung Rathsokha, MSCIM GIS and Remote Sensing WHAT.
Applied Cartography and Introduction to GIS GEOG 2017 EL
Support the spread of “good practice” in generating, managing, analysing and communicating spatial information Georeferencing for Digitising By: Willy.
Map Scale, Resolution and Data Models. Components of a GIS Map Maps can be displayed at various scales –Scale - the relationship between the size of features.
Chapter 3 Sections 3.5 – 3.7. Vector Data Representation object-based “discrete objects”
Fundamentals of GIS Materials by Austin Troy © 2008 Lecture 18: Data Input: Geocoding and Digitizing By Austin Troy University of Vermont.
Faculty of Applied Engineering and Urban Planning Civil Engineering Department Geographic Information Systems Vector and Raster Data Models Lecture 3 Week.
Geographic Information System GIS This project is implemented through the CENTRAL EUROPE Programme co-financed by the ERDF GIS Geographic Inf o rmation.
Major parts of ArcGIS ArcView -Basic mapping, editing and Analysis tools ArcEditor -all of ArcView plus Adds ability to deal with topological and network.
Data input 1: - Online data sources -Map scanning and digitizing GIS 4103 Spring 06 Adina Racoviteanu.
Chapter 3 Digital Representation of Geographic Data.
8. Geographic Data Modeling. Outline Definitions Data models / modeling GIS data models – Topology.
How do we represent the world in a GIS database?
Cartographic and GIS Data Structures Dr. Ahmad BinTouq URL:
Geographic Information Systems Data Analysis. What is GIS Data ?
Support the spread of “good practice” in generating, managing, analysing and communicating spatial information Introduction to GIS for the Purpose of Practising.
URBDP 422 Urban and Regional Geo-Spatial Analysis Lecture 2: Spatial Data Models and Structures Lab Exercise 2: Topology January 9, 2014.
Vector data input and editing Scanning (scan digitizing) –drum scanner scanner scans in map image –specialized software ‘extracts’ point, line, and area.
GUS: 0262 Fundamentals of GIS Lecture Presentation 8: Data Input and Editing Jeremy Mennis Department of Geography and Urban Studies Temple University.
Data Entry Getting coordinates and attributes into our GIS.
OUTLINE:  geocoding  digitizing terms and methods  scanning methods  adding attributes OUTLINE:  geocoding  digitizing terms and methods  scanning.
Lecture 3 The Digital Image – Part I - Single Channel Data 12 September
Raster data models Rasters can be different types of tesselations SquaresTrianglesHexagons Regular tesselations.
1 Spatial Data Models and Structure. 2 Part 1: Basic Geographic Concepts Real world -> Digital Environment –GIS data represent a simplified view of physical.
GIS Data Structures How do we represent the world in a GIS database?
Verification & Validation. Batch processing In a batch processing system, documents such as sales orders are collected into batches of typically 50 documents.
INTRODUCTION TO GIS  Used to describe computer facilities which are used to handle data referenced to the spatial domain.  Has the ability to inter-
Distance measure Point A: UTM Eastings = 450,000m; Northings = 4,500,000m Point B: UTM Eastings = 550,000m; Northings = 4,500,000m.
Data Entry Getting coordinates and attributes into our GIS.
What is GIS? “A powerful set of tools for collecting, storing, retrieving, transforming and displaying spatial data”
Raster Data Models: Data Compression Why? –Save disk space by reducing information content –Methods Run-length codes Raster chain codes Block codes Quadtrees.
Spatial Data Models Geography is concerned with many aspects of our environment. From a GIS perspective, we can identify two aspects which are of particular.
Geoprocessing and georeferencing raster data
UNIT 3 – MODULE 5: Data Input & Editing. INTRODUCTION Putting data into a computer (called data coding) is a fundamental process for virtually all GIS.
Data Storage & Editing GEOG370 Instructor: Christine Erlien.
Rayat Shikshan Sanstha’s Chhatrapati Shivaji College Satara
INTRODUCTION TO GEOGRAPHICAL INFORMATION SYSTEM
Data Input and GIS refer to Chapter 4 Data input, verification, storage, and output Text Book Burrough, P. A. and R. A. McDonnell, Principles.
Chapter 3 Raster & Vector Data.
Spatial Data Models Raster uses individual cells in a matrix, or grid, format to represent real world entities Vector uses coordinates to store the shape.
Cartographic and GIS Data Structures
Spatial Data Entry via Digitizing
Presentation transcript:

Data Input and GIS refer to Chapter 4 Data input, verification, storage, and output Text Book Burrough, P. A. and R. A. McDonnell, 1998. Principles of Geographical Information Systems. Oxford University Press, London.

Overview Input of spatial data Modes of data input Rasterization and vectorization Map preparation and the digitizing Remote Sensing: Special Raster Data Input Integrating different data sources External Databases Exercise

Input of spatial data Need to have tools to transform spatial data of various types into digital format Data input is a major bottleneck in application of GIS technology. Costs of input often consume 80% or more of project costs Many commercial GIS operations generate most of their revenue through data input Data input is labor intensive, tedious, and error-prone There is a danger that construction of the database may become an end in itself and the project may not move on to analysis of the data collected

Input of spatial data-continue Essential to find ways to reduce costs and maximize accuracy Need to automate the input process as much as possible, but: automated input cab create bigger editing problems later Source documents (maps) may often have to be redrafted to meet rigid quality requirements of automated input Sharing of digital data is one way around the input bottleneck. More and more spatial data is becoming available in digital form

Input of spatial data-continue Data input to a GIS involves encoding both the locational and attribute data The locational data is encoded as coordinates on a particular cartesian coordinate system Source maps may have different projections and scales Several stages of data transformation may be needed to bring all data to a common coordinate system Attribute data is often obtained and stored in tables (Database Management System)

Input of spatial data-continue There are two methods for spatial data acquisition Primary methods Surveying, Photogrammetry, GPS, and Remote Sensing Secondary methods Digitization, Automatic line following, and scanning

The input subsystem Designed to transfer data into the GIS from external sources (attribute and map data) Must allow for encoding in either raster or vector(TIN) Must provide a means for spatial referencing (projections, cartesian coordinate systems, etc) Must provide link between storage and editing subsystems (ensure input can be saved and any errors corrected)

Modes of data input: Input Devices Grid overlay keyboard Digitizer Scanner Data in digital format (Total station, digital photogrammetry, remote sensing, GPS)

Grid overlay Grid on clear material is overlaid on map Identity of each cell in the grid is determined by what map features are in a particular cell Number or code is assigned to each class of map features, and used to label cells in grid After filling in the grid, numbers or codes are typed into the computer to produce a raster layer Pretty antiquated method, seldom used

Keyboard Keyboard entry (X,Y,Z), (Ø, , h), or angle and distance Input through keyboard is time consuming, but it is more accurate It is suitable for small areas i.e. when the number of points/lines/areas are limited Because of its high accuracy, sometimes it is used in applications that need high quality e.g. cadastral mapping

Digitizing: Digitizing Tablet Tablet composed of a flat surface, in which are embedded a grid of electronically active wires and mouse-like device (puck or stylus) usually with cross hairs. When puck is moved over the tablet, its location is known because the grid of wires senses it location. Puck also has buttons which allow communication with the computer Grid acts like a cartersian (X,Y) coordinate system. To input data, map is taped on digitizing tablet. Puck is placed over the feature of interest, and message is sent to compute through buttons on puck e.g., node is used to mark beginning and end of line feature, or point where polygon closes on itself

Digitization Digitization is a process of converting existing maps to digital form (vector format) A digitizer is connected to a computer and map features are followed manually Digitizers are available at different sizes (A4, A3, A2, A0) and different accuracy (0.05 mm) Example of digitizers are CalComp 9500 and Summagraphic

Digitizing-continue Digitizing the map contents can be done in two different modes: point or stream Point mode: the operator identifies the points to be captured explicitly by pressing a button Stream mode: points are captured at set time intervals (typically 10 per second) or on movement of the cursor by a fixed amount In point mode the operator selects points subjectively, two point mode operators will not code a line in the same way Stream mode generates large numbers of points, many of which may be redundant

Digitizing- Problems Paper maps are unstable: each time the map is removed from the digitizing table, the reference points must be re-entered when the map is affixed to the table again If the map has stretched or shrunk in the interim, the newly digitized points will be slightly off in their location when compared to previously digitized points Errors occur on these maps, and these errors are entered into the GIS database as well the level of error in the GIS database is directly related to the error level of the source maps

Digitizing- Problems-continue Maps are meant to display information, and do not always accurately record locational information, for example, when a railroad, stream and road all go through a narrow mountain pass, the pass may actually be depicted wider than its actual size to allow for the three symbols to be drafted in the pass Discrepancies across map sheet boundaries can cause discrepancies in the total GIS database e.g. roads or streams that do not meet exactly when two map sheets are placed next to each other User error causes overshoots, undershoots

Scanners Types of scanners: Line following and drum Line following placed on a line and follow line using a guiding device such as a laser Two short comings: 1. sample lines at regular time or distance intervals (more complex parts of the line should have more samples, less complex need less samples) 2. lines that converge then diverge (e.g., contours along a cliff, road intersections, etc), device doesn’t know which line is which also broken lines (dashes, interrupted by label etc.) Line following technology can be reproduced in a software environment (line tracing software)

Drum scanners Drum scanners (Fig 5.2, p. 129) as the drum rotates about its axis, a scanner head containing a light source and photo-detector reads the reflectivity of the target graphic, and digitizing this signal, creates a single row of pixels from the graphic. The scanner head moves along the axis of the drum to create the next column of pixels, and so on through the entire scan Systems may have a scan spot size of as little as 25 micrometers, and be able to scan graphics of the order of 1 meter on a side an alternative mechanism involves an array of photo-detectors which extract data from several rows of the raster simultaneously. The detector moves across the document in a swath when all the columns have been scanned, the detector moves to a new swath of rows initially, scanning produces a raster image, which can be converted to vector using on screen digitizing or automated line tracing software

Scanning Scanning is a process of converting existing maps to digital form (raster format) A scanner is connected to a computer and map features are scanned automatically Scanners are available at different sizes (A4, A3, A2, A0) and different accuracy (300 dpi, 600dpi, 1000 dpi) Example of Scanners are UMAX-S12, HP

Scanners-Problems Scanners are generally very expensive Editing can take nearly as long as manual digitizing would have taken Scanners should be thought of as time-saving devices only when maps are clear, show good contrast, and contain a relatively simple amount of content

Rasterization and vectorization Regardless of input device, it is necessary to determine if the final product will be raster or vector Most GIS programs allow conversion between the two, but problems are involved If vector to raster, cell size is also important, but the results are satisfactory If raster to vector, lines become blocky and step-like, can’t reverse the procedure to produce original content of vector line also, resolution (or cell size) has a direct effect on the spatial integrity of the object Spline algorithms - apply a smoothing function to vector lines Examples of software convert from raster to vector or vice versa are R2V and ArcScan under Arc/Info

Map Preparation and the Digitizing Identification of features to be digitized. Sometimes marked directly on the map or on clear overlay. Sometimes, identification of nodes vs. vertices The digitizing process usually starts with telling the computer about the coordinate system that the map is in. Digitizer operates in its own cartesian coordinate system, need to establish relationship between digitizer coordinates and map coordinates (Transformation) Registration points or tick marks identified. Allows you to remove map from tablet to allow others to access it, then put it back on and register the input system using tic marks. It is essential to locate these precisely because they provide the reference for all other spatial data entered

Reference Frameworks and Transformations Digitizer coordinate must be transformed to map coordinates using a minimum number of four registration marks to cater for Translation: Object movement Rotation: Reorient the object Scale change: adjust the object size (Figure 5.4, p. 133, DeMers)

Setting up digitizing environment to handle errors Fuzzy tolerance - attempts to account for errors caused by the "shaky hand”. Based on the idea that you will not be able to place the cursor exactly the same location twice. Essentially defines a distance for maximum separation . If two nodes are within the limits of fuzzy tolerance, the are snapped together. Same idea for line features. Can be done before digitizing starts or can be implemented in post-digitizing editing process Other variables: Material of map shrink/swell with changes in humidity and temperature and stable medium such as plastic (Mylar) is preferred

What to Input Sample more for more information Define your purpose before hand and make sure the data you are using are suitable for the goals of your project and pre-plan carefully Use the most accurate data, but not data that is too accurate for your purpose Check to see if data are already available Keep coverages simple and use the same map to extract different coverages when possible How Much to Input: Scale dependent General rule - more complex features at larger scales require more detail (more vertices, smaller cell size) Sample more for more information

Methods of Vector Input Manual digitizing, Registration marks Location of nodes, lines not become nodes and nodes don’t become just points Building of topology Correcting of digitizing errors Transformation and projection Adding attribute data Checking the accuracy of attribute data

Methods of Raster Input Presence/absence method: If object occurs in a cell (anywhere) it is recorded as present ( simplicity ) best method for coding points and lines (Fig. 5.6 p. 143) Centroid of cell method : Presence only recorded if object is at the center of the cell . Disadv. - less simple, requires calculation of centroid, location of object relative to centroid. Generally restricted to raster encoding of polygons Dominant type method : Commonly used for encoding polygons into raster format . Identified as present if it occupies more than 50% of the cell Percent occurrence: Not only encodes presence/absence, but % occurrence (Urban/Rural) Generally, each attribute is recorded as a separate coverage e.g., one grid of percent urban, one of percent rural, percent water, percent forest, etc.

Remote Sensing: Special Raster Data Input Remote Sensing data is considered as special raster data (in digital form). Image processing software can be used to extract/classify remote sensing imagery (cover later in the semester) Attention should be paid to geometric and radiometric corrections and method of classification (supervised/unsupervised), different radiometric, geometric, and temporal resolutions Institutional problems related to remote sensing data include availability of data (limited coverage, cloud cover), cost, education and training, and organizational infrastructure

External Databases An efficient method of building a GIS database is to limit the amount of time and cost necessary to develop database A plenty of data already available in different digital format an in different media 9-inch tape, 8 mm tape, CD-ROM, etc. Need to evaluate data for its utility/quality for projects and ability to import Meta-data or data dictionary should be prepared for the GIS database (information about the content)

Exercise 2: Digitization 1. Load any image of the campus into ArcView. 2. Create three new themes (point, line, polygon) (hint: View/new theme) a. Create a point theme to show your classroom buildings, where you park, and where you have lunch. b. Create a line theme of the paths that you walk between those points c. Create a polygon theme of each building that has a point. 3. Give a unique id number to each point, line and polygon in each of the new themes tables. (hint: Table/start editing) 4. Change the legend of each theme to show the different id numbers. 5. Give the points a new symbol that represent what is happening there. 6. Give the lines arrows to show what direction you are walking. 7. Create a layout that has a title, north arrow, your name, date, and a custom legend.

Digitization-continue To create a custom legend a. First load the legend into a layout. b. Second select the legend and right click simplify. (This will separate the legend) c. Now edit the legend to give it a unique look and take out the polygon theme part of the legend. d. Once you are finish with the legend select everything in the legend. When everything is selected go to graphics/group. This will group the graphic back together. 9. In the layout, select File and Export the layout to JPEG format, but before that make sure the JPEG (JFIF) image support Extension is loaded What to hand in. 1. Jpeg layout 2. Layout contains a. Campus image with three themes b. Custom legend c. Title, scale, north arrow and your name