By Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009

Presentation transcript:

Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, by Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009

Overview
- Problem statement
- Address matching with a spatial adjacency match
- Test runs
- Results
- Conclusion

Problem statement
Alphanumeric matching: 101 Rubida Street, Murrayfield incorrectly matched to 110 Rubida Street, Murrayfield
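To illustrate why a purely alphanumeric comparison can confuse these two addresses, the sketch below scores them with a generic character-level similarity from Python's standard library. This is only an assumed stand-in: the slides do not say which string comparison Intiendo uses, and the function name `similarity` is hypothetical.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Character-level similarity in [0, 1]; a stand-in for an alphanumeric score."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


# The two addresses from the slide differ only in the order of two digits,
# so a character-based score rates them as nearly identical.
score = similarity(
    "101 Rubida Street, Murrayfield",
    "110 Rubida Street, Murrayfield",
)
print(f"similarity: {score:.3f}")
```

A score this close to 1 means that a threshold on string similarity alone cannot tell the two street numbers apart.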

Problem statement
- Alphanumeric matching alone causes errors (previous slide)
- Potential solution: attribute relaxation (i.e. ignore suburb)
- Most common cause of errors (Goldberg et al. 2007)

With spatial adjacency match
- Intiendo = alphanumeric matching + spatial adjacency match
- Improves geocoding results
- Alphanumeric match: propose matched address from reference dataset
- Above threshold?
  - Yes: proposed matched address is the result
  - No: search for street number in radius around proposed address
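The decision flow on this slide can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not Intiendo's implementation: the `Address` class, the `match_address` function, the similarity measure, the threshold of 0.95, the 500 m radius, the planar coordinates and the suburb names are all hypothetical. The slides only say that the street number is searched for in a radius around the proposed address; requiring the same street name as well is an extra assumption.

```python
import math
from dataclasses import dataclass
from difflib import SequenceMatcher
from typing import Optional, Sequence


@dataclass(frozen=True)
class Address:
    number: str
    street: str
    suburb: str
    x: float  # hypothetical planar coordinates, in metres
    y: float

    def text(self) -> str:
        return f"{self.number} {self.street}, {self.suburb}".lower()


def match_address(
    number: str,
    street: str,
    suburb: str,
    reference: Sequence[Address],
    threshold: float = 0.95,  # assumed value, not taken from the slides
    radius: float = 500.0,    # assumed search radius, in metres
) -> Optional[Address]:
    if not reference:
        return None
    query = f"{number} {street}, {suburb}".lower()

    def score(candidate: Address) -> float:
        return SequenceMatcher(None, query, candidate.text()).ratio()

    # Step 1: the alphanumeric match proposes the best-scoring reference address.
    proposed = max(reference, key=score)
    if score(proposed) >= threshold:
        return proposed

    # Step 2: below the threshold, so search for the input street number on
    # the same street within `radius` of the proposed address.
    for candidate in reference:
        same_number = candidate.number == number
        same_street = candidate.street.lower() == street.lower()
        distance = math.hypot(candidate.x - proposed.x, candidate.y - proposed.y)
        if same_number and same_street and distance <= radius:
            return candidate
    return None  # treated as a non-match (assumed behaviour)


# Invented reference data: number 35 lies just across a suburb boundary.
reference = [
    Address("16", "Voortrekker Road", "Northside", 0.0, 0.0),
    Address("35", "Voortrekker Road", "Riverbend", 300.0, 0.0),
]
# The input address names the neighbouring suburb (a misleading suburb name).
result = match_address("35", "Voortrekker Road", "Northside", reference)
print(result)
```

In this invented data the alphanumeric step proposes number 16, because the suburb name matches, but its score falls below the threshold. The radius search then finds number 35 on the same street in the adjacent suburb.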

With spatial adjacency match
[Flowchart: decision with YES and NO branches]

With spatial adjacency match

With spatial adjacency match
1. Geocode without SpatialAdjacencyMatch (non-spatial run)
2. Geocode with SpatialAdjacencyMatch enabled (spatial run)
Compare results

With spatial adjacency match
Sample input address data
- 14,670 address records
- Test for misleading names
- Therefore include only addresses for which province, suburb, street name and street number are populated
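The selection rule for the sample can be sketched as a simple filter. The field names and the dictionary record layout below are assumptions made for illustration; the slides do not describe the input format.

```python
from typing import Iterable, List

# Hypothetical field names for the four attributes that must be populated.
REQUIRED_FIELDS = ("province", "suburb", "street_name", "street_number")


def is_fully_populated(record: dict) -> bool:
    """True if every required field is present and not blank."""
    for field in REQUIRED_FIELDS:
        value = record.get(field)
        if value is None or str(value).strip() == "":
            return False
    return True


def select_test_records(records: Iterable[dict]) -> List[dict]:
    """Keep only records usable for testing misleading suburb names."""
    return [r for r in records if is_fully_populated(r)]


# Invented sample records: the second has no suburb and is dropped.
sample = [
    {"province": "Gauteng", "suburb": "Murrayfield",
     "street_name": "Rubida Street", "street_number": "101"},
    {"province": "Gauteng", "suburb": " ",
     "street_name": "Rubida Street", "street_number": "110"},
]
selected = select_test_records(sample)
print(len(selected))
```

Requiring the suburb to be populated matters here because the test targets addresses whose suburb is present but misleading, not addresses where it is missing.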

With spatial adjacency match
- Intiendo hierarchy database
- Reference dataset: AfriGIS address data

Test runs
Intiendo settings

Results

Results

                              Spatial run    Non-spatial run
Customer address records      14,670         14,670
Matched address records       8,905 (61%)    8,514 (58%)
Non-matched address records   5,765 (39%)    6,156 (42%)

A gain of 3 percentage points is low, but the improvement can be significant, e.g. for addresses on different sides of a highway.
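As a check on the figures above, the match rates and the size of the improvement can be recomputed from the reported counts. The variable names are illustrative; the counts are the ones given in the results.

```python
TOTAL = 14_670  # customer address records in both runs
matched = {"spatial": 8_905, "non_spatial": 8_514}

# Match rate per run, and the non-matched remainder.
rates = {run: count / TOTAL for run, count in matched.items()}
non_matched = {run: TOTAL - count for run, count in matched.items()}

# Additional addresses matched when the spatial adjacency match is enabled.
extra = matched["spatial"] - matched["non_spatial"]
gain_points = 100 * (rates["spatial"] - rates["non_spatial"])

for run in matched:
    print(f"{run}: {matched[run]} matched ({rates[run]:.1%}), "
          f"{non_matched[run]} non-matched")
print(f"extra matches: {extra} ({gain_points:.2f} percentage points)")
```

The spatial run matches 391 more addresses. The unrounded gain is about 2.7 percentage points, which appears as 3% when the rounded rates of 61% and 58% are subtracted.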

Results
Specific example

Results
[Map labels: 35 Voortrekker Road, 16 Voortrekker Road]

Results
- Misleading suburb names in address
- Alphanumeric match alone causes errors
- Intiendo = alphanumeric + spatial adjacency match
- More input addresses are matched more accurately
- Improves quality of results
- Sample test runs: 3% improvement

Conclusion
- Intiendo address matching = alphanumeric string matching + spatial adjacency match
  - Improves quality of results
  - More addresses matched more accurately
- This work: specific sample dataset showed improvement
- Future: more tests to understand average percentage improvement

Acknowledgements
Christopher Ueckermann from AfriGIS for running the geocoding tests with Intiendo