by Serena Coetzee and Magnus Rademeyer presented at the ICC 2009, Santiago, Chile, November 2009 Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names by Serena Coetzee and Magnus Rademeyer presented at the ICC 2009, Santiago, Chile, November
Overview Problem statement Address matching with a spatial adjacency match Test runs Results Conclusion Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009
Problem statement Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009 Alphanumeric matching 101 Rubida Street, Murrayfield incorrectly matched to 110 Rubida Street, Murrayfield
Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009 Problem statement Alphanumeric matching only causes errors (previous slide) Potential solution: attribute relaxation (i.e. ignore suburb) Most common cause of errors (Goldberg et al. 2007)
Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009 With spatial adjacency match Intiendo = alphanumeric matching + spatial adjacency match Improves geocoding results Alphanumeric match: propose matched address from reference dataset Above threshold? Yes, proposed matched address is result No, search for street number in radius around proposed address
Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009 With spatial adjacency match NO YES
With spatial adjacency match Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009
Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009 With spatial adjacency match 1.Geocode without SpatialAdjacencyMatch (Non-spatial run) 2.Geocode with SpatialAdjacencyMatch enabled (Spatial run) Compare results
Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009 With spatial adjacency match Sample input address data 14,760 address records Test for misleading names Therefore include only addresses for which province, suburb, street name and street number are populated
With spatial adjacency match Intiendo hierarchy database Reference dataset: AfriGIS address data Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009 With spatial adjacency match
Intiendo settings Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009 Test runs
Results Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009 Results
Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009 Results Spatial runNon-spatial run Customer address records14,670 Matched address records8,905 (61%)8,514 (58%) Non-matched address records5,765 (39%)6,156 (42%) 3% is low but improvement can be significant, e.g. address on different sides of a highway
Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009 Results Specific example
Results 35 Voortrekker Road 16 Voortrekker Road Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009 Results
Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009 Results Misleading suburb names in address Alphanumeric match only causes errors Intiendo = alphanumeric + spatial adjacency match More input addresses are matched more accurately Improves quality of results Sample test runs: 3% improvement
Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009 Conclusion Intiendo address matching = alphanumeric string matching + spatial adjacency match Improves quality of results More addresses matched more accurately This work Specific sample dataset showed improvement Future More tests to understand average percentage improvement
Testing the spatial adjacency match of the Intiendo address matching tool for geocoding of addresses with misleading suburb or place names, Serena Coetzee and Magnus Rademeyer, presented at the ICC 2009, Santiago, Chile, November 2009 Acknowledgements Christopher Ueckermann from AfriGIS for running the geocoding tests with Intiendo