Introduction Most samples in Household Travel Surveys (HTS) complete via web Geocoding is an important element in HTS collection Online geocoding services.

Slides:



Advertisements
Similar presentations
Better Accessible Transport to Encourage Robust Intermodal Enterprise Work Package 6 Dr John Harrison.
Advertisements

ADePT Automated DECs Poverty Tables Michael Lokshin, Zurab Sajaia and Sergiy Radyakin DECRG-PO The World Bank.
Graphic Data Clean-Up Issues. Graphic Data Clean-up Very Important if you plan to use existing CAD or GIS data from another agency, department, or private.
Mark Finch GIS and Roadway Data Office Manager Using GIS Spatial Services to Improve Collision Event Location Information NW GIS Conference Boise, Idaho.
Building an online tool for spatial joins using open source software Karsten Vennemann Seattle.
From portions of Chapter 8, 9, 10, &11. Real world is complex. GIS is used model reality. The GIS models then enable us to ask questions of the data by.
Agenda Overview Why TransCAD Challenges/tips Initiatives Applications.
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
Geocoding: - Table to geocode may be an ASCII, spreadsheet, dBase, or MapInfo table - Referred to as the “target” table - The target table is the attribute.
1 Location-Based Services Using GSM Cell Information over Symbian OS Final Year Project LYU0301 Mok Ming Fai (mfmok1) Lee Kwok Chau (leekc1)
Map Analysis with Networks Francisco Olivera, Ph.D., P.E. Department of Civil Engineering Texas A&M University Some of the figures included in this presentation.
Business Intelligence Dr. Mahdi Esmaeili 1. Technical Infrastructure Evaluation Hardware Network Middleware Database Management Systems Tools and Standards.
MOBIGUIDE MOBIGUIDE CS 8803 – ADVANCED INTERNET APPLICATION DEVELOPMENT Project Presentation By: Ashwin Pallikarana Tirumala Lalanthika Vasudevan Sneha.
Lecture 5 Geocoding. What is geocoding? the process of transforming a description of a location—such as a pair of coordinates, an address, or a name of.
OMap By: Haitham Khateeb Yamama Dagash Under Suppervision of: Benny Daon.
NR 422: Topology Jim Graham Fall 2010 See: odatabase-topology.pdf.
GIS technologies and Web Mapping Services
MOBIGUIDE MOBIGUIDE CS 8803 – ADVANCED INTERNET APPLICATION DEVELOPMENT Project Presentation By: Ashwin Pallikarana Tirumala ( ) Lalanthika Vasudevan( )
4-1 INTERNET DATABASE CONNECTOR Colorado Technical University IT420 Tim Peterson.
Introduction to ArcGIS for Environmental Scientists Module 1 – Data Visualization Chapter 1 – GIS Basics.
Optimum route finder to the point of interest through public transport By Pratik Mehta Submitted to Dept. Comp. Science and Engineering IIT-Bombay.
Assignee Name Harmonization Efforts at the U.S. Patent and Trademark Office US Patent and Trademark Office Office of Electronic Information Products Patent.
Using a LDAP Directory Server for Environmental Data Discovery Donald Denbo NOAA-PMEL/UW-JISAO Presented by Eugene Burger NOAA-PMEL/UW-JISAO
Harry Williams, Cartography1 INTRODUCTION TO GIS A Geographic Information System is a combination of software and hardware that can store, manipulate,
Composition in Modeling Macromolecular Regulatory Networks Ranjit Randhawa September 9th 2007.
Metadata By N.Gopinath AP/CSE Metadata and it’s role in the lifecycle. The collection, maintenance, and deployment of metadata Metadata and tool integration.
Scavenger Hunt Scavenger Hunt Scavenger Hunt Scavenger Hunt is an excellent game to play on-line or live when you want learners to be more self-directed.
U.S. Environmental Protection Agency Central Data Exchange Pilot Project Promoting Geospatial Data Exchange Between EPA and State Partners. April 25, 2007.
Presented by: Marianne Cardwell, Woolpert, Inc. Cheryl Spencer, City of Indianapolis February 23, 2010 Voter Information Portal.
Address matching or Geocoding  Very common for:  E 911  Crime reports  Customer records  Tax/Parcel records  Marketing  Driving directions Most.
Integration of PASER & GIS WLIA March 4, 2004 Presentation Outline Road Centerlines –Overview –Creating –Attribute –Uses –Paser Geocoding Process –Overview.
Vector data model TIN: Triangulated Irregular Network.
Geocoding Chapter 16 GISV431 &GEN405 Dr W Britz. Georeferencing, Transformations and Geocoding Georeferencing is the aligning of geographic data to a.
TUGIS March 15, 2016 Next Generation 911 Data Management TUGIS 2016.
Key Terms Attribute join Target table Join table Spatial join.
Graphical Data Engineering
Business Analytics Masters Program
Physical Structure of GDB
INTRODUCTION TO GEOGRAPHICAL INFORMATION SYSTEM
Supervisor: Prof Michael Lyu Presented by: Lewis Ng, Philip Chan
Harry Williams, Cartography
Parcel Fabric and the Local Government Model
CUAHSI HIS Sharing hydrologic data
Development of User-Participation-type Communication tools for revitalization of local communities using MapServer Kei SAITO*, Michihiko SHINOZAKI* and.
Two-Variable Linear System
Multi Rater Feedback Surveys FAQs for Participants
Multi Rater Feedback Surveys FAQs for Participants
Reach People when it matters with Location Extensions
Spatial Data Processing
Midwestern District ITE 2017 Conference
GTECH 709 Geocoding and address matching
Emergency Event Support system
Voter Information Portal
Routing and Logistics Arc Routing 2018/11/19.
CIM Model Management in a Real-Time Environment Open Systems International, Inc. (OSI)
GEOCODING Creates map features from addresses or place-names.
Efficient Evaluation of k-NN Queries Using Spatial Mashups
CS & CS Capstone Project & Software Development Project
Assigning Spatial Coordinates to Your Address Data
Electronic Field Study Advanced User Training
Building an online tool for spatial joins using open source software
Georeferencing, Geocoding and Address Matching
Automating and Validating Edits
Fabio Crescenzi Territorial Databases and Gis
Esri Highway Data Maintenance Solutions: An Introduction
Building Map Books in ArcGIS
SECE Geoloc v2.0 Advised By: Prof. Henning Schulzrinne Jan Janak
The European Location Framework: taking INSPIRE to the next level
Technical Coordination Group, Zagreb, Croatia, 26 January 2018
Esri Roads and Highways An Introduction
Presentation transcript:

Integrating Online Geocoding Sources into Web-Based Household Travel Surveys

Introduction Most samples in Household Travel Surveys (HTS) complete via web Geocoding is an important element in HTS collection Online geocoding services are commonly used in web-based instruments Google Maps (geocoding) + Google Places (point of interest – POI) Bing Maps General assumption that these services are equivalent to traditional desktop GIS geocoding Including offsetting addresses from centerline… Intro / background / objective

Household Travel Survey User Needs Origin Destination

The Typical Travel Survey Geocoding Process Web survey instrument uses Google APIs for geocoding locations Searches done using geocoding and POI services Real-time response Familiar user interface (for many) High Precision Participant’s encouraged to provide nearest cross-streets or better Ability to drag location marker and click on map Quality-control Automated checks verify minimum geocode type (address, intersection, POI) Consistency in travel reviewed using speed checks Regular client deliverables and review Discuss typical front end and back end processes for acquiring places, geocoding locations, and quality-control

2015 MDOT Statewide (MTC III) + SEMCOG HTS Address-based sample Invitations delivered by USPS Focus on web self-completion (MITravelCounts.com) WebGeoSurvey + TripBuilder Web (online instrument) Telephone reporting and support available via toll-free number Statewide Survey - additional interest from SEMCOG region 3rd Iteration (previous efforts in 2004 and 2009) Very peculiar geocoding requirements in RFP SEMCOG addresses had to be matched to a point address file Minimum MDOT Statewide model network offsets (25’) Discuss size and significance of MI travel survey, the multiple interests/parties/GIS-groups (and the effects that had on our own review and processes)

Study Area MDOT SEMCOG Continue discussion from previous slide

SEMCOG Point Address Matching Loaded point addresses into PostgreSQL table (~1.7 million) Created indexed geometries using original coordinates Exposed geocoding service via a web-service to online tools Developed match query in PostgreSQL that matched online geocodes to point addresses in post-processing Used built-in address parsing and spatial data extensions Matched using location and address components (fuzzy street name match and number within 225’) Used functions in the tiger and postgis extensions of PostgreSQL

MDOT Model Network Distance Requirement Compared geocode locations against MDOT network to look for cases where the geocode fell closer than 25’ of a link Our expectation was that most cases would fall around this distance or, at least at some consistent offset to the correct side of the street segments which we could then extend to meet minimum distance requirements Not exactly what we found…

Pilot Study Results Geocode Distance to MDOT Statewide Model Network

Network distances – What we found (1/4) Google Maps does not really offset results from the centerline for address matches that don’t use parcel data The geocode_type we were saving did not include enough information to determine if the coordinates were offset from centerline. A new variable called location_type was added rooftop* range_interpolated approximate geometric_center The process used by online instruments tools to augment POI results with address components involved re-geocoding them, which replaced the original coordinates with the ones returned by the Google geocoder.

Network distances – What we found (2/4) Out of about 3,000 geocodes in the pilot, close to 400 fell within 25’ of a link (excludes home locations and ”intersection” geocodes) The data on the SEMCOG region was not as affected because we were already matching geocodes to point addresses What was Google doing? We re-ran the delivery addresses that were not home nor intersections through the Google geocoder to get its indicator of location_type (not captured in the pilot). This data element told us how the coordinate was obtained. Not available on POI results, only geocode results. Distances to resulting coordinates were then measured against the MDOT network. For comparison, we ran those same addresses through the Bing Maps geocoder service. Now let’s look at some pretty charts…

Network distances – What we found (3/4) Google distance to MDOT Network

Network distances – What we found (4/4) Bing distance to MDOT Network

Post-Pilot Process Improvements (1/2) Changed online tools so that original offset coordinates from POI results were preserved. Added geometry location type to saved geocode attributes.

Post-Pilot Process Improvements (2/2) Created new process that re-geocoded Google results using Bing Target location type of “range_interpolated” Also re-geocoded locations that are not “intersection” nor home that fall within 25’ of the MDOT road network Only replaced geocodes with Bing results if they fell close to the original (within 75’) Added checks that identified cases for review using ancillary data sources Expected only a small percentage (~5%) of Bing re-geocoded coordinates may need manual review and adjustment. And now, more pretty charts… Google Bing Analyst

Main Study Adjustments - Examples

Main Study Adjustments - Examples Discuss the pilot effort and results with graphics

Network-aware Geocode Auditing

Discuss the pilot effort and results with graphics

Discuss the pilot effort and results with graphics

Discuss the pilot effort and results with graphics

Final Remarks Using PostgreSQL made it easy to automate geocode checks Address parsing (tiger extension) Fuzzy string matching ( Spatial matching to point addresses Distance to network There is a need to check Google’s returned location types If it interpolated coordinates along its centerlines it likely did not offset them Combining the strengths of multiple geocoding sources in order to maximize The final break down of geocodes was: Google Rooftop: 17,462 Google POI: 77,878 Bing Re-geocode: 2,819 Analyst review: 2,835 Discuss closing observations, impressions, conclusions