Download presentation
Presentation is loading. Please wait.
Published byBarry Gibson Modified over 9 years ago
1
2005 Ohio GIS Conference September 21-23, 2005 Marriott North Hotel Columbus, Ohio Geoprocessing for Animal Premises ID Luanne Hendricks State of Ohio OIT/GISSC Intern Columbus State Community College
2
Overview Objective Source Data & Desired Outputs Timeline Tools and Automation Process Statistics Observations
3
Objective Geoprocessing Input: Source Data from County Auditors Output: - Normalized Parcel Data - Unique AG Owners
4
Output - Deliverables Normalized Parcel/Point Geodata – agricultural ( 100 <= LUC <= 199) – dairy (LUC = 103, 113) – residential ( 510 <= LUC <= 520, LUC = 560) Normalized Tabular Data (Access DB) –Table of unique ag owners with owner_id –Table of parcel data with owner_id Time Estimate to regenerate data annually
5
Example: Locate Residential Parcels of Ag Land Owners
6
Example: Select Parcels owned by Owner ID = 2894
7
Owner to Parcel Table Example
9
Source Data – Quantity/Quality Large volume of data – approx. 5 million source records – some counties had 40-50 fields of data – approx. 5 GB of data Multiple source files per county Parcel, Point, CAMA data Non-standardized data fields Variable completeness
10
Example: Non-Normalized Source vs. Normalized Output
11
Processing – High Level View Data Collection from Counties Normalize Source Data Generate Owner Ids for Parcel Records Generate Owner Table Match Dairy Addresses to Parcel Table Create Project for User
12
Timeline First Pass EffortSeveral PT HC - Approx. 1 FT HC Tasks Data Collection & Geocoding NormalizingOwner IDsDairy Match Create Project MonthJanuaryFebruaryMarchAprilMay Second Pass Effort1 PT HC1 FT HC Tasks Identify Original Source used Manual Normalizing Automation Normalizing Owner IDs Owner Ids Dairy match Project MonthMayJuneJulyAugustSept.
13
Need Automation Strategy Need to automate process for: –Repeatability –Ease of modification –Testability –Traceability...As well as speed
14
ToolsProcessing Tasks ArcToolBox - Model BuilderScript development - Python - VBscript Pre-Normalization - Joining source files, - adding key id, -copying to working directory Pre-Owner ID Generation - Address Standardization - Rejoin Data file to Shapefile MS Access - VBA - Queries - SQL - Form Interface - Normalization - Owner ID & Owner Table - (Dairy Match)
15
Processing Detail - Example Pre-normalization steps in Model-Builder for a county with 2 source files – shape and CAMA that need to be joined. This county is now ready for normalization in Access. Slightly different steps are needed for point files and counties with a single source parcel shapefile.
16
Processing Detail - Example Continued Model-Builder has limitations – you can’t loop through these steps for a list of counties. But this model can be converted to script and coded to process a list. Additional field-name mapping steps needed due to “coarse-grained” geoprocessing object. Loop thru cnty list. Delete Temporary table view & layer Get Fields Make Field Map
17
Example of Geoprocessing Tool Limitations When you join fields in the geoprocessing environment, and create a new Feature Layer shapefile, field names are [original layer name].[field name] truncated to 10 characters. Renaming is not done automatically for you as it is when you join and create a new layer manually in ArcMap.
18
Python Script Example
19
Access Form Interface Used for Normalization
20
Example: Non-Normalized Source vs. Normalized Output
21
Normalization Mapping Table
22
Processing – Owner IDs Data Collection from Counties Normalize Source Data Generate Owner Ids for Parcel Records Generate Owner Table Match Dairy Addresses to Parcel Table Create Project for User
23
Owner ID and Owner Table Generation
24
Standardized vs. Un-standardized
25
Owner ID Algorithm Aggregate on Lastname, Firstname Standardize addresses For each Lastname,Firstname group, choose the address - OWNADD1, MAILADD1, or SITEADD, that produces the best set of matches
26
Statistics ORIG_REC = Total AG + Total Residential NOAD = # Records with no address information ADD_REC = Total # of AG + Total Residential associated with more than 1 parcel FINL_REC = Total # of AG + Total Residential associated with at least one AG pcl OWNR = # of Records in the Owner Table NMD_AG = Aggregate of OWNNAM1/MAILADD1 and OWNADD1/MAILADD1 as a sanity check and to compare how effective the processing was
27
Testing Use Statistics –Numbers make sense –Numbers add up, e.g.: All records in Parcel table assigned an ownerid # Records in Owner Table = # Aggregated on Owner Id in PCL table Visual Inspection –Visually inspect how Owner Ids were assigned –Create shapefile and view data in project –Spot check source vs. processed data in shapefiles
28
Status 53 counties normalized 40 counties have owner ids/owner table Dairy matching - to do Final project – to do
29
Example Project – Work in Progress
30
Observations and Conclusions (1) After initial development, Automation speeds process For example, using Form Interface to normalize: Data Normalization TimeData Volume Manual 1 st pass 6 day 1X Ag only Auto 2 nd pass 1 day 5X Ag + Res
31
Observations and Conclusions (2) Automation: –speeds process after initial development investment –enables repeatability of process –makes modification and redo less painful –increases data consistency –reduces errors –accurately documents process –increases future capability to do similar processing – tools are reusable Automation is cost effective
32
Observations and Conclusions (3) This job would be easier if: –Data was maintained in small standard components: Last Name, First Name, MI as separate fields Address components – SiteNum, SiteDir, SiteStr There was a standard for field names of components
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.