Presentation is loading. Please wait.

Presentation is loading. Please wait.

Data Management – Processing

Similar presentations


Presentation on theme: "Data Management – Processing"— Presentation transcript:

1 Data Management – Processing
Claire Osgood November 2017

2 Children’s Environmental Health Initiative
Processing

3 Children’s Environmental Health Initiative
Typical Processing Clean up variables Clean up records Create new variables Raw Data Contents Sample print Freq/cross tabs Freq/cross tabs Contents Sample print Processed Data

4 Children’s Environmental Health Initiative
Typical Processing Numeric Code Race 1 10 11 12 2 21 22 01 02 Clean up variables “Hispanic” “hispanic” “hisp” “H” =“Hispanic” Label Rename Drop empty variables Standardize values within fields Convert dates and times to system format Numbers as text: Convert to numeric; OR Add leading zeros Deal with invalid, missing/unknown, 0/blank State Abbreviation __ -- AR LA NAOK TX Unk OK Sometimes numbers are read in as text because there are characters in records with missing info (“NA” or “NULL” or similar).

5 Children’s Environmental Health Initiative
Typical Processing Clean up records Check for unique identifier If none, what combination of variables uniquely identifies the record? Look for duplicates and deal with them Delete empty records Exclude other records as appropriate Sort or index

6 Children’s Environmental Health Initiative
Typical Processing Create new variables 1 2 17 18 19 20 21 22 30 3 Age Group Street number + Pre Dir + Street Name + Post Dir Address Indicators commonly used Calculated variables; combined/concatenated values; separating values Coded variables Standard variables

7 Children’s Environmental Health Initiative
Other Processing GIS Geocoding Create and output a file for geocoding Separately, geocode Second program to incorporate geocoded information GIS process to add spatial data Merge or combine with other datasets

8 Children’s Environmental Health Initiative
Tips You are going to have to account for adding/removing records, merges with other datasets, etc. Set up programs to have those counts readily available. Template programs for common processes Cumulative Inclusion Enrollment Form for grants Standard exclusions or common calculations Common/standard variables Race or age groups Indicators


Download ppt "Data Management – Processing"

Similar presentations


Ads by Google