ACS Public Use Microdata Samples DataFerrett SACOG

Slides:



Advertisements
Similar presentations
Using American FactFinder John DeWitt Project Manager Social Science Data Analysis Network Lisa Neidert Data Services Population Studies Center.
Advertisements

Loading Excel Double click the Excel icon on the desktop (if you have this) OR Click on Start All Programs Microsoft Office Microsoft Office Excel 2003.
Using Excel to Understand Your Data Clayton County Public Schools Department of Research, Evaluation and Assessment Assistant Principal In-Service.
Accessing and Using Block Group Data From the ACS Warren A. Brown Cornell Institute for Social and Economic Research.
11 ACS Public Use Microdata Samples of 2005 and 2006 – How to Use the Replicate Weights B. Dale Garrett and Michael Starsinic U.S. Census Bureau AAPOR.
Using American FactFinder John DeWitt Project Manager Social Science Data Analysis Network Lisa Neidert Data Services Population Studies Center.
1 U.S. Census Bureau Data Availability for Geographic Areas March 25, 2008.
How to Create Accessible PowerPoint Presentations Elizabeth Tu and Thayer Watkins April, 2010.
1 The American Community Survey (ACS) 2005 Data Release.
U.S. Census Bureau Demographic Census 2000 July 8, 2003.
Your Community by the Numbers Accessing the most current and relevant Census data Alexandra Barker Data Dissemination Specialist U.S Census Bureau New.
2014 SDC and CIC Annual Training Conference: Accessing ACS PUMS Data Tim Gilbert U.S. Census Bureau April 2, 2014.
The American FactFinder Florida Libraries Association Annual Conference, 2012, Orlando, Florida Jan Swanbeck, Documents Librarian, Joe Aufmuth, GIS Librarian.
1 Using the American Community Survey with American Factfinder CTPP Webinar Dec 2, 2008 Melissa Chiu, CTPP Coordinator Journey to Work and Migration Statistics.
11 The American Community Survey Steve Murdock, Ph.D. Director, Hobby Center for the Study of Texas Rice University.
U.S. Census Bureau census.gov Census Data Immersion From A Novice to A Skilled Data Miner Infopeople Webinar August 7,
The American Community Survey The American Community Survey Accessing Information for Hawaii from the 2006 American Community Survey (ACS) Jerry Wong Information.
Business Statistics If you are interested in business statistics, the Census Bureau’s web site is the place to start. In the Census Bureau’s web site you.
Your Table Is Waiting! Census 2010 Accessing and Using the Data Linda Clark Information Services Specialist U.S. Census Bureau Seattle Region April 19,
U.S. Decennial Census Finding and Accessing Data Summer Durrant October 20, 2014 Data & Geographical Information Librarian Research Data Services
American Factfinder Workshop Nola du Toit Spring 2007.
Public Use Microdata Samples Using PDQ Explore Software Grace York University of Michigan Library May 2004.
Kern Grant Summit - January 30, 2015
1 Journey-to-Work Data in the American Community Survey (ACS) May 17, 2009 TRB Transportation Planning Applications Conference Federal Data for Modelers.
1 Public Transportation Data in the American Community Survey (ACS) and Census Transportation Planning Products (CTPP) Dec 3, 2009 AASHTO Standing Committee.
4/22/2017 5:36 PM EViews Training Creating Workfiles.
Using IPUMS.org Katie Genadek Minnesota Population Center University of Minnesota The IPUMS projects are funded by the National Science.
Introduction to the Public Use Microdata Sample (PUMS) File from the American Community Survey Updated February 2013.
Integrating ACS with the World’s Census Data: ACS Microdata and the IPUMS Presented at the Pre-ALAP ACS/IPUMS Workshop November 16, 2010 Trent Alexander.
TheDataWeb & DataFerrett Rebecca Blash Bill Hazard The DataWeb Applications Branch U.S. Census Bureau.
Using the ACS: Issues with studying small areas and change over time Presented to Association of Public Data Users January 20, 2011.
American Community Survey Overview September 4, 2013 Tim Gilbert American Community Survey Office.
American Community Survey Maryland State Data Center Affiliate Meeting September 16, 2010.
American Community Survey (ACS) 1 Oregon State Data Center Meeting Portland State University April 14,
New Look New Tools Easier Access Accessing and Using Census Data The New American FactFinder (AFF2) Northwest Government Information Network Tumwater,
Data on the Foreign Born in 2010: Accessing Information on Immigrants and Immigration from the U.S. Census Bureau’s American Community Survey Thomas A.
 Public Use Microdata Sample – sample file of unaggregated raw data with no identifying information about an individual person or household (no addresses,
American Community Survey “It Don’t Come Easy”, Ringo Starr Jane Traynham Maryland State Data Center March 15, 2011.
Census 2000: The Redistricting Summary Data (Public Law )
American Community Survey (ACS) Product Types: Tables and Maps Samples Revised
Accessing Census Data through the American FactFinder Arthur Bakis Information Services Specialist Boston Regional Census Center US Census Bureau
New Look New Tools Easier Access Accessing and Using Census Data The New American FactFinder (AFF2) Oregon AFF Training November, 2011.
DTC Quantitative Methods Summary of some SPSS commands Weeks 1 & 2, January 2012.
American Community Survey (ACS) Using Census Data by Block Group January 21, 2016 Presentation at the National Community Development Association Winter.
How to Work With SURN Principal Academy Data For data downloaded from onlineobservationtools.com.
Census 2010: Accessing Census Data THURSDAY, July 21, :30am.
American Community Survey (ACS) Overview & Access Eric Coyle Data Dissemination Specialist U.S. Census Bureau 1.
Measuring International Migration: An Example from the U. S
Census Data-Strictly Business?:
CENSUS & IPUMS DATA RETRIEVAL
Access Chapter 2 Querying a Database.
Top US Government Data Resources
Introduction to SPSS.
Press <spacebar> to continue tutorial
Journey-to-Work and Migration Statistics Branch U.S. Census Bureau
Reports: Pivot Table ©2015 SchoolCity, Inc. All rights reserved.
Microsoft Excel 2003 Illustrated Complete
Community Data Program
Exploring Microsoft® Access® 2016 Series Editor Mary Anne Poatsy
Using Census data to find emerging immigrant communities in your area
Introduction to IPUMS NYTS and IPUMS YRBSS
PolicyMap MD SLA : Leveraging Data to Lead November 5, 2015
Introduction To Computing BBA & MBA
Introduction to IPUMS NYTS and IPUMS YRBSS
American Factfinder and Census 2000
Survey Documentation and Analysis (SDA)
Spreadsheets and Data Management
Income Poverty Status Education The Labor Force Journey To Work
Transition to data.census.gov
Presentation transcript:

ACS Public Use Microdata Samples DataFerrett SACOG Luz M Castillo Data Dissemination Specialist Los Angeles Regional Office U.S. Census Bureau

Outline Summary Data vs. Microdata Fundamentals of PUMS Data Geography and the PUMS Accessing PUMS Data Documentation and Guidance

Summary Data Versus Microdata Premade or published tables Easy to get, even for small areas Limitations: fixed content Dataset of individual responses to questionnaire Enables custom tables and analyses Limitations: edits to protect privacy, can’t study small areas 3 3

Summary Data Source: 2010 ACS 1-year Estimates. Table B04001. FIRST ANCESTRY REPORTED 4

Microdata Source: 2010 ACS 1-year PUMS file

Microdata in SAS Source: 2010 ACS 1-year PUMS file.

Outline Summary data vs. Microdata Fundamentals of PUMS Data Geography and the PUMS Accessing PUMS Data Documentation and Guidance

What are PUMS data? Public Use anonymized, downloadable Microdata records of individual people Sample a representative sample of the population 8

PUMS Overview PUMS sample is a subsample of ACS interviews, one percent of all US households PUMS is a “weighted” sample Weighting variables must be used in analysis A set of two files - housing units and persons Available as SAS files, CSV files, via DataFerrett and redistributors such as IPUMS 9 9

Why Use PUMS? Data needed for a tabulation or a specific universe not supported by standard ACS tables (e.g., population groups by single year of age) Statistical analysis required to understand relationships between economic, demographic or housing variables (e.g., correlation analysis) Can create new measures using multiple variables or other people in household (spouse’s occupation, same-sex couples, number of kids) 10 10

ACS PUMS Availability Produced every year since 2000 Person-level files includes about 250 variables Housing unit files include about 200 variables Includes people in housing units and group quarters Includes many useful constructed variables (e.g., poverty status, subfamily identification, etc.) Includes collapsed codes for some variables (e.g., race, Hispanic origin, ancestry, place of birth, industry, occupation, etc.) 11

Person records in ACS PUMS (millions) Person records in ACS complete data (millions) Population represented 2001 1.2 285 2002 287 2003 290 2004 293 2005 2.9 4.5 296 2006 3.0 298 2007 301 2008 304 2009 307 2010 3.1 309 2011 5.0 312 12

Types of PUMS Files Released We release 3 new PUMS files every year 1 year PUMS (example: 2015 1-year PUMS) October 3-year PUMS (example: 2011-2013 3-year PUMS) Discontinued after 2013 5-year PUMS (example: 2011-2015 5-year PUMS) January Most documentation released one week prior to data 13

Modifications to Multiyear PUMS Multiyear PUMS have the same cases and geography as their component 1-year files How are multiyear PUMS different from single year? Weights are produced using latest population estimate “vintages” Coding schemes and dollar amounts are standardized Why use the multiyear PUMS files? For studying small groups, where more cases are needed When analysis is also making use of multiyear summary data 14

Outline Summary data vs. Microdata Fundamentals of PUMS Data Geography and the PUMS Accessing PUMS Data Documentation and Guidance

Limited Geographic Detail Geographic identifiers are region, division, state, PUMA PUMAs can be used to identify most cities of 100,000+ and many metropolitan areas, but not all Combinations of adjacent counties and census tracts within states Also, divisions of geo areas (counties/cities) PUMS is not designed for statistical analysis of small geographic areas

Public Use Microdata Area (PUMA) Defined after each census by the states in coordination with the Census Bureau’s Geography Division Redefined PUMAs for 2012 PUMS files Forthcoming multiyear files to have dual PUMA vintages Large enough to meet disclosure avoidance requirements An area of size 100,000 population or more To determine population, housing, or land ratio visit the Missouri State Data Center site PUMAs are identified by a five-digit number, unique within each state 17 17

Public Use Microdata Areas

PUMA Maps http://www.census.gov/geo/maps-data/maps/2010puma/st06_ca.html

PUMA Maps

2010 Census – PUMA Reference Map: Sacramento City (Central/Downtown & Midtown) 21 21 21

Outline Summary data vs. Microdata Fundamentals of PUMS Data Geography and the PUMS Accessing PUMS Data Documentation and Guidance

American FactFinder 23

American FactFinder (cont’d) 24

American FactFinder (cont’d) Main benefit of accessing PUMS via AFF: Convenient access if comfortable with AFF from regular use of summary tables

Census Bureau FTP Site

Census Bureau FTP Site (cont’d) Main benefit of accessing PUMS via FTP: Complete listing of files by year and state

DataFerrett 28

DataFerrett (cont’d) Main benefit of accessing PUMS via DF: Menu driven system doesn’t require knowledge of a stats package (i.e. SAS, SPSS, etc.) Ability to download variables individually 29

Powerful Tabulation Capabilities Simple table layout that supports: Flexible design Frequencies and trends Spreadsheet math for robust analysis Complex nesting Hide columns/rows Applies weighting variables Fast results using large datasets Save as HTML, PDF & JPEG

Highlight spreadsheet rows or columns to create: Data Visualization Highlight spreadsheet rows or columns to create: Maps Graphs

What We’re Working On Calculating variances on-the-fly for microdata tabulations Calculating margins of error for custom summations of aggregate data Integrating Google maps with DataFerrett thematic maps

Outline Summary data vs. Microdata Fundamentals of PUMS Data Geography and the PUMS Accessing PUMS Data Documentation and Guidance

PUMS Documentation Subjects in the PUMS Code Lists PUMS Top Coded and Bottom Coded Values PUMS Estimates for User Verification Accuracy of the PUMS http://www.census.gov/acs/www/data_documentation/pums_documentation/ 34

PUMS Guidance Compass Handbook on Using PUMS http://www.census.gov/acs/www/guidance_for_data_users/handbooks/ soup-to-nuts overview of getting and using the data Training PPT on Using PUMS http://www.census.gov/acs/www/guidance_for_data_users/training_presentations/ overview of PUMS basics

Exercise 1 In Placer County, how many foreign born individuals entered before 2000, between 2000 and 2009 and after 2010?

Exercise 1 – Nativity and Year of Entry Access: American Community Survey, 2015 1-Year Estimates PUMS Foreign Born and Year of Entry Variables Create a Recode for Year of Entry All PUMAS within Placer County Create a Table

Go to www.census.gov Type ‘DataFerrett’ in the Search Box

Click ‘TheDataWeb – DataFerrett’

Launch DataFerrett

CAUTION Do Not Navigate Away or Close This Window While DataFerret is Loading

Enter Your Email Address and Click ‘Ok’

Click ‘Get Data Now’

American Community Survey with PUMS and Other Datasets

Select American Community Survey Open Public Use Microdata Sample to view years Select 2015 Click View Variables (drop down)

Click ‘Selectable Geographies’ and ‘Population’ Click ‘Selectable Geographies’ and ‘Population’. Click ‘Search Variables’

Click on ‘Variable Label’ to Alphabetize Column

Select ‘Nativity’. Hold control button down and select ‘Year of Entry (YOEP)’. Click ‘Browse/Select Highlighted Variable’ (Blue Button).

Check the box next to ‘Select’ ACS Nativity’

Highlight ‘ACS YOEP’ Check the box next to ‘Select’ ACS YOEP Year of entry’ Click ‘OK’

You have added 2 variables for your DataBasket Click ‘OK’

Double Click to ‘Selectable Geographies’ Variable’ Click ‘Browse/Select Highlighted Variables’ (Blue Button)

Select ‘Public Use Microdata Area’ from ‘Types of Geographies Available’. Highlight the PUMA code in the Hierarchies section and click ‘Use Hierarchy’ Hierarchies

Double click ‘California’ from ‘Select State of current residence’ Double click ‘California’ from ‘Select State of current residence’. Highlight ‘California’ in middle box and click ‘Next Level’

Note: ALL PUMAs in California are Listed by County Double Click or Highlight and drag PUMA/s to box on far right. Click ‘Finish’

Note: There are 3 variables in DataBasket Click on ‘Step2: DataBasket/Download/Make A Table’

Highlight ‘Year of Entry’ variable Highlight ‘Year of Entry’ variable. Click ‘Recode Variable’ from right side of screen

Rename ‘Recode1’ to ‘Year of Entry Recode’

Highlight the categories from ‘1921 to 1999’ and click ‘Recode’ button below

Highlight all of the categories from ‘2000 to 2009’ and click ‘Recode’ button below

Note: there are three categories for the new recoded variable Note: there are three categories for the new recoded variable. Change the ‘Label’ Names by double clicking inside the cells. (Make sure to hit the Enter Key when completed).

Note: ‘Year of Entry Recode’ now listed Click ‘Make a Table’

Click ‘OK’

You Will Now Make A Nested Table Using the Variables

Drag the ‘Geog-101 PUMA’ to ‘C1,R2’

Drag ‘RECODE1 Year of Entry’ to ‘C2,R1’

Nest ‘Nativity’ variable by dropping it onto any of the ‘Year of Entry Labels’.

Click ‘GO Get Data’

From File, Click ‘Save As’

You Can Save to Your Desktop Save File as Text Document – Comma Delimited (Excel)

Exercise 2 In Sacramento County, what age group under 50 has a higher estimate of individuals with a disability?

Exercise 2 - Age and Disability Access: American Community Survey, 2015 1-Year Estimates PUMS Population with a Disability Create a Recode for Age Disaggregation All PUMAS within Sacramento County Create a Pivot Table

Go to the ‘Step1’ Tab and Click ‘Empty DataBasket’

Select American Community Survey Open Public Use Microdata Sample to view years Select 2015 Click View Variables (drop down)

Click ‘Selectable Geographies’ and ‘Population’ Click ‘Selectable Geographies’ and ‘Population’. Click ‘Search Variables’

Click ‘Variable Label’ to Alphabetize Column

Select ‘Age’. Hold control button down and select and ‘Disability Recode’. Click ‘Browse/Select Highlighted Variables’ (Blue Button)

Check the box next to ‘Select’ ACS AGEP’

Highlight ‘ACS Disability Recode’ Check the box next to ‘Select’ ACS DIS Disability Recode’ Click ‘OK’

You have added 2 variables for your DataBasket Click ‘OK’

Note: 2 Variables selected in Data Basket Double Click to ‘Select Geographies’ Variable Click ‘Browse/Select Highlighted Variables’ (Blue Button)

Select ‘Public Use Microdata Area’ from ‘Types of Geographies Available’. Highlight the PUMA code in the Hierarchies section and click ‘Use Hierarchy’ Hierarchies

Double click ‘California’ from ‘Select State of current residence’ Double click ‘California’ from ‘Select State of current residence’. Highlight ‘California’ in middle box and click ‘Next Level’

Note: ALL PUMAs in California are Listed by County Double Click or Highlight and drag PUMA/s to box on far right. Click ‘Finish’

Note: There are 3 variables in DataBasket Click on ‘Step2: DataBasket/Download/Make A Table’

Highlight ‘Age’ variable Highlight ‘Age’ variable. Click ‘Recode Variable’ from right side of screen

Rename ‘Recode1’ to ‘Age Recode’

Change Range to ‘1 through 17’ and Click ‘Recode’ 2 Change Range to ‘18 through 19’ and Click ‘Recode’ 3 Change Range to ‘20 through 24’ and Click ‘Recode’

Change the rest of the age groups and recode

Note: there are nine categories for the new recoded variable Note: there are nine categories for the new recoded variable. Change the ‘Label’ Names by double clicking inside the cell. (Make sure to hit the Enter Key when completed).

Note: ‘Age Recode’ now listed Click ‘Make a Table’

Click ‘OK’

You Will Now Make A Nested Table Using the Variables

Making a Pivot Table 1. Drag and Drop “Recode 1 Age” to C1, R2 2 . Drag and Drop “GEOG-101 to C2, R1 3. Drag and Drop “Disability” above R1

Click ‘GO Get Data’

From ‘File’ drop-down, Click ‘Save As’

Save Your Table

Exercise 3 For each race group, which age group has the highest estimated number of males and females?

Exercise 3 – Sex by Race and Age Accessing: American Community Survey, 2015 1-Year Estimates PUMS Add more variables Create a Race, Sex and Age Disaggregation All PUMAS within Sacramento County Create a Table, Chart and Map

Close Table and Click ‘Step1’ Tab

Select American Community Survey Open Public Use Microdata Sample to view years Select 2015 Click View Variables (drop down)

Click ‘Population’. Click ‘Search Variables’

Click on ‘Variable Label’ to Alphabetize Column

Select ‘RAC1P-Recoded Detailed Race Code’ Select ‘RAC1P-Recoded Detailed Race Code’. Hold control button down and select and ‘Sex’. Click ‘Browse/Select Highlighted Variables’ (Blue Button)

1. Click ‘Select ALL Variables’ 2. Click ‘OK’ 3. Confirm that you have modified 2 Variables by Clicking ‘OK’

Click ‘Step2’ Tab and Click ‘Make a Table’

1. Drag and Drop “RAC1P” to C1, R2 2 . Drag and Drop “GEOG-101 to C2, R1 3. Drag and Drop “Recode1 Age” to C1, R2 (On top of ‘Total RAC1P’)

Click ‘GO Get Data’

To Create a Bar Chart or Map, Change the Variable Label and Highlight the estimates in that row, Click the Chart or Map Icons

Resources: Need Assistance? Data Dissemination Branch Customer Liaison and Marketing Services Office U.S. Census Bureau (844) ASK-DATA Toll Free Census.askdata@census.gov Luz.M.Castillo@census.gov Cell: 818-515-3748 112 112