1 Updated Unified Category System for 1960-2000 Census Occupations Peter B. Meyer US Bureau of Labor Statistics (but none of this represents official measurement.

Slides:



Advertisements
Similar presentations
Test process essentials Riitta Viitamäki,
Advertisements

The Microdata Analysis System (MAS): A Tool for Data Dissemination Disclaimer: The views expressed are those of the authors and not necessarily those of.
“... providing timely, accurate, and useful statistics in service to U.S. agriculture.” Using Mixed Methods to Evaluate Survey Questionnaires Heather Ridolfo,
The Uninsured: Policy and Data Issues Michael J. O’Grady, Ph.D. Assistant Secretary for Planning and Evaluation US Department of Health and Human Services.
An Overview of China ’ s Industrial Statistical Methodology Xiaohui WANG Department of Industrial and Transport Statistics National Bureau of Statistics,
Workforce Development Region 4 Tonya Lee Statistician Supervisor Alabama Department of Labor Labor Market Information Division January 24, 2014 New Labor.
Trends in Industry and Occupational Data Workforce Development Council Nov 2012 Mathew Barewicz Vermont Department of Labor Economic & Labor Market Information.
Measures of Income, Poverty and Health Insurance Wesley Basel, U.S. Census Bureau Presented at the Walter Cronkite School of Journalism June 17, :00.
Bridging the Gaps: Dealing with Major Survey Changes in Data Set Harmonization Joint Statistical Meetings Minneapolis, MN August 9, 2005 Presented by:
Online Industry Market Research Presented by Janet Harrah, Director Center for Economic Development & Business Research, Wichita State University.
Labor Statistics in the United States Grace York March 2004.
Health Insurance Coverage of California’s Working Latinos Howard Greenwald Suzanne O'Keefe Mark DiCamillo University of Southern California California.
1 Surveying Market Pay Just as skill, effort, responsibility, and working conditions have been used to differentiate rates of pay among major job groupings,
Bureau of Labor Statistics Green Jobs Initiative Dixie Sommers Bureau of Labor Statistics Association of Public Data Users 2010 Annual Conference September.
UNM BBER Data Users Conference November 13 th, 2014 NMDWS Occupation Projections and Labor Market Information.
Employment Projections Overview
1 Updated Unified Category System for Census Occupations Peter B. Meyer, OPT Brown Bag seminar, Oct 25, 2006 Outline 1. Tentative standard categories.
USE OF LITHUANIAN CLASSIFICATION OF OCCUPATIONS ISCO 88, ISCO 2008 and the Development of the ESeC Regional Meeting, Oslo, 7 June 2005 Violeta Skamarociene.
Trends in the Manufacturing Sector in Massachusetts Sharing Skills ~ Building Connection Conference, March 12, 2008 Rebekah Lashman Jonathan Latner Navjeet.
RESEARCHING CAREERS By David Agnew Arkansas State University.
Employment Outlook:  National employment projections  State projections
政府統計處 Census and Statistics Department Introduction to Statistical Work.
1 Session 143: Evaluating occupation classifications  Slight changes to the program  Chair: Peter B Meyer, US Bureau of Labor Statistics  Speakers:
The Main Idea Throughout the years, the U.S. economic system has changed. Each change affected what was produced and how people were employed. To gauge.
12th Meeting of the Group of Experts on Business Registers
Brooke Helppie McFall, Amanda Sonnega and Robert J. Willis University of Michigan Retirement Research Consortium Conference August 6-7,
Employment, Income and Population Change in Curry County May 6, 2009 Mallory Rahe Extension Community Economist Oregon State University.
Using IPUMS.org Katie Genadek Minnesota Population Center University of Minnesota The IPUMS projects are funded by the National Science.
UNM BBER Data Users Conference November 14 th, 2013 NMDWS Industry & Occupation Projections.
DATA MINING FINAL REPORT Vipin Saini M 許博淞 M 陳昀志 M
Northwest Connecticut CEDS Region Demographic & Economic Analyses May 2012.
Technological uncertainty and superstardom: two sources of inequality within occupations Peter B. Meyer Office of Productivity and Technology, U.S. Bureau.
1 Recent occupation concepts applied to historical U.S. Census data Peter B. Meyer US Bureau of Labor Statistics (but none of this represents official.
Georgia: business register data and gender-disaggregated indicators Tengiz Tsekvava Technical Meeting on Measuring Entrepreneurship from Gender Perspective.
1 Recent occupation concepts applied to historical U.S. Census data Peter B. Meyer US Bureau of Labor Statistics (but none of this represents official.
ECONOMIC & LABOR MARKET CHARACTERISTICS & PROJECTIONS Pam Bodwell EVT 7066 Foundations of Career & Workforce Education University of South Florida October,
The Quantity and Quality of Leisure: How Occupational Status Shapes the Free Time Experience of Dual Working Parents Melissa Buckmiller- University of.
Using External Secondary Data Chapter 7. Standardized Marketing Information Services Commercial sources of secondary data The data are usually collected,
The Main Idea Throughout the years, the U.S. economic system has changed. Each change affected what was produced and how people were employed. To gauge.
Technological uncertainty and superstardom: two sources of inequality within occupations Peter B. Meyer Office of Productivity and Technology, U.S. Bureau.
How to find screen, hire & retain superstar employees How to Find and Hire Superstars in Santa Barbara’s Tight Employment Market 111 East Arrellaga Street,
Use of Administrative Data Seminar on Developing a Programme on Integrated Statistics in support of the Implementation of the SNA for CARICOM countries.
Data Analysis.
Editing of linked micro files for statistics and research.
Kansas Labor Market Analysis Presenter: Dorothy D. Stites Director, Labor Market Information Services (LMIS) Kansas Department of Labor July 8, 2005 Labor.
Statistical Expertise for Sound Decision Making Quality Assurance for Census Data Processing Jean-Michel Durr 28/1/20111Fourth meeting of the TCG - Lubjana.
The Main Idea Throughout the years, the U.S. economic system has changed. Each change affected what was produced and how people were employed. To gauge.
Web Designer Nick Fain 5/8/14 1 st Hour Career Tech.
INFO 4470/ILRLE 4470 Visualization Tools and Data Quality John M. Abowd and Lars Vilhuber March 16, 2011.
Australia’s ICT Recruitment Sector Employment Market Update Q4, 2015 Beacham Group Summary.
13-Jul-07 State of the art of the ISCO-08 implementation.
1 Economically Active Population Survey Dong-Wook JEONG Employment Statistics Div. Statistics Korea.
12 Automatic enrolment myths & truths This guide to automatic myths and truths contains some of the misconceptions we have come across when speaking to.
Recent occupation concepts applied to historical U.S. Census data
RSA Insight Report: Supporting Slides
Accounting courses in Chandigarh. Computer & Accounting courses is designed to teach you the accounting and computer skills employers are looking for.
Imputation in UNECE Statistical Databases: Principles and Practices
RSA Insight Report: Supporting Slides
RSA Insight Report: Supporting Slides
RSA Insight Report: Supporting Slides
Transition Interview Statistics
Bureau of Transportation Statistics
RSA Insight Report: Supporting Slides
RSA Insight Report: Supporting Slides
RSA Insight Report: Supporting Slides
RSA Insight Report: Supporting Slides
RSA Insight Report: Supporting Slides
RSA Insight Report: Supporting Slides
Updated Unified Category System for Census Occupations
RSA Insight Report: Supporting Slides
Presentation transcript:

1 Updated Unified Category System for Census Occupations Peter B. Meyer US Bureau of Labor Statistics (but none of this represents official measurement or policy) SSHA 2006, Minneapolis; Nov 4, 2006 Outline 1. Tentative standard categories 2. Users and bug fixes 3. How Census assigns occupation codes 4. Imputation practice

2 Census Occupational Classifications U.S. Bureau of Census determines a list of 3 digit occupation codes each ten years Then puts one for employed respondents to the decennial Census and some other surveys Vast data is available in these categories:  CPS, ATUS, SIPP, NLS, ACS, decennial Census But not always consistently over long time spans Research efforts may require some standard

3 Tradeoffs in Classification Systems Precise job distinctions vs. Consistency, duration, and sample size  High tech occupations vs. other technical occupations blacksmith, database admin (shorter, more precise series) electrical engineer (longer evolving series )  “Superstars” jobs like athletes and musicians (need precision)  Licensed jobs (need long comparable occupations) Conformity with other data Avoid “sparseness” – many missing year-occ cells Meaning of occupation: function, tasks, skills, background, social class There is no perfect classification but there are tools & criteria for better ones.

4 Baselines to improve on IPUMS defined occ1950 for US workers recorded in ANY Census Working paper (Meyer and Osborne, 2005) defined classification of digit occupations codes from 1960 to present It was adapted from the 500+ categories in 1990 Census:  379 categories have same name or almost same as 1990  125 were eliminated to help harmonize with other years (Example to follow)  19 categories have expanded (changed name or a not- elsewhere-classified category was given more scope)  3 categories added for 1960 data which doesn’t fit in

5 Some distinctions are lost in standardization 1970 code 1970 occupation title code 1980/90 component category titles Civilian Labor Force % of 1970 category 284 Sales workers, except clerks, retail trade 263 Sales workers, motor vehicles and boats 185, % 266 Sales workers, furniture and home furnishings 98, % 267 Sales workers; radio, television, hi fi, and appliances 76, % 268 Sales workers, hardware and building supplies 81, % 269 Sales workers, parts 39, % 274 Sales workers, other commodities 16, % 277 Street and door to door sales workers 2, % Census reports and IPUMS data show how many respondents would be coded in each of two classification systems.

6 User input and new data since 2005 Sent these programs to 19 people who expressed interest  Open-source code idea (helps find errors; also is public property) Corrections from users did come in  Philip Cohen, UNC Sociology, identified some problems/mistakes.  Sarah Porter, research assistant at U of Iowa working with Jennifer Glass, wrote a program to do some similar mappings. Comparing to that program I found mistakes in mine.  Dual-coded 1990/2000 data sets highlighted some surprises Experimented with imputations (example to follow) Visited the Census office where they assign these codes.

7 Census Bureau's National Processing Center in Jeffersonville, IN  Louisville, KY, is just south of it I interviewed four specialists who assign occupation & industry codes.

8 Information used when coding “what kind of work" “most important activities or duties" employer name “what kind of industry” city and state ("PSU") of respondent's home industry type (manufacturing, service, other) years of education, age, sex not income, although it was available before Jan '94 software. Tens of thousands of job titles are mapped to a code in a reference book they have, if industry also matches. Some cases may be "autocoded" by software and coder checks After coding, public use samples have 3-digit occupation code and 3-digit industry code Quality of assignments from public use samples are limited

9 Imputation: Statisticians and Actuaries Counts of Actuaries and Statisticians in Census Sample Actuaries Statisticians These were separate categories in and after 1970 But in 1960 they were all in “statisticians and actuaries” When standardizing (2005) they were put in “statisticians” Will try to infer which of the 1960 people were actuaries.

10 Statisticians and Actuaries Pooled all statisticians and actuaries Good predictors of whether respondent is an actuary:  Recorded in a later year  Employed in insurance, accounting/auditing, or professional services industries  Employed in private sector  High salary income  High business income, or to earn mostly business income  Is employed  Lives in Connecticut, Minnesota, Nebraska, or Wisconsin Ran many logistic regressions predicting the actuaries

11 Statisticians and Actuaries For 1970 data that logistic regression predicts occupation right 88% of the time Revised counts of actuaries and statisticians after imputation Actuaries Statisticians

12  More accurate standardized “statistician” category  Longer actuary time series  Reduces sparseness – empty cells  Builds a technique for this data mining  Benefits scale up through IPUMS Statisticians and Actuaries Why work this arcane problem?

13 Imputing judges In 1960 Census, lawyers and judges were one category Later, they’re separate, and separate in “standard” system Without more info, we categorize all in 1960 as “lawyers”. We wish to impute which ones are judges Useful fact: private sector ones were all called lawyers Predictors for the public sector ones, of who’s a judge:  Older  Employed in state government  High salary income  Low business income  Educated less than 16 years  Employed at time of survey

14 Logit regression predicting judges in Census CoefficientStd errorp-value Year Age Age-squared Federal government employee State government employee Ln(salary) Ln(salary) squared Ln(salary) cubed Ln(business income) Fraction of earned income that is business income Education less than 16 years Years of formal education Is employed at time of survey Constant Dependent variable: maximum likelihood probability this individual is a judge.

15 Thus we assign judge occupation code gen logitindex = * year * age * age * age * indfed * indstate * lnwage * lnwage * lnwage * lnwage * lnwage * lnwage * lnbus * busfrac * (educyrs<16) * educyrs * employed /* constant */ ; … gen logitval=exp(logitindex)/(1.0+exp(logitindex)) replace logitval=.0001 if !govtemployee /* this is a perfect predictor */ replace logitval=.0001 if !indfed & !indstate & !indlocal /* this too */ gen assigned = logitval>.46 /* Now ‘assigned’ has a 1 for imputed judges */ Threshhold probability is chosen to match the number of judges expected to be there, based on annual trend. Can get 83% accurate predictions from such a rule on 1970 data. This mis-assigns a few who should have stayed lawyers.

16 Newly Imputed Judges Lawyers Judges Respondents in Census samples after imputation

17 What's next? Use dual-coded CPS datasets with 1990 and 2000 codes to make a few more imputations Keep listening, seek more help, make it better. Publish variable at IPUMS.org Keep going? 1970 & 1980 dual coded data sets exist.

18 Industry and occupation coding Industry codes and occupations codes are assigned by the same group of people, at the same time for each respondent. Industry is almost always decided first. The people who do that are “coders” Procedures are carefully documented I wasn’t a “sworn” Census agent and couldn’t see it done, live

19 Desirable Attributes of a Classification For each occupation, well-behaved time- series of:  mean wage  wage variance  fraction of the population SPARSENESS New criterion: SPARSENESS  One prefers a classification not be sparse, meaning it does not have many empty occ-year cells

20 What new information would help referralists? Information about a job title Information about employer's city and state  not respondent’s But asking more questions would extend the CPS interview

21 Problems faced by referralists Too little information from respondent “Computer work" for “kind of work” Exaggeration (example: dot com businesses) Ambiguities:  "water company" for industry or employer  "surveyor" occupation  "boot" vs "boat" in handwriting Having to hurry Referralists confer with each other routinely, but sometimes make different choices from one another YES. Does technological change go along with occupational ambiguity? YES. Problems with computer work, biotech. Still no nanotech in classification.

22 The information coders have

23 Who's Doing the Coding There were about 12 coders and 14 referralists in October 2006 Referralistscoders 9+ Referralists have been coders before and usually have 9+ years of experience referralists I interviewed three referralists, and a supervisor The ones I met handled referrals from several surveys:  CPS, ATUS, SIPP, NLS, ACS  others on contract  All these use decennial Census occupation codes They DON’T handle the decennial Censuses

24 Information available to referralist Can match Employer name to a known employer from their Employer Name List (ENL), same as SSEL or Business Registry. Can look on the web for that employer Can study “little red book” - SOC manual or (less often) giant Dict Occ Titles 1991 or, I’m told, look up employer in Dun and Broadstreet data They try to make a coherent choice for industry and occupation together.

25 “Coders”“Referralists” “Coders” and “Referralists” Coders Coders follow carefully documented procedures from the Census headquarters in Suitland, MD Coders Coders with two years of experience are expected to assign 94 codes an hour, with 95% accuracy (which is checked) “Referralist" (aka statistical assistant) If there is not enough information to assign industry and occupation codes by procedure, the case is forwarded electronically ("referred") to a “Referralist" (aka statistical assistant)