Presentation is loading. Please wait.

Presentation is loading. Please wait.

1 Updated Unified Category System for 1960-2000 Census Occupations Peter B. Meyer, OPT Brown Bag seminar, Oct 25, 2006 Outline 1. Tentative standard categories.

Similar presentations


Presentation on theme: "1 Updated Unified Category System for 1960-2000 Census Occupations Peter B. Meyer, OPT Brown Bag seminar, Oct 25, 2006 Outline 1. Tentative standard categories."— Presentation transcript:

1 1 Updated Unified Category System for 1960-2000 Census Occupations Peter B. Meyer, OPT Brown Bag seminar, Oct 25, 2006 Outline 1. Tentative standard categories 2. Users and bug fixes 3. How Census assigns occupation codes 4. Imputation practice

2 2 Census Occupational Classifications  1960 system from 1968-1970  1970 system from 1971-1982  1980 system from 1983-1991  1990 system from 1992-2002  2000 system from 2003- present Census Bureau staff assign 3-digit occupations codes to respondents of decennial Census The list of codes changes every Census Current Population Survey (CPS) uses these codes: Vast data is available in these categories But time series don’t cover the whole period

3 3 Tradeoffs in Classification Systems Duration vs. accuracy, precision  blacksmith, database admin (short precise series)  electrical engineer (long evolving series) Number of occupations vs. sample size of each  Narrow distinctions may be of interest Dental technicians High tech occupations vs. other technical occupations Licensed jobs Conformity with other data

4 4 Desirable Attributes of a Classification For each occupation, well-behaved time- series of:  mean wage  wage variance  fraction of the population SPARSENESS New criterion: SPARSENESS  One prefers a classification not be sparse, meaning it does not have many empty occ-year cells

5 5 Classification Current Phase Earlier working paper (Meyer and Osborne, 2005) defines a unified classification for Census & CPS 3-digit occupation codes from 1960 to present It was adapted from the 500+ categories in 1990 Census: 379 categories have same name or almost same as 1990 125 were eliminated to help harmonize with other years  Example to follow 19 categories have expanded (changed name or n.e.c. category) 3 categories added for 1960 data which doesn’t fit in

6 6 Hard case; combined category here 1970 code 1970 occupation Title 1980 code 1980 component categories and codes Civilian Labor Force % of 1970 category 284 Sales workers, except clerks, retail trade 263 Sales workers, motor vehicles and boats 185,160 37.06% 266 Sales workers, furniture and home furnishings 98,941 19.80% 267 Sales workers; radio, television, hi fi, and appliances 76,674 15.35% 268 Sales workers, hardware and building supplies 81,668 16.35% 269Sales workers, parts 39,120 7.83% 274 Sales workers, other commodities 16,008 3.20% 277 Street and door to door sales workers 2,082 0.42%

7 7 Input from users and new data Corrections from users

8 8 The information coders have

9 9

10 10 Statisticians and Actuaries Counts of Actuaries and Statisticians in Census Sample 1960197019801990 Actuaries.45129182 Statisticians 199237352338 Separate categories in and after 1970 In 1960 they were all in “statisticians and actuaries” When standardizing we put all these in “statisticians” Now we try to infer which people in this population were actuaries

11 11 Statisticians and Actuaries Pooled all 1970-1990 statisticians and actuaries Ran many logistic regressions predicting the actuaries Good predictors of whether respondent is an actuary  Recorded in a later year  Employed in insurance, accounting/auditing, or professional services  Employed in private sector  High salary income  High business income, or to earn mostly business income  Is employed  Lives in Connecticut, Minnesota, Nebraska, or Wisconsin

12 12 Statisticians and Actuaries For 1970-1990 a logistic regression can predict occupation right 88% of the time Impute a prediction on 1960 data Revised counts of actuaries and statisticians after imputation 1960197019801990 Actuaries 29 45129182 Statisticians 170 237352338

13 13  More accurate statistician category, by later definition  Longer time series for actuaries sparseness  Reduces sparseness  Builds a technique Statisticians and Actuaries Why work this arcane problem?

14 14 Lawyers and Judges Combine all 1970-1990 lawyers and judges Exclude all private sector employees because they are all lawyers (By definition? ) In the remainder, predictors of judge, not lawyer: (judge is 1, lawyer is 0 in the next slide)  Older  Employed in state government  High salary income; low or no business income  Educated less than 16 years  Employed at time of survey Can get 83% accurate predictions from such a regression

15 15 Logit Regression on 1970-1990 Census Sample CoefficientStd errorp-value Year -0.0050.0110.633 Age 0.1550.0330.000 Age-squared -0.0010.0000.040 Federal government employee -1.4400.1370.000 State government 0.4990.2630.058 Ln(salary) -1.7953.0940.562 Ln(salary) squared 0.0520.3330.877 Ln(salary) cubed 0.0030.0120.798 Ln(business income) -0.0410.0360.261 Fraction of earned income that is business income -0.7141.0530.498 Education less than 16 years 2.2350.3200.000 Years of formal education -0.0440.0460.336 Is employed at time of survey 0.2240.2410.352 Constant 13.01723.4280.578

16 16 Can Use Those Coefficients in Stata gen logitindex = -.0046652 * year +.1549193 * age -.0006942 * age * age -1.4405086* indfed +.4986729 * indstate -1.795481 * lnwage +.0517015 * lnwage * lnwage +.0030016 * lnwage * lnwage * lnwage -.040749 * lnbus -.7140285 * busfrac +2.234934 * (educyrs<16) -.0442429 * educyrs +.2239105 * employed +13.0172 /* constant */ ; … gen logitval=exp(logitindex)/(1.0+exp(logitindex)) replace logitval=.0001 if !govtemployee /* this is a perfect predictor */ replace logitval=.0001 if !indfed & !indstate & !indlocal /* this too */ gen assigned = logitval>.46 /* Now ‘assigned’ has a 1 for imputed judges */

17 17 Newly Imputed Judges 1960197019801990 Lawyers1971257050827603 Judges82123298331 Respondents in Census samples after imputation

18 18 Preliminary Findings There are opportunities to impute occupations occasionally with reasonable accuracy The resulting records have “better-classified” occupations  slightly more accurate (in four categories)  Slightly less sparse (293 empty cells not 295) Effects in a substantive regression not focused on these categories is tiny (What does it mean?)

19 19 Census Bureau's National Processing Center in Jeffersonville, IN

20 20 Who's Doing the Coding “Coders”“Referralists” There are “Coders” and “Referralists” Coders Coders follow carefully documented procedures, most likely from the Census National Headquarters in Suitland, MD In many cases there is not enough information to assign industry and occupation codes “Referralist" Such unresolved cases are forwarded electronically ("referred") to a “Referralist" Coders Coders with two years of experience are expected to produce 94 code assignments an hour, with 95% accuracy (codes are checked)

21 21 Who's Doing the Coding There were about 12 coders and 14 referralists in October 2006 Referralistscoders 9+ Referralists have been coders before and usually have 9+ years of experience referralists coders I interviewed three referralists, and a supervisor, but no coders during my October 2006 visit The ones I met handled referrals from several surveys:  CPS, ATUS, SIPP, NLS, ACS, and others on contract All of the above surveys use decennial Census occupation codes Industry and occupational codes for Decennial Censuses are assigned by other employees, not the ones who permanently work in Jeffersonville now (???)

22 22 Information Available to a Coder "kind of work" "principal duties" employer name city and state ("PSU") of respondent's home (not workplace) industry, already coded industry type (manufacturing, service, other) years of education, age, sex not income, although it was available before Jan '94 software. The industry is normally coded before the occupation. Referralist can match Employer name to a known employer from the Employer Name List (ENL), possibly the same as SSEL Some cases are "autocoded" before coder sees

23 23 Problems and Problematic Cases “Computer work" for occupation (???) Too little information from respondent Exaggeration (example: dot com businesses) Ambiguities:  "water company" for industry or employer  "surveyor" occupation  "boot" vs "boat" in handwriting  hurrying Referralists confer with each other routinely, but sometimes make different choices from one another YES. Does technological change go along with occupational ambiguity? YES. Problems with computer work, biotech. Still no nanotech in classification.

24 24 What Would Improve Their Coding Accuracy or Speed? Information about a job title Information about employer's city and state [show CPS 1993 questions] (???) But! Asking more questions would extend the interview Retrieved from "http://econterms.net/pbmeyer/research/occs/w iki/index.php?title=Brown_bag_Oct_25"http://econterms.net/pbmeyer/research/occs/w iki/index.php?title=Brown_bag_Oct_25

25 25 Questions for Occupational Time Series Hypotheses for time series of consistently-defined occupations:  Have high tech jobs had rising earnings inequality? [yes]  Superstars effect? [yes]  Is nurturing work valued less (England et al)?  Have mathematical occupations grown in size or pay?  Measuring payoffs to skills  Have better job-search technologies reduced inequality within job categories? (as predicted by Stigler (1960) Researchers sometimes use only industry, not occupation, or limit time span of study to keep consistent occupation

26 26 "What's Next?" Make next working paper and program code available Publish at IPUMS Accumulate more classification systems, techniques, criteria, and expert opinions New wiki of all classifications

27 27 Thank You.

28 28 Worker’s tasks Worker’s function (identified e.g. by inputs and outputs)  example: blacksmiths vs forging machine operators  example: teachers of different subjects and ages of students Sometimes other distinctions  Hierarchically (apprentices, foremen, supervisors)  Certification  Skills  Industry (activity of the employing organization) To some extent these are separate labor markets, with separated job search, wage setting, unemployment experiences. Meaning of Occupation Tasks Inputs Outputs

29 29 Occupation Attributes I Strength (1-5 from DOT) Reasoning (1-6 from DOT) Mathematical reasoning (1-6 from DOT) Language use (1-6 from DOT) Duration of specific training (from DOT) Nurturing (0/1) (England et al, 1994) many others, potentially

30 30 Occupation Attributes II % urban (e.g. doctor in rural area) often involves traveling (or required mobility earlier) rate of growth % of immigrants authority (0/1) (England et al, 1994) high tech regulated unionized use of machines involves advocacy; or repair; or negotiation


Download ppt "1 Updated Unified Category System for 1960-2000 Census Occupations Peter B. Meyer, OPT Brown Bag seminar, Oct 25, 2006 Outline 1. Tentative standard categories."

Similar presentations


Ads by Google