Presentation is loading. Please wait.

Presentation is loading. Please wait.

1 Session 143: Evaluating occupation classifications  Slight changes to the program  Chair: Peter B Meyer, US Bureau of Labor Statistics  Speakers:

Similar presentations


Presentation on theme: "1 Session 143: Evaluating occupation classifications  Slight changes to the program  Chair: Peter B Meyer, US Bureau of Labor Statistics  Speakers:"— Presentation transcript:

1 1 Session 143: Evaluating occupation classifications  Slight changes to the program  Chair: Peter B Meyer, US Bureau of Labor Statistics  Speakers: Meyer Levenson discussant: Kevin Cahill, Analysis Group Glass discussant: Meyer

2 2 Updated unified category system for 1960-2000 Census occupations Peter B. Meyer Office of Productivity and Technology, U.S. Bureau of Labor Statistics Western Economic Association meetings San Diego July 2, 2006 Nothing here is an official measurement, finding, view, or policy of the US Dept of Labor

3 3 Census occupational classifications  Census Bureau staff assign occupations (3-digit codes) to decennial Census respondents  But the list of 3 digit occ codes changes every Census  Current Population Survey (CPS) uses these categories 1960 system from 1968-1970 1970 system from 1971-1982 1980 system from 1983-1991 1990 system from 1992-2002 2000 system from 2003-present  Vast data is available in these categories  Researchers do assign, with error, records from one classification system into another.

4 4 Tradeoffs in classification systems  Duration of category vs. accuracy and precision of category E.g. blacksmith, electrical engineer  Sample sizes of each occupation vs. number of comparable occupations standard deviation of incomes within occ requires more cases But narrow distinctions may be the ones of interest  English professors vs. other humanities professors  High tech occupations vs. other technical/mathematical occupations  Conformity with many other sources of data Past Censuses, CPS, NLSY, DOT, O*NET, ISCO, HISCO So there is no one best classification Often (I think) poor fitting ones are used. Researchers may need tools to make them from existing data

5 5 Long term project goals and strategy  Create classification for decennial Census occupation categories since 1960, with an interest in high tech occupations  Assigning occupation to 1990 classifications other years by similar titles or function supporting tables from Census and IPUMS  IPUMS variable occ1950 Changing the 1990 classification if it seemed useful  Test it and compare it to alternatives  Use it, share it  Accumulate resources, techniques, code, contacts. Listen. Census Bureau and BLS experts IPUMS National Crosswalk Center Dictionary of Occupation Titles ETA / O*NET experts

6 6 Example: difficult category 1970 code 1970 occupation Title 1980 code 1980 component categories and codes Civilian Labor Force % of 1970 category 284 Sales workers, except clerks, retail trade 263 Sales workers, motor vehicles and boats 185,16037.06% 266 Sales workers, furniture and home furnishings 98,94119.80% 267 Sales workers; radio, television, hi fi, and appliances 76,67415.35% 268 Sales workers, hardware and building supplies 81,66816.35% 269Sales workers, parts39,1207.83% 274Sales workers, other commodities16,0083.20% 277Street and door to door sales workers2,0820.42% We invented a category, “salespersons not elsewhere classified” to hold all of the 284s from 1970

7 7 Desirable attributes of a classification (1) Mean wage within groups should not jump far between periods (2) Wage variance should not jump over time (3) Fraction of the population in this occupation should not jump around between time periods  Usually these signal a categorization problem not a real-world phenomenon.  We measured these. New criterion: One prefers a classification not be sparse, meaning it does not have many empty occ-year cells

8 8 Statisticians and actuaries Counts of actuaries and statisticians in Census sample 1960197019801990 Actuaries.45129182 Statisticians 199237352338 These are separate categories in and after 1970 In 1960 they were all in “statisticians and actuaries” When standardizing we put all these in “statisticians” Now we try to infer which people in this population were actuaries.

9 9 Statisticians and actuaries Combined all 1970-1990 statisticians and actuaries Predictors of actuary not statistician: Recorded in a later year Employed in insurance, accounting/auditing, professional services In private sector High salary income High business income, or to earn mostly business income Employed at time of Census Lives in Connecticut, Minnesota, Nebraska, or Wisconsin

10 10 Actuaries and statisticians  For 1970-1990 a logistic regression can predict occupation right 88% of the time  Impute a prediction on 1960 data Revised counts of actuaries and statisticians after imputation 1960197019801990 Actuaries 2945129182 Statisticians 170237352338  Why work this arcane problem? More accurate statistician category, by later definition Longer time series for actuaries Reduces sparseness Builds a technique

11 11 Lawyers and judges Combine all 1970-1990 lawyers and judges Exclude all private sector employees because they are all lawyers In the remainder, predictors of judge not lawyer: Older Works for state government High salary income Low or no business income Educated less than 16 years Is employed at time of survey Can get 83% accurate predictions from such a regression

12 12 Logit regression on 1970-1990 Census sample Dependent variable is 1 for judges, 0 for lawyers CoefficientStd errorp-value Year -0.005 0.0110.633 Age 0.155 0.0330.000 Age-squared -0.001 0.0000.040 Federal government employee -1.440 0.1370.000 State government 0.499 0.2630.058 Ln(salary) -1.795 3.0940.562 Ln(salary) squared 0.052 0.3330.877 Ln(salary) cubed 0.003 0.0120.798 Ln(business income) -0.041 0.0360.261 Fraction of earned income that is business income -0.714 1.0530.498 Education less than 16 years 2.235 0.3200.000 Years of formal education -0.044 0.0460.336 Is employed at time of survey 0.224 0.2410.352 Constant 13.017 23.4280.578

13 13 Can use those coefficients in Stata  gen logitindex = -.0046652 * year  +.1549193 * age  -.0006942 * age * age  -1.4405086* indfed  +.4986729 * indstate  -1.795481 * lnwage  +.0517015 * lnwage * lnwage  +.0030016 * lnwage * lnwage * lnwage  -.040749 * lnbus  -.7140285 * busfrac  +2.234934 * (educyrs<16)  -.0442429 * educyrs  +.2239105 * employed  +13.0172 /* constant */  ;  #delimit cr /* back to carriage return as statement delimiter not ; */  gen logitval=exp(logitindex)/(1.0+exp(logitindex))  replace logitval=.0001 if !govtemployee /* this is a perfect predictor */  replace logitval=.0001 if !indfed & !indstate & !indlocal /* this too */  gen assigned = logitval>.46  /* Now ‘assigned’ has a 1 for imputed judges */

14 14 Newly imputed judges 1960197019801990 Lawyers1971257050827603 Judges82123298331 Respondents in Census samples after imputation

15 15 Research questions for occupational time series Hypotheses for time series of consistently-defined occupations: Have high tech jobs had rising earnings inequality? [yes] Superstars effect? [yes] Is nurturing work valued less (England et al)? Have mathematical occupations grown in size or pay? Measuring payoffs to skills Have better job-search technologies reduced inequality within job categories? (as predicted by Stigler (1960)  Researchers sometimes use only industry, not occupation, or limit time span of study to keep consistent occupation

16 16 Preliminary findings  There are opportunities to impute occupations occasionally with reasonable accuracy  The resulting records have “better-classified” occupations slightly more accurate (in four categories) Slightly less sparse (293 empty cells not 295)  Effects in a substantive regression not focused on these categories is tiny

17 17 What’s next?  Combine smallest occupations  Split farmers into fewer categories  In imputations, incorporate more information from Dual-coded CPS for 2000-2002 Dual-coded “Treiman” sample from 1970-1980  Visit Census categorizers in Jeffersonville, Indiana.  Make next working paper and program code available  Publish at IPUMS  Accumulate more classification systems, techniques, criteria, and experts. New wiki of all classifications.

18 18  Worker’s tasks  Worker’s function (identified e.g. by inputs and outputs) example: blacksmiths vs forging machine operators example: teachers of different subjects and ages of students  Sometimes other distinctions Hierarchically (apprentices, foremen, supervisors) Certification Skills Industry (activity of the employing organization)  To some extent these are separate labor markets, with separated job search, wage setting, unemployment experiences. Meaning of occupation Tasks InputsOutputs

19 19 Occupation attributes I  Strength (1-5 from DOT)  Reasoning (1-6 from DOT)  Mathematical reasoning (1-6 from DOT)  Language use (1-6 from DOT)  Duration of specific training (from DOT)  Nurturing (0/1) (England et al, 1994)  many others, potentially

20 20 Occupation attributes II  % urban (e.g. doctor in rural area)  often involves traveling (or required mobility earlier)  rate of growth  % of immigrants  authority (0/1) (England et al, 1994)  high tech  regulated  unionized  use of machines  involves advocacy; or repair; or negotiation


Download ppt "1 Session 143: Evaluating occupation classifications  Slight changes to the program  Chair: Peter B Meyer, US Bureau of Labor Statistics  Speakers:"

Similar presentations


Ads by Google