Presentation is loading. Please wait.

Presentation is loading. Please wait.

Design and Use of the IPUMS-International Data Serieshttp://international.ipums.org Matt Sobek Minnesota Population Center

Similar presentations


Presentation on theme: "Design and Use of the IPUMS-International Data Serieshttp://international.ipums.org Matt Sobek Minnesota Population Center"— Presentation transcript:

1 Design and Use of the IPUMS-International Data Serieshttp://international.ipums.org Matt Sobek Minnesota Population Center sobek@pop.umn.edu

2 Overview Processing Dissemination system Strengths and limitations Users Summation IPUMS-International

3 END Matt Sobek Minnesota Population Center sobek@pop.umn.edu

4 Census data samples What is IPUMS-International? Integrated Public Use Microdata Series – consistent codes and labels – anonymized – users download – individual-level – 1960 to present – pooled data

5 IPUMS 1992 – Steve Ruggles Bob McCaa IPUMS-International 1999 Latin America, Europe, Extension Background

6 Map of IPUMS Partners Dark green = disseminating data Light green = partners, not yet disseminating 83 countries

7 Current Countries in IPUMS 35 countries 111 samples 263 million persons Egypt Ghana Kenya Rwanda South Africa Uganda Cambodia China Iraq Israel Malaysia Palestine Philippines Vietnam Argentina Brazil Canada Chile Colombia Costa Rica Ecuador Mexico Panama United States Venezuela Austria Belarus France Greece Hungary Netherlands Portugal Romania Spain United Kingdom Most countries have multiple samples Africa Asia Americas Europe

8 IPUMS Microdata Relation to head Marital status Literacy Occupation

9 Selected Variable Topics Basic demographics Marriage Family structure Fertility and mortality Migration Ethnicity, language, religion Education Work Income Housing characteristics 475 Integrated variables 9052 Unharmonized variables

10 User Access Application Scholarly and educational purposes Key: it must not be redistributed Once approved, access to all data Free

11 Making the IPUMS Pre-processing Integration Dissemination

12 Making the IPUMS Pre-processing Integration Dissemination Language translation Reformatting Error correction Sampling Confidentiality

13 Making the IPUMS Pre-processing Integration Language translation Reformatting Error correction Sampling Confidentiality Metadata Data harmonization Constructed variables Dissemination

14 Census Questionnaire (Mexico 2000) Water Access

15 Editable Census Questionnaire

16 Water access XML-Tagged Census Questionnaire

17 Data Integration – Marital Status China1982Colombia1973Kenya1989Mexico1970U.S.A.1990

18 PernumRelateAgeSexMarstChborn 1head46malemarriedn/a 2spouse44femalemarried3 3aunt77femalewidow7 4child15femalesingle0 5child13femalesinglen/a 6child11malesinglen/a PernumRelateAgeSexMarstChborn 1head46malemarriedn/a 2spouse44femalemarried3 3aunt77femalewidow7 4child15femalesingle0 5child13femalesinglen/a 6child11malesinglen/a Spouse’s Mother’sFather’s Family Interrelationship Variables Location 2 1 0 0 0 0 0 0 00 0 0 21 1 1 2 2 (Simple household)

19 Attached Characteristics Spouse’s age Mother’s location Employment status Mother’s Employment status Spouse’s location Age

20 IPUMS Home Page

21 Variables Page Variable browsing

22 Variables Page

23

24 Sample Filtering

25 Variables Page

26 Unharmonized Variables

27 Variables Page

28 Variable Description (Marital status)

29 Comparability Discussion (Marital status)

30 Variable Description (Marital status)

31 Enumeration Text (Marital status)

32 Enumeration Text (Marital status, Cambodia)

33 Variable Description (Marital status)

34 Variable Codes (Marital status)

35 Variable Codes (Marital status)

36 Variable Description (Marital status)

37 Unharmonized Input Variables (Marital status)

38 IPUMS Home Page

39 Extract Step 1 – Login

40 Extract Step 2 – Select Samples

41 Extract Step 3 – Select Variables

42

43 Extract Step 4 – Variable Options

44 Extract Step 4 – Select Cases

45 Age of spouse Employment status of father Occupation of father Extract Step 4 – Attach Characteristics

46 Extract Step 5 – Customize Sample Sizes

47

48

49 Extract Step 6 – Submit

50 Download or Revise Extract

51 Key Strengths of the Census Samples Internationally comparable Pool data across countries – integrated variables Enable study of relatively small populations Large Temporal depth Provide historical perspective

52 Key Strengths of the Census Samples Microdata All of a person’s characteristics – multivariate analysis Hierarchical Characteristics of everyone a person resided with Cohabitation and family interrelationships

53 Limitations Due to Confidentiality Geography 20,000 population or larger Sensitive variables, very small categories Samples Too small to answer some questions

54 Other Issues and Limitations Varying census years Cross-sectional data Not longitudinal User burden Information overload; culturally specific knowledge Variable labels are insufficient Very large data

55 Academic field (%) 47Economics 21Demography 10Sociology 22Other IPUMS Users 54% Graduate students 2000 registered users

56 67% multiple samples 45% multiple countries Samples Extracted 17% 5 or more countries

57 Decade of Extracted Sample 1960s 11 1970s 14 1980s 16 1990s 30 2000s 29 Decade Percent

58 Most Frequently Extracted Countries 1. Mexico 2. Brazil 3. United States 4. Colombia 5. France 6. Chile 7. Ecuador 8. Vietnam 9. Kenya 10. Argentina

59 Summation Living project Democratized access World’s largest collection of census data 200 samples in another 5 years Ongoing nature of project limits us in some respects Allows us to correct errors and improve Most data are not otherwise accessible New opportunities for comparative research Entire system is designed to encourage comparisons We welcome your feedback

60 0 5 10 15 20 25 30 35 40 45 50 1960196519701975198019851990199520002005 Percent in Labor Force Mexico Costa Rica Ecuador Chile Venezuela Colombia Brazil Married Female Labor Force Participation in Latin America (age 18 to 65)

61 0 10 20 30 40 50 60 70 1920193019401950196019701980199020002010 Percent in Labor Force Latin America United States Married Female Labor Force Participation: Latin America and U.S. (age 18 to 65)

62 0 10 20 30 40 50 60 70 1920193019401950196019701980199020002010 Percent in Labor Force United States Mexico Costa Rica Ecuador Chile Venezuela Colombia Brazil Married Female Labor Force Participation: Latin America and U.S. (age 18 to 65) Compare Latin America to U.S. 40 years earlier

63 Married Female Labor Force Participation: Mexican-born Women, 1970-2000 0 10 20 30 40 50 60 70 1970197519801985199019952000 Percent in Labor Force Mexican-born Women in United States Women in Mexico

64 Percent of elders in intergenerational families

65 Percent in elder-head intergenerational families

66 Percent in younger-head families

67

68

69


Download ppt "Design and Use of the IPUMS-International Data Serieshttp://international.ipums.org Matt Sobek Minnesota Population Center"

Similar presentations


Ads by Google