Visualizing Life Course Sequences: A Comparison of Family Formation between East and West German Women Tim F. Liao, University of Illinois 10 December.

Slides:



Advertisements
Similar presentations
Multilevel Multiprocess Models for Partnership and Childbearing Event Histories Fiona Steele, Constantinos Kallis, Harvey Goldstein and Heather Joshi Institute.
Advertisements

Multilevel Event History Analysis of the Formation and Outcomes of Cohabiting and Marital Partnerships Fiona Steele Centre for Multilevel Modelling University.
Multiple Analysis of Variance – MANOVA
Cross Sectional Designs
Trends in living arrangements of older adults in Belgium Anne Herm, Luc Dal and Michel Poulain.
Psychology: A Modular Approach to Mind and Behavior, Tenth Edition, Dennis Coon Appendix Appendix: Behavioral Statistics.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
Giving Meaning to Social Networks: Methodology for Conducting and Analyzing Interviews based on Personal Network Visualizations José Luis Molina (UAB)
Alexa Curcio. Original Problem : Would a restriction on height, such as prohibiting males from marrying taller females, affect the height of the entire.
Faculty of Management, Economics and Social Sciences Research Institute for Sociology Michael Wagner On the links between unemployment, partnership stability.
QUANTITATIVE DATA ANALYSIS
Clustered or Multilevel Data
PPA 415 – Research Methods in Public Administration Lecture 4 – Measures of Dispersion.
Fertility (within and across cohorts)  What is available over time?  Longitudinal perspectives  Complimentary analytical inputs and outputs  How to.
11 Populations and Samples.
Social Research Methods
Leeds, 22 March 2007 A SECONDARY ANALYSIS OF DATA MID CAREER FELLOWSHIP Gopalakrishnan Netuveli Imperial College London 1 Jan 2007 – 31 March 2008.
PS 225 Lecture 15 Analysis of Variance ANOVA Tables.
Hypothesis Testing II The Two-Sample Case.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
Think of a topic to study Review the previous literature and research Develop research questions and hypotheses Specify how to measure the variables in.
Descriptive Statistics
Data Presentation.
1 DATA DESCRIPTION. 2 Units l Unit: entity we are studying, subject if human being l Each unit/subject has certain parameters, e.g., a student (subject)
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Basic Business Statistics 11 th Edition.
Confidence Interval Estimation
Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.
PROBABILITY (6MTCOAE205) Chapter 6 Estimation. Confidence Intervals Contents of this chapter: Confidence Intervals for the Population Mean, μ when Population.
Estimation in Sampling!? Chapter 7 – Statistical Problem Solving in Geography.
A statistical method for testing whether two or more dependent variable means are equal (i.e., the probability that any differences in means across several.
The Division of Household Labor Introduction to Family Studies May 26,
Giving Meaning to Social Networks: Methodology for Conducting and Analyzing Interviews based on Personal Network Visualizations José Luis Molina (UAB),
StatisticsStatistics Graphic distributions. What is Statistics? Statistics is a collection of methods for planning experiments, obtaining data, and then.
Agricultural and Biological Statistics. Sampling and Sampling Distributions Chapter 5.
Transition Patterns & Risks of School Leavers in Europe How institutions & policies shape the integration of young people into the labour market.
1 Chapter 3 Looking at Data: Distributions Introduction 3.1 Displaying Distributions with Graphs Chapter Three Looking At Data: Distributions.
Examining Relationships in Quantitative Research
PCB 3043L - General Ecology Data Analysis. OUTLINE Organizing an ecological study Basic sampling terminology Statistical analysis of data –Why use statistics?
Chapter 5: Measures of Variability  The Importance of Measuring Variability  IQV (Index of Qualitative Variation)  The Range  IQR (Inter-Quartile Range)
1 Using the Cohort Studies: Understanding the postponement of parenthood to later ages Ann Berrington ESRC Centre for Population Change University of Southampton,
The Division of Household Labor Introduction to Family Studies November 22,
1 CSE 2337 Chapter 3 Data Visualization With Excel.
© Copyright McGraw-Hill CHAPTER 2 Frequency Distributions and Graphs.
University of Warwick, Department of Sociology, 2012/13 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Survival Analysis/Event History Analysis:
Chapter Eight: Using Statistics to Answer Questions.
RESEARCH & DATA ANALYSIS
PCB 3043L - General Ecology Data Analysis.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
1January 26, 2016January 26, 2016January 26, 2016 The Division of Household Labor Family Sociology.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Measurements and Their Analysis. Introduction Note that in this chapter, we are talking about multiple measurements of the same quantity Numerical analysis.
The American Family 50 years of change. Change… The American family has undergone tremendous change in the last 50 years. Some argue that family life.
Jump to first page Inferring Sample Findings to the Population and Testing for Differences.
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
1 Statistical Analysis - Graphical Techniques Dr. Jerrell T. Stracener, SAE Fellow Leadership in Engineering EMIS 7370/5370 STAT 5340 : PROBABILITY AND.
Aron, Aron, & Coups, Statistics for the Behavioral and Social Sciences: A Brief Course (3e), © 2005 Prentice Hall Chapter 10 Introduction to the Analysis.
Lecture 7: Bivariate Statistics. 2 Properties of Standard Deviation Variance is just the square of the S.D. If a constant is added to all scores, it has.
Slide Slide 1 Chapter 10 Correlation and Regression 10-1 Overview 10-2 Correlation 10-3 Regression 10-4 Variation and Prediction Intervals 10-5 Multiple.
SECTION 1 TEST OF A SINGLE PROPORTION
Methods of multivariate analysis Ing. Jozef Palkovič, PhD.
Chapter 15 Analysis of Variance. The article “Could Mean Platelet Volume be a Predictive Marker for Acute Myocardial Infarction?” (Medical Science Monitor,
Which socio-demographic living arrangement helps to reach 100? Michel POULAIN & Anne HERM Orlando 8 January 2014.
PCB 3043L - General Ecology Data Analysis.
Bi-variate #1 Cross-Tabulation
Cross Sectional Designs
Discrete Event Simulation - 4
Chapter Nine Part 1 (Sections 9.1 & 9.2) Hypothesis Testing
Chapter Nine: Using Statistics to Answer Questions
Chapter 10 Introduction to the Analysis of Variance
The Demographic Transition Model (DTM)
Presentation transcript:

Visualizing Life Course Sequences: A Comparison of Family Formation between East and West German Women Tim F. Liao, University of Illinois 10 December 2010 Institute of Sociology, Academia Sinica

Some Preliminary Considerations Elder (1985): "Life patterns are structured by variations in the timing, duration, and order of events." Rindfuss, Swicegood, and Rosenfeld (1987) in “Disorder in the Life Course: How Often and Does it Matter?” analyzed NLS of class 72 data, 5 school-employment events of 8 years, if equal probability 5 8 =390,625 sequences. They encouraged researchers to take a more careful look at the life course as it is actually lived In this presentation, we will take a formal look at people’s life courses as they were actually lived, through the help of visualization of life course sequences

Recent Developments in Sequence Analysis Abbott’s work using optimal matching in the 1990s analysis the sociological article Initially critics highly questioned its value for social science research (Wu 2000, Levine 2000) In the past decade methodological developments (Lesnard 2006, 2008; Elzinga 2003, 2010; Hollister 2009; Stovel & Bolan 2004) introduced a ‘second wave’ of sequence analysis (Aisenbrey and Fasang 2010) that addresses much of this criticism Sequence analysis is becoming increasingly wide-spread and accepted especially in life course and employment research

Visualization of sequences: sequence index plots Introduced by Scherer 2001 Stata.ado by Kohler, Brzinsky-Fay and Luniak 2006 Each sequence represented by one line across a process time axis, different states indicated by different colors

Alternative Visual Methods Tree representation of clusters of transitions to adulthood (Aassve et al 2007) MDS Sequence index plot of family formation trajectories (Piccarreta & Lior 2010)

Tufte’s (2001) Graphic Excellence (1) Above all else show the data, i.e. tell the truth about the data (2) maximize the data-ink ratio, (3) erase non-data-ink, (4) erase redundant data-ink, and (5) revise and edit

Advantages of sequence index plots Holistic representation of processes Taking individual sequence seriously as a unit of analysis Convey large amounts of information in single graph: prevalence and timing of states, intermediary states, overall variability within and between sequences etc… Number

Problems of sequence index plots Sequence index plots can be deceptive due to the follow reasons: 1.Over-plotting: deceptive in terms of relative importance of individual sequences, problematic for N larger than a couple of hundreds 2.Unfair Comparison: difficult to compare groups of unequal sizes 3.Reliance on Order: lose sight of central tendencies in the timing of more than one transition within sequences

Example data: German Life History Study (GLHS) The larger study collected data since early 1980s on the life histories of some 8,500 men and women from 20 selected birth cohorts in West Germany and of more than 2,900 men and women from 13 selected birth cohorts in East Germany For this analysis: family formation sequences of 474 East (N=132) and West (N=342) German women born 1971 Retrospective life history data, collected in 1997/1998 & follow up in 2005; we only use the data available in 2005, thus we can follow family formation until age 34 Age range: seven states of family formation: single, in a relationship, cohabiting, cohabiting with a child, married, married with a child, and divorced/widowed

Problem I: over-plotting of 474 cases

Problem I: over-plotting of 948 cases

Problem I: over-plotting of 1932 cases

Problem II: comparative difficulty

Relative frequency sequence index plots To deal with over-plotting of large N: we draw random subsample and present as relative sequence index plots For comparing groups of unequal sizes: we compare differences in relative frequency of specific sequences (and states they contain) across plots of unequal sizes

Relative Sequence index plots of family formation sequences in West and East Germany  width of each line represents relative frequency of sequence in subgroup Relatively more cohabitation with or without child in East Germany than in West Germany

Sequence index plots of clusters of family formation (based on Lesnard’s dynamic OM distance)

Relative frequency sequence index plots of family formation clusters (sum to 100%)

Relative sequence index plots in cohort comparison

Problem III: Order of Sequences One may lose sight of central tendencies in multiple transitions within sequences Ordering of sequences can change one’s overall perception of a graph

Ordered by timing of cohabitation or marriage? Deceptive/not informative on timing/prevalence of marriage when ordered by cohabitation and vice versa

Multiple transition curve plots based on Kaplan Mayer survival curves Calculate separate transition curves for each transition of interest: 1.in a relationship 2. cohabitation 3. marriage 4. cohabitation with child 5. marriage with child

Multiple transition curve plots next to relative frequency index plots

The two new plots for East & West Germany

Multiple transition curves for family formation clusters

The two new plots for the marriage/late child cluster

The two new plots for the marriage/early child cluster

The two new plots for the late marriage cluster

Comments on multiple transition curve plots Most applicable for non-recurrent states, unidirectional processes – a lot of social science sequences, especially in life course applications are fairly unidirectional Especially informative in combination with relative sequence index plots: allowing us to visualize the variability of specific transitions and central tendencies in their timing jointly

Are they different enough?

There should be no significant difference between 2 random 100 sequences

Mean time spent in each state among East and West German Women

State distribution plots comparing East and West German Women

Conclusion & next steps We have developed relative frequency index plots and multiple transition curve plots for examining life courses as they were actually lived We still have the remaining tasks of testing differences between sequence patterns and shapes  Randomization test Goal: develop three types of difference measures to test for difference in different aspects of sequences: (1)Overall difference that can be decomposed into (2)Temporal structure: (a) within and (b) between sequence variation, (3)Qualitative difference in the occurrence of states

Another procedure for constructing relative frequency index plot 1. Divide the sample into P relative frequency groups; 2. Choose a representative sequence pattern by modal sequence to represent a relative frequency group; 3. For a relative frequency group that has no modal pattern, use medoid sequence (smallest sum of distances to all other sequences in each group) as a representative of each group; 4. When a sample cannot be neatly allocated into the total number of relative frequency groups P, a case can be proportionally allocated to adjacent groups with allocation probabilities determined by (N-P)/P; 5. Steps 2 & 3 are carried out on the reallocated sample; 6. The choice of P depends on effective visual presentation.

Applying the Medoid Selection Procedure

State distribution plots by gender

State distribution plots by father’s unemployment

State distribution plots by grammar school background