Individual level synthetic regression models Session 4.

Slides:



Advertisements
Similar presentations
Estimating Identification Risks for Microdata Jerome P. Reiter Institute of Statistics and Decision Sciences Duke University, Durham NC, USA.
Advertisements

Curve fitting Session 2. Method background Disability rates are strongly linked to age However HSE disability rates for single years of age are unstable.
Session 1: Small area estimation introduction Alan Marshall ESDS Government.
Adding geographical detail to social surveys: Estimating local disability prevalence Alan Marshall ESDS Government 15 th April 2010.
1 Session 10 Sampling Weights: an appreciation. 2 To provide you with an overview of the role of sampling weights in estimating population parameters.
Statistical Analysis SC504/HS927 Spring Term 2008
Household Projections for England Yolanda Ruiz DCLG 16 th July 2012.
Apr-15H.S.1 Stata: Linear Regression Stata 3, linear regression Hein Stigum Presentation, data and programs at: courses.
Lecture 11 (Chapter 9).
© Department of Statistics 2012 STATS 330 Lecture 32: Slide 1 Stats 330: Lecture 32.
Logit & Probit Regression
Limited Dependent Variables
SPATIAL DATA ANALYSIS Tony E. Smith University of Pennsylvania Point Pattern Analysis Spatial Regression Analysis Continuous Pattern Analysis.
1 BINARY CHOICE MODELS: LOGIT ANALYSIS The linear probability model may make the nonsense predictions that an event will occur with probability greater.

Multinomial Logistic Regression
Sociology 601 Class 28: December 8, 2009 Homework 10 Review –polynomials –interaction effects Logistic regressions –log odds as outcome –compared to linear.
458 Lumped population dynamics models Fish 458; Lecture 2.
Log-linear and logistic models Generalised linear model ANOVA revisited Log-linear model: Poisson distribution logistic model: Binomial distribution Deviances.
Log-linear and logistic models
EPI 809/Spring Multiple Logistic Regression.
Lecture 14-2 Multinomial logit (Maddala Ch 12.2)
Notes on Logistic Regression STAT 4330/8330. Introduction Previously, you learned about odds ratios (OR’s). We now transition and begin discussion of.
Log-linear analysis Summary. Focus on data analysis Focus on underlying process Focus on model specification Focus on likelihood approach Focus on ‘complete-data.
1 B. The log-rate model Statistical analysis of occurrence-exposure rates.
C. Logit model, logistic regression, and log-linear model A comparison.
Logistic regression for binary response variables.
Lecture 8: Generalized Linear Models for Longitudinal Data.
Logit model, logistic regression, and log-linear model A comparison.
ALISON BOWLING THE GENERAL LINEAR MODEL. ALTERNATIVE EXPRESSION OF THE MODEL.
Learning Theory Reza Shadmehr logistic regression, iterative re-weighted least squares.
Business Intelligence and Decision Modeling Week 11 Predictive Modeling (2) Logistic Regression.
Logistic Regression Database Marketing Instructor: N. Kumar.
University of Warwick, Department of Sociology, 2014/15 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Week 7 Logistic Regression I.
1 היחידה לייעוץ סטטיסטי אוניברסיטת חיפה פרופ’ בנימין רייזר פרופ’ דוד פרג’י גב’ אפרת ישכיל.
Using Weighted Data Donald Miller Population Research Institute 812 Oswald Tower, December 2008.
Lecture Slide #1 Logistic Regression Analysis Estimation and Interpretation Hypothesis Tests Interpretation Reversing Logits: Probabilities –Averages.
Week 5: Logistic regression analysis Overview Questions from last week What is logistic regression analysis? The mathematical model Interpreting the β.
Business Intelligence and Decision Modeling
© John M. Abowd 2007, all rights reserved General Methods for Missing Data John M. Abowd March 2007.
Logistic Regression. Linear Regression Purchases vs. Income.
Basics of Biostatistics for Health Research Session 1 – February 7 th, 2013 Dr. Scott Patten, Professor of Epidemiology Department of Community Health.
Multiple Logistic Regression STAT E-150 Statistical Methods.
1 Everyday is a new beginning in life. Every moment is a time for self vigilance.
K = K = K = 100.
1. Divide the study sample data into two groups: Fatigued (F), N=3,528, and Non-Fatigued (NF), N=3,634. Estimate logistic regressions to obtain the probability.
© Department of Statistics 2012 STATS 330 Lecture 22: Slide 1 Stats 330: Lecture 22.
Logistic Regression. Linear regression – numerical response Logistic regression – binary categorical response eg. has the disease, or unaffected by the.
Dates Presentations Wed / Fri Ex. 4, logistic regression, Monday Dec 7 th Final Tues. Dec 8 th, 3:30.
Binary logistic regression. Characteristic Regression model for target categorized variable explanatory variables – continuous and categorical Estimate.
Logistic Regression and Odds Ratios Psych DeShon.
Joint UNECE/Eurostat Work Session on Statistical Data Confidentiality, 5-7 September 2015, Helsinki Beata Nowok Administrative Data Research Centre – Scotland.
Nonparametric Statistics
Birthweight (gms) BPDNProp Total BPD (Bronchopulmonary Dysplasia) by birth weight Proportion.
1 BINARY CHOICE MODELS: LOGIT ANALYSIS The linear probability model may make the nonsense predictions that an event will occur with probability greater.
Instructor: R. Makoto 1richard makoto UZ Econ313 Lecture notes.
Nonparametric Statistics
Advanced Quantitative Techniques
Intervention Study: Kenya PRIMR Case Regression Analysis
Notes on Logistic Regression
Generalized Linear Models
Introduction to logistic regression a.k.a. Varbrul
Nonparametric Statistics
Regression Models - Introduction
Mathematical Foundations of BME Reza Shadmehr
Logistic Regression.
females males Analyses with discrete variables
Ordinary Least Square estimator using STATA
Presentation transcript:

Individual level synthetic regression models Session 4

Aims of practical 1.Fit a logistic regression model to predict the probability of a person having a mobility disability (age, age squared and age cubed, sex, LLTI and LLTI*age 2.Calculate the model probabilities of having a mobility disability at each single year of age for males and females and for those with and without an LLTI. 3.Generate district estimates of the population with a mobility disability by multiplying the model probabilities by the appropriate population counts (Census)

Let: =probability of mobility disability for individual i x i =the age of individual i j=0 (no LLTI) or 1 (has an LLTI) z ij =1 if j=1 (individual i has an LLTI) and 0 otherwise parameters represent the influence of age on log odds of having a disability Β gives the effect of having an LLTI and δ represents How this effect changes with age

We can use the parameter estimates from the logistic regression model to calculate model probabilities of having a mobility disability Note there is a probability (or rate) for each age (x) and LLTI group (j)

Number of people with a mobility disability at age x and district r Population count at age x, district r and LLTI group j (census) Model probability of mobility disability at age x and LLTI group j (Estimated using HSE data) J=0 no LLTI, j=1 has an LLTI

Synthetic regression assumptions and model limitations The model assumes that the explanatory variables in the HSE and the census are measured identically. The model requires population counts of crosstabulations of all explanatory variables. Have to drop lots of missing values of LLTI (in 2000 proxy interviews didnt include this question).

Comparison of Census and HSE LLTI schedules Lower rates of LLTI in the Census compared to Lower rates of LLTI in the census compared to the HSE at the younger ages

Stata commands xi: logit mobility i.llti*age agesq agecub if sex==1 [pweight=weight] predict MO_RT_M if sex==1 Start of with the HSE microdata (HSE data.dta)

Stata commands Repeat the model and generate predicted values for females Drop duplicate records in terms of the variables age, sex and LLTI Duplicates drop age sex llti, force This leaves us with a predicted probability of mobility disability for each single year of age, sex and LLTI status

Model rates of mobility disability

Stata commands FileDetails Practical 3 – task 1.dtaModel probabilities (age, sex and LLTI status) – file created during task 1 of practical 3 Pop data – practical 3.dta Population counts (age, sex and LLTI status) for each of the 6 casestudy districts Now merge two files using sex, age and LLTI status as the merge variable