1 Master Economics and Public Policy Ecole Polytechnique – ENSAE - Sciences Po Academic year 2009-2010 Quantitative sociology.

Slides:



Advertisements
Similar presentations
Chapter 2 Describing Contingency Tables Reported by Liu Qi.
Advertisements

Workshop in R & GLMs: #3 Diane Srivastava University of British Columbia
Special random variables Chapter 5 Some discrete or continuous probability distributions.
SPH 247 Statistical Analysis of Laboratory Data. Generalized Linear Models The type of predictive model one uses depends on a number of issues; one is.
© Department of Statistics 2012 STATS 330 Lecture 32: Slide 1 Stats 330: Lecture 32.
- Word counts - Speech error counts - Metaphor counts - Active construction counts Moving further Categorical count data.
Lecture Data Mining in R 732A44 Programming in R.
Analysis of Categorical Data Nick Jackson University of Southern California Department of Psychology 10/11/
Moving away from Linear-Gaussian assumptions Cons: Some things become much harder. No baked-in test of global fit Non-recursive models Error correlations.
Week 3. Logistic Regression Overview and applications Additional issues Select Inputs Optimize complexity Transforming Inputs.
Logit & Probit Regression
L OGISTIC R EGRESSION Now with multinomial support!
1. Analyzing Non-Normal Data with Generalized Linear Models 2010 LISA Short Course Series Sai Wang, Dept. of Statistics.
Repeated Measures, Part 3 May, 2009 Charles E. McCulloch, Division of Biostatistics, Dept of Epidemiology and Biostatistics, UCSF.
Linear statistical models 2008 Binary and binomial responses The response probabilities are modelled as functions of the predictors Link functions: the.

SLIDE 1IS 240 – Spring 2010 Logistic Regression The logistic function: The logistic function is useful because it can take as an input any.
Log-linear and logistic models
A random variable that has the following pmf is said to be a binomial random variable with parameters n, p The Binomial random variable.
Linear and generalised linear models Purpose of linear models Least-squares solution for linear models Analysis of diagnostics Exponential family and generalised.
Severity Distributions for GLMs: Gamma or Lognormal? Presented by Luyang Fu, Grange Mutual Richard Moncher, Bristol West 2004 CAS Spring Meeting Colorado.
Generalized Linear Models
Statistics Idiots Guide! Dr. Hamda Qotba, B.Med.Sc, M.D, ABCM.
Ordinal Logistic Regression “Good, better, best; never let it rest till your good is better and your better is best” (Anonymous)
Review of Lecture Two Linear Regression Normal Equation
Leedy and Ormrod Ch. 11 Gray Ch. 14
1 Regression Models with Binary Response Regression: “Regression is a process in which we estimate one variable on the basis of one or more other variables.”
Baburao Kamble (Ph.D) University of Nebraska-Lincoln Data Analysis Using R Week 8: Advanced Regression and Probability Distribution.
Chapter 3: Generalized Linear Models 3.1 The Generalization 3.2 Logistic Regression Revisited 3.3 Poisson Regression 1.
ALISON BOWLING THE GENERAL LINEAR MODEL. ALTERNATIVE EXPRESSION OF THE MODEL.
Today: Lab 9ab due after lecture: CEQ Monday: Quizz 11: review Wednesday: Guest lecture – Multivariate Analysis Friday: last lecture: review – Bring questions.
Generalized Linear Models All the regression models treated so far have common structure. This structure can be split up into two parts: The random part:
Linear Model. Formal Definition General Linear Model.
Quantifying uncertainty in long-term care costs following introduction of new drug therapy: the importance of model choice Paul McNamee, Alessandra Vanoli,
Forecasting Choices. Types of Variable Variable Quantitative Qualitative Continuous Discrete (counting) Ordinal Nominal.
1 GLM I: Introduction to Generalized Linear Models By Curtis Gary Dean Distinguished Professor of Actuarial Science Ball State University By Curtis Gary.
Copyright © 2006 The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Dummy Variable Regression Models chapter ten.
Econometrics Course: Cost at the Dependent Variable (II) Paul G. Barnett, PhD December 4, 2013.
1 STA 617 – Chp10 Models for matched pairs Summary  Describing categorical random variable – chapter 1  Poisson for count data  Binomial for binary.
Log-linear Models HRP /03/04 Log-Linear Models for Multi-way Contingency Tables 1. GLM for Poisson-distributed data with log-link (see Agresti.
Statistics 2: generalized linear models. General linear model: Y ~ a + b 1 * x 1 + … + b n * x n + ε There are many cases when general linear models are.
Remembering way back: Generalized Linear Models Ordinary linear regression What if we want to model a response that is not Gaussian?? We may have experiments.
Dependent Variable Discrete  2 values – binomial  3 or more discrete values – multinomial  Skewed – e.g. Poisson Continuous  Non-normal.
1 Fighting for fame, scrambling for fortune, where is the end? Great wealth and glorious honor, no more than a night dream. Lasting pleasure, worry-free.
Logistic Regression Hal Whitehead BIOL4062/5062.
Week 7: General linear models Overview Questions from last week What are general linear models? Discussion of the 3 articles.
Instructor: R. Makoto 1richard makoto UZ Econ313 Lecture notes.
 Naïve Bayes  Data import – Delimited, Fixed, SAS, SPSS, OBDC  Variable creation & transformation  Recode variables  Factor variables  Missing.
Multiple Regression Analysis Bernhard Kittel Center for Social Science Methodology University of Oldenburg.
Statistical Modelling
A priori violations In the following cases, your data violates the normality and homoskedasticity assumption on a priori grounds: (1) count data  Poisson.
Advanced Quantitative Techniques
Robust Estimation Techniques for Trip Generation in Tennessee
Advanced Quantitative Techniques
Generalized Linear Models
Eastern Michigan University
Generalized Linear Models
Correct statistics in ecological research
Generalized Linear Models (GLM) in R
Introduction to logistic regression a.k.a. Varbrul
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Quantitative Methods What lies beyond?.
Constantly forgotten Hein Stigum Presentation, data and programs at:
Probability & Statistics Probability Theory Mathematical Probability Models Event Relationships Distributions of Random Variables Continuous Random.
DCAL Stats Workshop Bodo Winter.
Quantitative Methods What lies beyond?.
Do whatever is needed to finish…
Categorical Data Analysis
Chapter 3 : Random Variables
Fitting generalized linear models
Presentation transcript:

1 Master Economics and Public Policy Ecole Polytechnique – ENSAE - Sciences Po Academic year Quantitative sociology Basics on GLM on stata 11

2 What do we do today? Overview on General Linar Model Whats this? => generalization of many kinds of regressions Syntax => glm vardep listofvarindep [fweight=weightvar] if (condition), family(xxx) link(yyy) comparison of models, etc. => estimates store mod / estimates stats mod for the bics On STATA 11: same as regression and logit : for categorical indep var and for interactions

3 Categorical independent variables no more xi: To change the reference category = fvset base 2 gender Expression of interactions # for interaction without fist order variables ## for interaction AND fist order variables

4 | id log logit probit clog pow opower nbinomial loglog logc Gaussian | x x x inv. Gau. | x x x binomial | x x x x x x x x x Poisson | x x x neg. bin. | x x x x gamma | x x x link() family() Family glm option Gaussian(normal) family(gaussian) inverse Gaussian family(igaussian) Bernoulli/binomial family(binomial) Poisson family(poisson) negative binomial family(nbinomial) gamma family(gamma) Link function glm option identity link(identity) log link(log) logit link(logit) probit link(probit) complementary log-log link(cloglog) odds power link(opower #) power link(power #) negative binomial link(nbinomial) log-log link(loglog) log-complement link(logc)