Binary Response Harry R. Erwin, PhD School of Computing and Technology University of Sunderland.

Slides:



Advertisements
Similar presentations
Class 6: Hypothesis testing and confidence intervals
Advertisements

Using R Harry R. Erwin, PhD School of Computing and Technology University of Sunderland.
Analysis of Categorical Data Nick Jackson University of Southern California Department of Psychology 10/11/
Contingency Tables Chapters Seven, Sixteen, and Eighteen Chapter Seven –Definition of Contingency Tables –Basic Statistics –SPSS program (Crosstabulation)
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
U eatworms.swmed.edu/~leon u
Proportion Data Harry R. Erwin, PhD School of Computing and Technology University of Sunderland.
Count Data Harry R. Erwin, PhD School of Computing and Technology University of Sunderland.
Statistics 350 Lecture 16. Today Last Day: Introduction to Multiple Linear Regression Model Today: More Chapter 6.
Final Review Session.
Probability Harry R. Erwin, PhD School of Computing and Technology University of Sunderland.
Log-linear and logistic models Generalised linear model ANOVA revisited Log-linear model: Poisson distribution logistic model: Binomial distribution Deviances.
Log-linear and logistic models
Some standard univariate probability distributions
Statistics in MATLAB COMM2M Harry R. Erwin, PhD University of Sunderland.
Some standard univariate probability distributions
Lecture Slides Elementary Statistics Twelfth Edition
Contrasts Harry R. Erwin, PhD School of Computing and Technology University of Sunderland.
Some standard univariate probability distributions
Ch 5 Practical Point Pattern Analysis Spatial Stats & Data Analysis by Magdaléna Dohnalová.
Regression Harry R. Erwin, PhD School of Computing and Technology University of Sunderland.
Comparing Two Samples Harry R. Erwin, PhD
Analysis of Covariance Harry R. Erwin, PhD School of Computing and Technology University of Sunderland.
Chapter 3: Generalized Linear Models 3.1 The Generalization 3.2 Logistic Regression Revisited 3.3 Poisson Regression 1.
Some standard univariate probability distributions Characteristic function, moment generating function, cumulant generating functions Discrete distribution.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Central Tendency Harry R. Erwin, PhD School of Computing and Technology University of Sunderland.
Chapter 11 Chi-Square Procedures 11.3 Chi-Square Test for Independence; Homogeneity of Proportions.
Summary of Remainder Harry R. Erwin, PhD School of Computing and Technology University of Sunderland.
Experimental Design and Modelling Harry R. Erwin, PhD School of Computing and Technology University of Sunderland.
Variance Harry R. Erwin, PhD School of Computing and Technology University of Sunderland.
Statistical Modelling Harry R. Erwin, PhD School of Computing and Technology University of Sunderland.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Introduction to Statistics Harry R. Erwin, PhD School of Computing and Technology University of Sunderland.
Biostatistics, statistical software VII. Non-parametric tests: Wilcoxon’s signed rank test, Mann-Whitney U-test, Kruskal- Wallis test, Spearman’ rank correlation.
Biostatistics Class 3 Discrete Probability Distributions 2/8/2000.
1 In this case, each element of a population is assigned to one and only one of several classes or categories. Chapter 11 – Test of Independence - Hypothesis.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Assessing Binary Outcomes: Logistic Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
© Department of Statistics 2012 STATS 330 Lecture 20: Slide 1 Stats 330: Lecture 20.
Please turn off cell phones, pagers, etc. The lecture will begin shortly.
CHI SQUARE TESTS.
Chapter Outline Goodness of Fit test Test of Independence.
Generalized Linear Models (GLMs) and Their Applications.
Multiple Regression Harry R. Erwin, PhD School of Computing and Technology University of Sunderland.
Statistical Inference Harry R. Erwin, PhD School of Computing and Technology University of Sunderland.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Statistics 2: generalized linear models. General linear model: Y ~ a + b 1 * x 1 + … + b n * x n + ε There are many cases when general linear models are.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Remembering way back: Generalized Linear Models Ordinary linear regression What if we want to model a response that is not Gaussian?? We may have experiments.
1 Fighting for fame, scrambling for fortune, where is the end? Great wealth and glorious honor, no more than a night dream. Lasting pleasure, worry-free.
More inferential statistics. Chi square tests compare observed frequency distributions, either to theoretical expectations or to other observed frequency.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
Bayesian Modelling Harry R. Erwin, PhD School of Computing and Technology University of Sunderland.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
The Idea of the Statistical Test. A statistical test evaluates the "fit" of a hypothesis to a sample.
Week 7: General linear models Overview Questions from last week What are general linear models? Discussion of the 3 articles.
Statistical Modelling
Chi-Square hypothesis testing
Generalized Linear Models
Active Learning Lecture Slides
Analysis of Variance Harry R. Erwin, PhD
Logistic Regression --> used to describe the relationship between
What is Regression Analysis?
STAT 312 Introduction Z-Tests and Confidence Intervals for a
Lecture Slides Elementary Statistics Twelfth Edition
Lecture Slides Elementary Statistics Twelfth Edition
Lecture Slides Essentials of Statistics 5th Edition
Chapter Outline Goodness of Fit test Test of Independence.
Presentation transcript:

Binary Response Harry R. Erwin, PhD School of Computing and Technology University of Sunderland

Resources Crawley, MJ (2005) Statistics: An Introduction Using R. Wiley. Freund, RJ, and WJ Wilson (1998) Regression Analysis, Academic Press. Gentle, JE (2002) Elements of Computational Statistics. Springer. Gonick, L., and Woollcott Smith (1993) A Cartoon Guide to Statistics. HarperResource (for fun).

Introduction These four demonstration sessions of this class address special types of data: –Counts –Proportions –Survival analysis –Binary responses (this lecture)

Binary Response Very common: –dead or alive –occupied or empty –male or female –employed or unemployed Response variable is 0 or 1. R assumes a binomial trial with sample size 1.

When to use Binary Response Data Do a binary response analysis only when you have unique values of one or more explanatory variables for each and every individual case. Otherwise lump: aggregate to the point where you have unique values. Either: –Analyse the data as a contingency table using Poisson errors, or –Decide which explanatory variable is key, express the data as proportions, recode as a count of a two-level factor, and assume binomial errors.

Applications to Spatial and Time Series Statistics You’re assuming you’re sampling from a spatial point process. The null hypothesis is that events occur uniformly over space and with a Poisson distribution (memory-less) over time. The usual approach is described on the next slide. This addresses both location and rate of events simultaneously. Consider lumping to study the geographic or time-dependent distribution of the event rate separately. The problem is similar to how we model neurone spiking.

Modelling Binary Response Single vector with the response variable Use glm with family = binomial Think about a log-log link instead of logit. Fit the usual way. Test significance using  2. Don’t worry about overdispersion. Book example.