Quantitative Methods Categorical Data. The Poisson Distribution.

Slides:



Advertisements
Similar presentations
SADC Course in Statistics Modelling ideas in general – an appreciation (Session 20)
Advertisements

Survival Analysis. Key variable = time until some event time from treatment to death time for a fracture to heal time from surgery to relapse.
A PowerPoint®-based guide to assist in choosing the suitable statistical test. NOTE: This presentation has the main purpose to assist researchers and students.
The logic behind a statistical test. A statistical test is the comparison of the probabilities in favour of a hypothesis H 1 with the respective probabilities.
© Department of Statistics 2012 STATS 330 Lecture 32: Slide 1 Stats 330: Lecture 32.
INTRODUCTION TO NON-PARAMETRIC ANALYSES CHI SQUARE ANALYSIS.
N-way ANOVA. Two-factor ANOVA with equal replications Experimental design: 2  2 (or 2 2 ) factorial with n = 5 replicate Total number of observations:
Comparing More than Two Means Mean AMean BMean C Slide 14.1.
Modeling Wim Buysse RUFORUM 1 December 2006 Research Methods Group.
Statistics 350 Lecture 16. Today Last Day: Introduction to Multiple Linear Regression Model Today: More Chapter 6.
ANalysis Of VAriance (ANOVA) Comparing > 2 means Frequently applied to experimental data Why not do multiple t-tests? If you want to test H 0 : m 1 = m.
Spotting pseudoreplication 1.Inspect spatial (temporal) layout of the experiment 2.Examine degrees of freedom in analysis.
Log-linear and logistic models
Chapter 19 Data Analysis Overview
Modelling health care costs: practical examples and applications Andrew Briggs Philip Clarke University of Oxford & Daniel Polsky Henry Glick University.
Linear statistical models 2009 Count data  Contingency tables and log-linear models  Poisson regression.
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Statistical hypothesis testing – Inferential statistics II. Testing for associations.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
SAS Lecture 5 – Some regression procedures Aidan McDermott, April 25, 2005.
1 Advances in Statistics Or, what you might find if you picked up a current issue of a Biological Journal.
ANCOVA Lecture 9 Andrew Ainsworth. What is ANCOVA?
Chapter 13: Inference in Regression
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 26 Comparing Counts.
Generalized Linear Models All the regression models treated so far have common structure. This structure can be split up into two parts: The random part:
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Blocks and pseudoreplication
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Chapter 12: Correlation and Linear Regression 1.
Copyright © 2010 Pearson Education, Inc. Warm Up- Good Morning! If all the values of a data set are the same, all of the following must equal zero except.
Statistics 2: generalized linear models. General linear model: Y ~ a + b 1 * x 1 + … + b n * x n + ε There are many cases when general linear models are.
Comparing Counts Chapter 26. Goodness-of-Fit A test of whether the distribution of counts in one categorical variable matches the distribution predicted.
Nonparametric Statistics
 Naïve Bayes  Data import – Delimited, Fixed, SAS, SPSS, OBDC  Variable creation & transformation  Recode variables  Factor variables  Missing.
Chapter 12: Correlation and Linear Regression 1.
Nonparametric Statistics
32931 Technology Research Methods Autumn 2017 Quantitative Research Component Topic 4: Bivariate Analysis (Contingency Analysis and Regression Analysis)
BINARY LOGISTIC REGRESSION
Logistic Regression When and why do we use logistic regression?
Parallel chi-square test
Two-way ANOVA with significant interactions
Generalized Linear Models
Confidence Intervals and Hypothesis Tests for Variances for One Sample
Active Learning Lecture Slides
Categorical Data Aims Loglinear models Categorical data
Chapter 25 Comparing Counts.
Y - Tests Type Based on Response and Measure Variable Data
12 Inferential Analysis.
Part Three. Data Analysis
Lecture 2: Replication and pseudoreplication
Analysis of Data Graphics Quantitative data
Social Research Methods
Introduction to logistic regression a.k.a. Varbrul
Nonparametric Three or more groups.
Log Linear Modeling of Independence
Quantitative Methods What lies beyond?.
Nonparametric Statistics
Review of Hypothesis Testing
Statistics review Basic concepts: Variability measures Distributions
What is Regression Analysis?
Quantitative Methods What lies beyond?.
Two-way analysis of variance (ANOVA)
12 Inferential Analysis.
Chapter 26 Comparing Counts.
Chapter 26 Comparing Counts Copyright © 2009 Pearson Education, Inc.
Chapter 26 Comparing Counts.
Quantitative Methods Categorical Data.
Designing experiments - keeping it simple
Quantitative Methods Categorical Data.
Presentation transcript:

Quantitative Methods Categorical Data

The Poisson Distribution

Categorical Data The Poisson Distribution Items Containers Radioactive decays Telephone calls begun Fig trees Fleas Typing mistakes Second Minute Hectare Cat Page

Categorical Data The Poisson Distribution

Categorical Data The Poisson Distribution

Categorical Data The Poisson Distribution Variance increase with mean. Indeed, variance=mean for a Poisson distribution. This will be important later in the lecture.

Categorical Data The Poisson Distribution Items Containers Radioactive decays Telephone calls begun Fig trees Fleas Typing mistakes Second Minute Hectare Cat Page Assumptions that guarantee a Poisson distribution of items across containers: (1) Independence of items (2) Homogeneity of containers

Categorical Data The Poisson Distribution

Categorical Data What is categorical data?

Categorical Data What is categorical data? 1. There are 952 datapoints 2. They must be independent

Categorical Data What is categorical data? Small Large Yellow Blue 1. There are 952 datapoints 2. They must be independent

Categorical Data What is categorical data? Caterpillar 1 Caterpillar 2 Plant 1 Plant 2 1. There are 952 datapoints 2. They must be independent

Categorical Data What is categorical data? Training Procedure 1 Training Procedure 2 Test 1 Test 2 1. There are 952 datapoints 2. They must be independent

Categorical Data Dispersion test

Categorical Data Dispersion test

Categorical Data Dispersion test Species A is under-dispersed Species B is over-dispersed

Categorical Data Contingency tables

Categorical Data Contingency tables

Categorical Data Contingency tables

Categorical Data Contingency tables and orthogonality

Categorical Data Contingency tables and orthogonality Variety Sowrate: Number of plots at each treatment combination.

Categorical Data Contingency tables and orthogonality Treatments Blocks: Number of plots at each treatment combination.

Categorical Data Contingency tables by GLM

Categorical Data Contingency tables by GLM 2 Pr(X>x)=0.7995, similar to contingency table test

Categorical Data Contingency tables by GLM If we accept the Poisson hypothesis:

Categorical Data Contingency tables by GLM If we dont accept the Poisson hypothesis:

Categorical Data Contingency tables by GLM The GLM main effects have parallel categorical analyses too:

Last words… Be sure you can tell whether a dataset is categorical Chi-square methods apply to simple cases GLM methods can also be used, and are linked For many situations, Generalised Linear Models (in this case Logistic Regressions or Log-Linear Models) are needed Nonparametric Tests Read handout Categorical Data