Categorical data analysis

Slides:



Advertisements
Similar presentations
Contingency Tables Prepared by Yu-Fen Li.
Advertisements

Chapter 2 Describing Contingency Tables Reported by Liu Qi.
Analysis of Categorical Data Nick Jackson University of Southern California Department of Psychology 10/11/
KRUSKAL-WALIS ANOVA BY RANK (Nonparametric test)
Chi Square Tests Chapter 17.
Statistical Inference for Frequency Data Chapter 16.
Statistical Tests Karen H. Hagglund, M.S.
CHI-SQUARE TEST OF INDEPENDENCE
Session 7.1 Bivariate Data Analysis
Chi Square Test Dealing with categorical dependant variable.
Data Analysis Statistics. Levels of Measurement Nominal – Categorical; no implied rankings among the categories. Also includes written observations and.
1 Introduction to biostatistics Lecture plan 1. Basics 2. Variable types 3. Descriptive statistics: Categorical data Categorical data Numerical data Numerical.
+ Quantitative Statistics: Chi-Square ScWk 242 – Session 7 Slides.
AS 737 Categorical Data Analysis For Multivariate
AM Recitation 2/10/11.
LIS 570 Summarising and presenting data - Univariate analysis continued Bivariate analysis.
1 Psych 5500/6500 Chi-Square (Part Two) Test for Association Fall, 2008.
Amsterdam Rehabilitation Research Center | Reade Testing significance - categorical data Martin van der Esch, PhD.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 22 Using Inferential Statistics to Test Hypotheses.
Please turn off cell phones, pagers, etc. The lecture will begin shortly.
Dr.Shaikh Shaffi Ahamed Ph.D., Dept. of Family & Community Medicine
Chi-squared Tests. We want to test the “goodness of fit” of a particular theoretical distribution to an observed distribution. The procedure is: 1. Set.
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Pearson Chi-Square Contingency Table Analysis.
Chi-Square X 2. Parking lot exercise Graph the distribution of car values for each parking lot Fill in the frequency and percentage tables.
Analysis of Qualitative Data Dr Azmi Mohd Tamil Dept of Community Health Universiti Kebangsaan Malaysia FK6163.
Please turn off cell phones, pagers, etc. The lecture will begin shortly.
CHI SQUARE TESTS.
HYPOTHESIS TESTING BETWEEN TWO OR MORE CATEGORICAL VARIABLES The Chi-Square Distribution and Test for Independence.
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
Fundamental Statistics in Applied Linguistics Research Spring 2010 Weekend MA Program on Applied English Dr. Da-Fu Huang.
1/5/2016Slide 1 We will use a one-sample test of proportions to test whether or not our sample proportion supports the population proportion from which.
1 G Lect 7a G Lecture 7a Comparing proportions from independent samples Analysis of matched samples Small samples and 2  2 Tables Strength.
Chapter 14 Chi-Square Tests.  Hypothesis testing procedures for nominal variables (whose values are categories)  Focus on the number of people in different.
Making Comparisons All hypothesis testing follows a common logic of comparison Null hypothesis and alternative hypothesis – mutually exclusive – exhaustive.
Chi-Square X 2. Review: the “null” hypothesis Inferential statistics are used to test hypotheses Whenever we use inferential statistics the “null hypothesis”
Chi-Square X 2. Review: the “null” hypothesis Inferential statistics are used to test hypotheses Whenever we use inferential statistics the “null hypothesis”
Copyright c 2001 The McGraw-Hill Companies, Inc.1 Chapter 11 Testing for Differences Differences betweens groups or categories of the independent variable.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 12 Tests of Goodness of Fit and Independence n Goodness of Fit Test: A Multinomial.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 11 Testing for Differences Differences betweens groups or categories of the independent.
Beginners statistics Assoc Prof Terry Haines. 5 simple steps 1.Understand the type of measurement you are dealing with 2.Understand the type of question.
 Confidence Intervals  Around a proportion  Significance Tests  Not Every Difference Counts  Difference in Proportions  Difference in Means.
FREQUENCY DISTRIBUTION
I. ANOVA revisited & reviewed
CHI-SQUARE(X2) DISTRIBUTION
Chapter 9: Non-parametric Tests
Making Comparisons All hypothesis testing follows a common logic of comparison Null hypothesis and alternative hypothesis mutually exclusive exhaustive.
The binomial applied: absolute and relative risks, chi-square
Association between two categorical variables
Hypothesis Testing Review
Lecture 8 – Comparing Proportions
Georgi Iskrov, MBA, MPH, PhD Department of Social Medicine
Summarising and presenting data - Bivariate analysis
Categorical Data Analysis
Hypothesis testing. Chi-square test
Ass. Prof. Dr. Mogeeb Mosleh
If we can reduce our desire,
Statistical Analysis using SPSS
STAT 312 Introduction Z-Tests and Confidence Intervals for a
Contingency Tables.
BIVARIATE ANALYSIS: Measures of Association Between Two Variables
Experimental Design Data Normal Distribution
BIVARIATE ANALYSIS: Measures of Association Between Two Variables
Exact Test Fisher’s Statistics
Applied Statistics Using SPSS
Displaying and Describing Categorical data
Applied Statistics Using SPSS
CLASS 6 CLASS 7 Tutorial 2 (EXCEL version)
Research Techniques Made Simple: Interpreting Measures of Association in Clinical Research Michelle Roberts PhD,1,2 Sepideh Ashrafzadeh,1,2 Maryam Asgari.
Georgi Iskrov, MBA, MPH, PhD Department of Social Medicine
Presentation transcript:

Categorical data analysis

Types of variables Numerical Categorical discrete continuous nominal ordinal

Confidence interval two sided confidence interval error table value – normal distribution u0,05 = 1,96

One – sample test (two sided) H0: π = π0 test criterion table value → u0,05 = 1,96

Two – sample test (two sided) H0: π1 = π2 test criterion table value → u0,05 = 1,96

Contingency tables 2 x 2 Var A/Var B B1 B2 Total A1 a b a+b A2 c d c+d b+d n

Contingency tables i x j Var A/Var B B1 B2 … Bj Total A1 n11 n12 n1j n1. A2 n21 n22 n2. Ai ni1 ni2 nij ni. n.1 n.2 n.j n

Testing independence in two-way CT Null hypothesis Alfa Test criterion computation Table value χ2[(m-1).(n-1)] Conditions for testing n > 40 → χ2 test n (20;40>, some expected frequency is < 5 → Fisher test, if all expected freq are > 5 → χ2 test n <= 20 → Fisher test

Theoretical frequencies Var A/Var B B1 B2 … Bj Total A1 n11 n12 n1j n1. A2 n21 n22 n2. Ai ni1 ni2 nij ni. n.1 n.2 n.j n

Testing independence in two-way CT Fisher test find cell with the lowest value go down by 1 in the cell (final value is 0), all marginal freq are the same computation of probability for each created table ∑pi > 0,05 → H0 is valid

Testing independence in CT table value χ2[(m-1).(n-1)] conditions for testing by χ2 test max 20 % of expected freq is < 5 no expected freq is less than 1

Intensity of dependent Pearson coefficient Cramer coefficient V, where h is min (k;m)

Two-way tables – dependent observation Mc Nemar test 2 x 2 table 1 group of units, observation „before – after“   2nd measure (after) + - 1st measure (before) a b c d

Two-way tables – dependent observation McNemar test test criterion table value χ2[(m-1).(n-1)] condition b + c > 8 correction

Chances and risks in two-way tables Threats Exposition Var A/Var B B1 B2 Total A1 a b a+b A2 c d c+d a+c b+d n

Chances and risks in two-way tables Relative risk (part 1) if the categories of the B variable are independent on categories of the A variable, RR1 = 1 RR > 1 → in the cell „a“ will be occur higher share of the total frequency than in the cell „c“ how many times is higher the probability of threats

Chances and risks in two-way tables Relative risk (part 2)

Chances and risks in two-way tables Odds ratio ratio of two alternatives of relative risk results (RR1 a RR2) OR values are between zero and infinity independent between variables → OR = 1 how many times is higher chance to threats

Chances and risks in two-way tables Attributive risk difference of probability of incidence B1 for both categories of the first variable – A1 and A2 AR is <-1;1> indicates changes of threat probability

Chances and risks in two-way tables Relative attributive risk is based on attributive risk and it is percentage change of probability of incidence B1 for both categories – A1 and A2 basis for computation is share of frequency incidence in the cell „a“ in relation to marginal frequency of category A1 (share a/(a+b))