AnnMaria De Mars, Ph.D. The Julia Group Santa Monica, CA Categorical data analysis: For when your data DO fit in little boxes.

Slides:



Advertisements
Similar presentations
David Pieper, Ph.D. STATISTICS David Pieper, Ph.D.
Advertisements

COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.
Inferential Statistics
EPI 809/Spring Probability Distribution of Random Error.
Hypothesis test flow chart frequency data Measurement scale number of variables 1 basic χ 2 test (19.5) Table I χ 2 test for independence (19.9) Table.
1 If we live with a deep sense of gratitude, our life will be greatly embellished.
EPI 809 / Spring 2008 Final Review EPI 809 / Spring 2008 Ch11 Regression and correlation  Linear regression Model, interpretation. Model, interpretation.
QUANTITATIVE DATA ANALYSIS
UNDERSTANDING RESEARCH RESULTS: STATISTICAL INFERENCE © 2012 The McGraw-Hill Companies, Inc.
Today Concepts underlying inferential statistics
Data Analysis Statistics. Levels of Measurement Nominal – Categorical; no implied rankings among the categories. Also includes written observations and.
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Hypothesis Testing Using The One-Sample t-Test
Chapter 14 Inferential Data Analysis
Nonparametric and Resampling Statistics. Wilcoxon Rank-Sum Test To compare two independent samples Null is that the two populations are identical The.
Statistical hypothesis testing – Inferential statistics II. Testing for associations.
COURSE: JUST 3900 Tegrity Presentation Developed By: Ethan Cooper Final Exam Review.
Inferential Statistics
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
AM Recitation 2/10/11.
Estimation and Hypothesis Testing Faculty of Information Technology King Mongkut’s University of Technology North Bangkok 1.
Inferential Statistics: SPSS
LIS 570 Summarising and presenting data - Univariate analysis continued Bivariate analysis.
Hypothesis Testing:.
Overview of Statistical Hypothesis Testing: The z-Test
PowerPoint presentation to accompany Research Design Explained 6th edition ; ©2007 Mark Mitchell & Janina Jolley Chapter 7 Introduction to Descriptive.
230 Jeopardy Unit 4 Chi-Square Repeated- Measures ANOVA Factorial Design Factorial ANOVA Correlation $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 17 Inferential Statistics.
Copyright © 2008 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 22 Using Inferential Statistics to Test Hypotheses.
Introduction To Biological Research. Step-by-step analysis of biological data The statistical analysis of a biological experiment may be broken down into.
Proc freq: Five secrets* *Okay, well, lesser known facts.
User Study Evaluation Human-Computer Interaction.
Data Analysis (continued). Analyzing the Results of Research Investigations Two basic ways of describing the results Two basic ways of describing the.
Hypothesis Testing Using the Two-Sample t-Test
Associate Professor Arthur Dryver, PhD School of Business Administration, NIDA url:
Categorical Data Analysis: When life fits in little boxes AnnMaria DeMars, PhD.
Inference and Inferential Statistics Methods of Educational Research EDU 660.
The Logic of Statistical Significance & Making Statistical Decisions
Categorical Data Analysis: When life fits in little boxes AnnMaria DeMars, PhD.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
STATISTICAL ANALYSIS FOR THE MATHEMATICALLY-CHALLENGED Associate Professor Phua Kai Lit School of Medicine & Health Sciences Monash University (Sunway.
CHI SQUARE TESTS.
Categorical data analysis: An overview of statistical techniques AnnMaria De Mars The Julia Group AnnMaria De Mars The Julia Group.
Experimental Research Methods in Language Learning Chapter 10 Inferential Statistics.
ANALYSIS PLAN: STATISTICAL PROCEDURES
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
Fundamental Statistics in Applied Linguistics Research Spring 2010 Weekend MA Program on Applied English Dr. Da-Fu Huang.
Session 13: Correlation (Zar, Chapter 19). (1)Regression vs. correlation Regression: R 2 is the proportion that the model explains of the variability.
Going from data to analysis Dr. Nancy Mayo. Getting it right Research is about getting the right answer, not just an answer An answer is easy The right.
Why this is useful  Failure as a statistician/ analyst often is failure to clearly communicate  Need to communicate results to non-technical decision-
1 STA 617 – Chp10 Models for matched pairs Summary  Describing categorical random variable – chapter 1  Poisson for count data  Binomial for binary.
1 G Lect 7a G Lecture 7a Comparing proportions from independent samples Analysis of matched samples Small samples and 2  2 Tables Strength.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Soc 3306a Lecture 7: Inference and Hypothesis Testing T-tests and ANOVA.
Beginning Statistics Table of Contents HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2008 by Hawkes Learning Systems/Quant Systems, Inc.
Chapter 8: Using Basic Statistical Procedures “33⅓% of the mice used in the experiment were cured by the test drug; 33⅓% of the test population were unaffected.
Power Point Slides by Ronald J. Shope in collaboration with John W. Creswell Chapter 7 Analyzing and Interpreting Quantitative Data.
Nonparametric Statistics
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
CHOOSING A STATISTICAL TEST
SA3202 Statistical Methods for Social Sciences
BIVARIATE ANALYSIS: Measures of Association Between Two Variables
BIVARIATE ANALYSIS: Measures of Association Between Two Variables
Categorical Data Analysis
Chapter 18: The Chi-Square Statistic
REVIEW Course Review.
Presentation transcript:

AnnMaria De Mars, Ph.D. The Julia Group Santa Monica, CA Categorical data analysis: For when your data DO fit in little boxes

Anyone who thinks he knows all of SAS is clinically insane Okay, Hemingway didn’t really say that, but he should have

Descriptive Statistics PROC FREQ * PROC UNIVARIATE PROC TABULATE ODS graphs * SAS/Graph Graph – N- Go SAS Enterprise Guide

Other PROCs LOGISTIC * CATMOD CORRESP PRINQUAL SURVEYLOGISTIC

Hybrids T-test ANOVA NPAR1WAY FACTOR REG

Our secret plan Descriptives Chi-square Secrets of PROC FREQ Logistic regression

Homes without computers have fewer books Graphs with SAS On-Demand

You keep saying that word

PROC FREQ DATA = dsname ; TABLES varname1 * varname2 / chisq ; YOU GET  Chi-square value (several)  Phi coefficient  Fisher Exact test (where applicable)

Pearson Chi-Square  Tests for a relationship between two categorical variables, e.g. whether having participated in a program is related to having a correct answer on a test.  Assumes randomly sampled data  Assumes independent observations  Assumes large samples  Tests for a relationship between two categorical variables, e.g. whether having participated in a program is related to having a correct answer on a test.  Assumes randomly sampled data  Assumes independent observations  Assumes large samples

Mothers Education & Failing a Grade

Fisher’s exact test Is used when the assumption of large sample sizes cannot be met There is no advantage to using it if you do have large sample sizes

Test for bias in sample

Fisher – magically happens

Other simple statistics Binomial tests Confidence intervals McNemar Odds ratios Cochran-Mantel- Haenszel test Because, obviously, not everyone has the same tastes

 PROC FREQ DATA = dsname ; TABLES varname / BINOMIAL (EXACT P =.333) ALPHA =.05 ;

 The binomial (equiv p =.333) will produce a test that the population proportion is.333 for the first category. That is “No” for death. A Z-value will be produced and probabilities for one-tail and two-tailed tests.  The exact keyword will produce confidence intervals and, since I have specified alpha =.05, these will be the 95% confidence intervals.

Some More Coding PROC FREQ DATA = dsname ; TABLES varname1 * varname2 / AGREE ; FOR CORRELATED DATA

Correlated Data

McNemar’s Test

Cohen’s Kappa 1.0 = perfect agreement Negative Kappa is not an error, it means the two agree less than chance = Probability observed – Probability expected 1 – Probability expected