Beginners statistics Assoc Prof Terry Haines. 5 simple steps 1.Understand the type of measurement you are dealing with 2.Understand the type of question.

Slides:



Advertisements
Similar presentations
SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic.
Advertisements

Inference for Regression
Bivariate Analyses.
MSS 905 Methods of Missiological Research
Statistics. Review of Statistics Levels of Measurement Descriptive and Inferential Statistics.
Statistical Tests Karen H. Hagglund, M.S.
Data Analysis Statistics. Inferential statistics.
Statistics II: An Overview of Statistics. Outline for Statistics II Lecture: SPSS Syntax – Some examples. Normal Distribution Curve. Sampling Distribution.
The Simple Regression Model
Statistical Analysis SC504/HS927 Spring Term 2008 Week 17 (25th January 2008): Analysing data.
Business 205. Review of Previous Class Milestone #1 Groups Math Review Symbolic Manipulation Excel Review.
Basic Statistics for Research: Choosing Appropriate Analyses and Using SPSS Dr. Beth A. Bailey Dr. Tiejian Wu Department of Family Medicine.
Data Analysis Statistics. Inferential statistics.
Data Analysis Statistics. Levels of Measurement Nominal – Categorical; no implied rankings among the categories. Also includes written observations and.
1 COMM 301: Empirical Research in Communication Kwan M Lee Lect3_1.
BASIC STATISTICS WE MOST OFTEN USE Student Affairs Assessment Council Portland State University June 2012.
Understanding Research Results
AS 737 Categorical Data Analysis For Multivariate
Selecting the Correct Statistical Test
Inference for regression - Simple linear regression
LEARNING PROGRAMME Hypothesis testing Intermediate Training in Quantitative Analysis Bangkok November 2007.
Fundamentals of Statistical Analysis DR. SUREJ P JOHN.
CHAPTER 4 Research in Psychology: Methods & Design
Chapter Eleven A Primer for Descriptive Statistics.
© 2006 McGraw-Hill Higher Education. All rights reserved. Numbers Numbers mean different things in different situations. Consider three answers that appear.
User Study Evaluation Human-Computer Interaction.
T-TEST Statistics The t test is used to compare to groups to answer the differential research questions. Its values determines the difference by comparing.
Experimental Research Methods in Language Learning Chapter 11 Correlational Analysis.
Hypothesis of Association: Correlation
Association between 2 variables
Statistical analysis Prepared and gathered by Alireza Yousefy(Ph.D)
© 2006 McGraw-Hill Higher Education. All rights reserved. Numbers Numbers mean different things in different situations. Consider three answers that appear.
Hypothesis testing Intermediate Food Security Analysis Training Rome, July 2010.
SPSS Basics and Applications Workshop: Introduction to Statistics Using SPSS.
By: Amani Albraikan.  Pearson r  Spearman rho  Linearity  Range restrictions  Outliers  Beware of spurious correlations….take care in interpretation.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
+ Chapter 12: More About Regression Section 12.1 Inference for Linear Regression.
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT OSMAN BIN SAIF Session 26.
Correlation & Regression Chapter 15. Correlation It is a statistical technique that is used to measure and describe a relationship between two variables.
CHI SQUARE TESTS.
Going from data to analysis Dr. Nancy Mayo. Getting it right Research is about getting the right answer, not just an answer An answer is easy The right.
Commonly Used Statistics in the Social Sciences Chi-square Correlation Multiple Regression T-tests ANOVAs.
N318b Winter 2002 Nursing Statistics Specific statistical tests Chi-square (  2 ) Lecture 7.
Chapter Eight: Using Statistics to Answer Questions.
Inferential Statistics. Explore relationships between variables Test hypotheses –Research hypothesis: a statement of the relationship between variables.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
1 Week 3 Association and correlation handout & additional course notes available at Trevor Thompson.
STATS 10x Revision CONTENT COVERED: CHAPTERS
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
Cross Tabs and Chi-Squared Testing for a Relationship Between Nominal/Ordinal Variables.
Chapter 13 Understanding research results: statistical inference.
T-tests Chi-square Seminar 7. The previous week… We examined the z-test and one-sample t-test. Psychologists seldom use them, but they are useful to understand.
Jump to first page Inferring Sample Findings to the Population and Testing for Differences.
PXGZ6102 BASIC STATISTICS FOR RESEARCH IN EDUCATION
Lesson 3 Measurement and Scaling. Case: “What is performance?” brandesign.co.za.
Interpretation of Common Statistical Tests Mary Burke, PhD, RN, CNE.
Choosing and using your statistic. Steps of hypothesis testing 1. Establish the null hypothesis, H 0. 2.Establish the alternate hypothesis: H 1. 3.Decide.
Appendix I A Refresher on some Statistical Terms and Tests.
CHAPTER 15: THE NUTS AND BOLTS OF USING STATISTICS.
Data measurement, probability and Spearman’s Rho
CHAPTER 4 Research in Psychology: Methods & Design
APPROACHES TO QUANTITATIVE DATA ANALYSIS
T-Tests Chapters 14 and 13.
Basic Statistics Overview
POSC 202A: Lecture Lecture: Substantive Significance, Relationship between Variables 1.
Introduction to Statistics
Ass. Prof. Dr. Mogeeb Mosleh
Association, correlation and regression in biomedical research
15.1 The Role of Statistics in the Research Process
Presentation transcript:

Beginners statistics Assoc Prof Terry Haines

5 simple steps 1.Understand the type of measurement you are dealing with 2.Understand the type of question you are asking 3.Select a test 1.Focus today on tests of difference 4.Check assumptions where relevant 5.Run the test

Measurement Assigning numerals to variables – Nominal – Ordinal – Interval – Ratio – Count

Nominal Categories without order – Gender Male / Female – Diagnosis Orthopaedic / neurological / cardiorespiratory

Nominal Entering categorical data on a spreadsheet – Binary / dichotomous data Eg. gender One column (female=0, male=1) – Polytomous data Eg. Diagnosis Can have one column (ortho=0, neuro=1, cardio=1) – Risk that the numeric values will be misused Can have three “dummy” variables / columns – Ortho (no=0, yes=1) – Neuro (no=0, yes=1) – Cardio (no=0, yes=1)

Ordinal Categories with order, but we don’t know how much better one place is than another – Finishing order in a race 1 st, 2 nd, 3 rd – Likert scaled surveys Strongly agree, agree, undecided, disagree, strongly disagree – Entering data One column – make sure you record what numbers mean

Interval Equal intervals between numbers, but not a true zero – Eg. Degrees centigrade, IQ test scores, calendar years AD – Entering data Input the number

Ratio Equal intervals between numbers, a true zero – Eg. Distance, age, time, weight – Entering data Input the number

Count Whole, non-negative numbers indicating the frequency of an event – Eg. Number of falls, number of steps, number of therapy sessions

Manipulating data Can turn a higher level of measurement into a lower level, but not vice versa – Eg. IQ scores 0-50 below average average above average This leads to a “loss” of data and can conceal the true relationship between two variables This converts interval data to ordinal

Measurement Nominal, ordinal, interval, ratio, count Can manipulate data down this scale but not up – Be careful in doing this – Loss of data – Would need a really good reason to do so Questions on measurement scales?

What sort of question is being asked? Is A≠B? Is A>B? Is A<B? Is A=B? Is A~B? Difference Agreement / reliability / prediction Correlation

Difference AB AB AB

The confusing thing is that we test a null hypothesis. – What is the probability that there is no difference in the broader population For the one null hypothesis, there are three alternate hypotheses possible – Is A≠B? – Is A>B? – Is A<B? The magnitude of difference can also be measured

Agreement / reliability / prediction To what extent do two variables tell us exactly the same thing, or can one variable predict a later variable? AB AB

Agreement / reliability / prediction The statistical procedures of agreement / reliability / prediction test a null hypothesis – What is the probability that the amount of agreement / reliability / prediction observed occurred by chance? The magnitude of agreement can also be described

Correlation To what extent do two variables co-relate to each other – They do not have to agree in order to co-relate The statistical procedures of correlation test a null hypothesis – What is the probability that the amount of association observed occurred by chance? The magnitude of correlation can also be described

Understand the question Any questions on – Difference – Agreement / reliability / prediction – Correlation

Statistical testing Why do it? Eg. The average height of men in this room is 179 cms, the average height of women is 163 cms. I know the men in this room are taller by 16 cms – Why do a test?

Statistical testing We normally want to extrapolate the results from our sample to a broader population It is the nature of the relationship between A and B in the broader population that is of greatest interest than what is going on just inside this room

Select a test Tests will vary depending on – Measurement scale of variable A and variable B – The type of question being asked – Whether there are repeated measures or correlated samples involved

Tests of difference Variable AVariable B Tests for independent groups / repeated measures or correlated samples Nominal Nominal, 2 groupsChi 2 test (Fisher Exact test for small samples), logistic regression, relative risk, McNemar test, logistic regression with clustering Ordinal Nominal, 2 groupsMann-Whitney U, ordinal logistic regression, Wilcoxon test, ordinal logistic regression with clustering Interval / ratio Nominal, 2 groupsUnpaired t-test (equal / unequal variance), linear regression, Cox regression, paired t- test, linear regression with clustering, Cox regression with clustering Count Nominal, 2 groupsPoisson regression, Poisson regression with clustering, can use ratio tests also if normally distributed

Mock data

T-test versus regression Variable AVariable B Tests for independent groups / repeated measures or correlated samples Nominal Nominal, 2 groupsChi 2 test (Fisher Exact test for small samples), logistic regression, relative risk, McNemar test, logistic regression with clustering Ordinal Nominal, 2 groupsMann-Whitney U, ordinal logistic regression, Wilcoxon test, ordinal logistic regression with clustering Interval / ratio Nominal, 2 groupsUnpaired t-test (equal / unequal variance), linear regression, Cox regression, paired t- test, linear regression with clustering, Cox regression with clustering Count Nominal, 2 groupsPoisson regression, Poisson regression with clustering, can use ratio tests also if normally distributed

What the data says

Is there a difference? T-test

Is there a difference? Regression

Selecting a test: Correlation First check visually, then Pearson’s R Can also use linear regression for further description of the correlation

Correlation Height vs weight – Pearson’s r

Regression

Regression line is line of best fit

Y = bX + c What do these numbers mean? For each one unit increase in weight, there is a 0.87 increase in height. Height = 0.87*weight

Does this work when one variable is dichotomous? Height = 13.3*gender(0,1)

Some tricky questions Can we have: – A is different to B, but A correlates with B? – A agrees with B, and A correlates with B? – A is not different to B, and A does not correlate with B?

Some more mock data

A and B are different, but highly correlated Confidence intervals so narrow and p-value so low they can’t be calculated

A and C have a negative correlation, and are different

B and D are not different, and not correlated

But is there really no relationship here? Linear regression only looks for linear (straight line) relationships. Data transformations or other forms of regression are needed here.

Checking assumptions Many assumptions surround most statistical tests – Need to check to make sure you are doing the right thing by your data – There are specific tests to check assumptions – When in doubt, use visual examination of your data

Run the tests Can use Excel for some tests – Gives you a single number output We have been using Stata today – Lot’s more output to help you interpret your data

Any questions? Next month – 31 st March Starting small and research question development Dr Elizabeth Skinner