LEARNING PROGRAMME Hypothesis testing Intermediate Training in Quantitative Analysis Bangkok 19-23 November 2007.

Slides:



Advertisements
Similar presentations
Hypothesis testing 5th - 9th December 2011, Rome.
Advertisements

SPSS Session 5: Association between Nominal Variables Using Chi-Square Statistic.
LEARNING PROGRAMME Hypothesis testing Part 2: Categorical variables Intermediate Training in Quantitative Analysis Bangkok November 2007.
Independent t -test Features: One Independent Variable Two Groups, or Levels of the Independent Variable Independent Samples (Between-Groups): the two.
Inference for Regression
Analysis of variance (ANOVA)-the General Linear Model (GLM)
5/15/2015Slide 1 SOLVING THE PROBLEM The one sample t-test compares two values for the population mean of a single variable. The two-sample test of a population.
Correlation Chapter 9.
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
Intro to Statistics for the Behavioral Sciences PSYC 1900
Matching level of measurement to statistical procedures
Linear Regression and Correlation Analysis
Topic 3: Regression.
RESEARCH STATISTICS Jobayer Hossain Larry Holmes, Jr November 6, 2008 Examining Relationship of Variables.
Correlations and T-tests
Dr. Mario MazzocchiResearch Methods & Data Analysis1 Correlation and regression analysis Week 8 Research Methods & Data Analysis.
Today Concepts underlying inferential statistics
Data Analysis Statistics. Levels of Measurement Nominal – Categorical; no implied rankings among the categories. Also includes written observations and.
Correlation Analysis 5th - 9th December 2011, Rome.
Correlation and Regression Analysis
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Chapter 14 Inferential Data Analysis
Descriptive measures of the strength of a linear association r-squared and the (Pearson) correlation coefficient r.
Correlation Question 1 This question asks you to use the Pearson correlation coefficient to measure the association between [educ4] and [empstat]. However,
SPSS Session 4: Association and Prediction Using Correlation and Regression.
Relationships Among Variables
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Leedy and Ormrod Ch. 11 Gray Ch. 14
1 Doing Statistics for Business Doing Statistics for Business Data, Inference, and Decision Making Marilyn K. Pelosi Theresa M. Sandifer Chapter 11 Regression.
Statistics for the Social Sciences Psychology 340 Fall 2013 Thursday, November 21 Review for Exam #4.
Psy B07 Chapter 1Slide 1 ANALYSIS OF VARIANCE. Psy B07 Chapter 1Slide 2 t-test refresher  In chapter 7 we talked about analyses that could be conducted.
Estimation and Hypothesis Testing Faculty of Information Technology King Mongkut’s University of Technology North Bangkok 1.
Inferential Statistics: SPSS
Introduction to Linear Regression and Correlation Analysis
Correlation and Linear Regression
SPSS Series 1: ANOVA and Factorial ANOVA
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
T-TEST Statistics The t test is used to compare to groups to answer the differential research questions. Its values determines the difference by comparing.
Statistical Analysis. Statistics u Description –Describes the data –Mean –Median –Mode u Inferential –Allows prediction from the sample to the population.
Correlation and Regression Used when we are interested in the relationship between two variables. NOT the differences between means or medians of different.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Hypothesis testing Intermediate Food Security Analysis Training Rome, July 2010.
6/2/2016Slide 1 To extend the comparison of population means beyond the two groups tested by the independent samples t-test, we use a one-way analysis.
Recap of data analysis and procedures Food Security Indicators Training Bangkok January 2009.
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT OSMAN BIN SAIF Session 26.
Analysis of Variance 1 Dr. Mohammed Alahmed Ph.D. in BioStatistics (011)
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 13 Multiple Regression Section 13.3 Using Multiple Regression to Make Inferences.
Chapter 16 Data Analysis: Testing for Associations.
ANOVA: Analysis of Variance.
Chapter 13 - ANOVA. ANOVA Be able to explain in general terms and using an example what a one-way ANOVA is (370). Know the purpose of the one-way ANOVA.
SW318 Social Work Statistics Slide 1 One-way Analysis of Variance  1. Satisfy level of measurement requirements  Dependent variable is interval (ordinal)
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
STATS 10x Revision CONTENT COVERED: CHAPTERS
PART 2 SPSS (the Statistical Package for the Social Sciences)
26134 Business Statistics Week 4 Tutorial Simple Linear Regression Key concepts in this tutorial are listed below 1. Detecting.
Jump to first page Inferring Sample Findings to the Population and Testing for Differences.
© The McGraw-Hill Companies, Inc., Chapter 10 Correlation and Regression.
Independent Samples ANOVA. Outline of Today’s Discussion 1.Independent Samples ANOVA: A Conceptual Introduction 2.The Equal Variance Assumption 3.Cumulative.
Chapter 7: Hypothesis Testing. Learning Objectives Describe the process of hypothesis testing Correctly state hypotheses Distinguish between one-tailed.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Interpretation of Common Statistical Tests Mary Burke, PhD, RN, CNE.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 9 l Simple Linear Regression 9.1 Simple Linear Regression 9.2 Scatter Diagram 9.3 Graphical.
CHAPTER 15: THE NUTS AND BOLTS OF USING STATISTICS.
Regression Analysis.
Dr. Siti Nor Binti Yaacob
Regression Analysis Simple Linear Regression
Multiple Regression.
Simple Linear Regression
Hypothesis Testing Part 2: Categorical variables
Some statistics questions answered:
Presentation transcript:

LEARNING PROGRAMME Hypothesis testing Intermediate Training in Quantitative Analysis Bangkok November 2007

LEARNING PROGRAMME - 2 Hypothesis testing Hypothesis testing involves: 1.defining research questions and 2.assessing whether changes in an independent variable are associated with changes in the dependent variable by conducting a statistical test Dependent and independent variables  Dependent variables are the outcome variables  Independent variables are the predictive/ explanatory variables

LEARNING PROGRAMME - 3 Example…  Research question: Is educational level of the mother related to birthweight?  What is the dependent and independent variable?  Research question: Is access to roads related to educational level of mothers?  Now?

LEARNING PROGRAMME - 4 Tests statistics  To test hypotheses, we rely on test statistics…  Test statistics are simply the result of a particular statistical test The most common include: 1.T-tests calculate T-statistics 2.ANOVAs calculate F-statistics 3.Correlations calculate the pearson correlation coefficient

LEARNING PROGRAMME - 5 Significant test statistic  Is the relationship observed by chance, or because there actually is a relationship between the variables???  This probability is referred to as a p-value and is expressed a decimal percent (ie. p=0.05)  If the probability of obtaining the value of our test statistic by chance is less than 5% then we generally accept the experimental hypothesis as true: there is an effect on the population  Ex: if p=0.1-- What does this mean? Do we accept the experimental hypothesis?  This probability is also referred to as significance level (sig.)

LEARNING PROGRAMME Hypothesis testing Part 1: Continuous variables Intermediate Training in Quantitative Analysis Bangkok November 2007

LEARNING PROGRAMME - 7 Topics to be covered in this presentation  T- test  One way analysis of variance (ANOVA)  Correlation  Simple linear regression

LEARNING PROGRAMME - 8 Learning objectives By the end of this session, the participant should be able to:  Conduct t-tests  Conduct ANOVA  Conduct correlations  Conduct linear regressions

LEARNING PROGRAMME - 9 Hypothesis testing… WFP tests a variety of hypothesis… Some of the most common include: 1. Looking at differences between groups of people (comparisons of means) Ex. Are different livelihood groups more likely to have different levels food consumption?? 2. Looking at the relationship between two variables… Ex. Is asset wealth associated with food consumption??

LEARNING PROGRAMME - 10 How to assess differences in two means statistically T-tests

LEARNING PROGRAMME - 11 T-test A test using the t-statistic that establishes whether two means differ significantly. Independent means t-test:  It is used in situations in which there are two experimental conditions and different participants have been used in each condition. Dependent or paired means t-test:  This test is used when there are two experimental conditions and the same participants took part in both conditions of experiment.

LEARNING PROGRAMME - 12 T-test assumptions In order to conduct a T-test, data must be:  Normally distributed  Interval  Estimates are independent  Homogeneity of variance Independent and dependent t-tests Independent t-tests

LEARNING PROGRAMME - 13 The independent t-test  The independent t-test compares two means, when those means have come from different groups of people;  This test is the most useful for our purposes

LEARNING PROGRAMME - 14 T-tests formulas Quite simply, the T-test formula is a ratio of the: Difference between the two means or averages/ the variability or dispersion of the scores Statistically this formula is:

LEARNING PROGRAMME - 15 Example T-tests Difference in weight for age z-scores between males and females in Kenya T-test = T-test = 5.56

LEARNING PROGRAMME - 16 To conduct an independent t- test In SPSS, t-tests are best run using the following steps: 1.Click on “Analyze” drop down menu 2.Click on “Compare Means” 3.Click on “Independent- Sample T-Test…” 4.Move the independent and dependent variable into proper boxes 5.Click “OK”

LEARNING PROGRAMME - 17 One note of caution about independent t-tests It is important to ensure that the assumption of homogeneity of variance (sometimes referred to as homoschedasticity) is met To do so: Look at the column labelled Levene’s Test for Equality of Variance. If the Sig. value is less than.05 then the assumption of homogeneity of variance has been broken and you should look at the row in the table labelled Equal variances not assumed. If the Sig. value of Levene’s test is bigger than.05 then you should look at the row in the table labelled Equal variances assumed.

LEARNING PROGRAMME - 18 Testing for homogeneity of variance  Look at the column labelled Sig. : if the value is less than.05 then the means of the two groups are significantly different.  Look at the values of the means to tell you how the groups differ.

LEARNING PROGRAMME - 19 What to do if we want to statistically compare differences in three means? Analysis of variance (ANOVA)

LEARNING PROGRAMME - 20 Analysis of Variance (ANOVA)  ANOVAs, however, produce an F-statistic, which is an omnibus test, i.e. it tells us if there are any difference among the different means but not how (or which) means differ.  ANOVAs are similar to t-tests and in fact an ANOVA conducted to compare two means will give the same answer as a t-test.

LEARNING PROGRAMME - 21 Calculating an ANOVA ANOVA formulas: calculating an ANOVA by hand is complicated and knowing the formulas are not necessary… Instead, we will rely on SPSS to calculate ANOVAs…

LEARNING PROGRAMME - 22 Example of One-Way ANOVAs Research question: Do mean child malnutrition (GAM) rates differ according to mother’s educational level (none, primary, or secondary/ higher)?

LEARNING PROGRAMME - 23 To calculate one-way ANOVAs in SPSS In SPSS, one-way ANOVAs are run using the following steps:  Click on “Analyze” drop down menu 1.Click on “Compare Means” 2.Click on “One-Way ANOVA…” 3.Move the independent (factor) and dependent variable into proper boxes 4.Click “OK”

LEARNING PROGRAMME - 24 Determining where differences exist In addition to determining that differences exist among the means, you may want to know which means differ. There is one type of test for comparing means:  Post hoc tests are run after the experiment has been conducted (if you don’t have specific hypothesis).

LEARNING PROGRAMME - 25 ANOVA post hoc tests Once you have determined that differences exist among the means, post hoc range tests and pairwise multiple comparisons can determine which means differ. Tukeys post hoc test is the amongst the most popular and are adequate for our purposes…so we will focus on this test…

LEARNING PROGRAMME - 26 To calculate Tukeys test in SPSS In SPSS, Tukeys post hoc tests are run using the following steps: 1.Click on “Analyze” drop down menu 2.Click on “Compare Means” 3.Click on “One-Way ANOVA…” 4.Move the independent and dependent variable into proper boxes 5.Click on “Post Hoc…” 6.Check box beside “Tukey” 7.Click “Continue” 8.Click “OK”

LEARNING PROGRAMME - 27 Tukey’s post hoc test

LEARNING PROGRAMME - 28 Other types of Post Hoc tests There are lots of different post hoc tests, characterized by different adjustment/ setting of the error rate for each test and for multiple comparisons. if interested, please feel free to investigate more and to try different tests – SPSS help might provide you some good hints!

LEARNING PROGRAMME - 29 Now what if we would like to measure how well two variables are associated with one another? Correlations

LEARNING PROGRAMME - 30 Correlations  T-tests and ANOVAs measure differences between means  Correlations explain the strength of the linear relationship between two variables…  Pearson correlation coefficients (r) are the test statistics used to statistically measure correlations

LEARNING PROGRAMME - 31 Types of correlations  Positive correlations: Two variables are positively correlated if increases (or decreases) in one variable results in increases (or decreases) in the other variable.  Negative correlations: Two variables are negatively correlated if one increases (or decreases) and the other decreases (on increases).  No correlations: Two variables are not correlated if there is no linear relationship between them. Strong negative correlation No correlationStrong positive correlation

LEARNING PROGRAMME - 32 Illustrating types of correlations Perfect positive correlation Test statistic= 1 Positive correlation Test statistics>0 and <1 Perfect negative correlation Test statistic= -1 Negative correlation Test statistic -1

LEARNING PROGRAMME - 33 Example for the Kenya Data Correlation between children’s weight and height… Is this a positive or negative correlation?? In what range would the test statistics fall?

LEARNING PROGRAMME - 34 Measuring the strength of a correlation: Pearson’s correlation coefficient Pearson correlation coefficient (r) is the name of the test statistic It is measured using the following formula: Looks complicated and we will rely on spss to calculate them…

LEARNING PROGRAMME - 35 To calculate a Pearson’s correlation coefficient in SPSS In SPSS, correlations are run using the following steps: 1.Click on “Analyze” drop down menu 2.Click on “Correlate” 3.Click on “Bivariate…” 4.Move the variables that you are interested in assessing the correlation between into the box on the right 5.Click “OK ”

LEARNING PROGRAMME - 36 example in SPSS… Using SPSS we get Pearson’s correlation (0.932)

LEARNING PROGRAMME Lets refresh briefly, what does a correlation of mean?? 2.What does *** mean?

LEARNING PROGRAMME - 38 What if we are interested in defining this relationship further by assessing how change in one variable specifically impacts the other variable? Linear regression

LEARNING PROGRAMME - 39 Linear regression  Allows to statistically model the relationship between variables…  allowing us to determine how change in one unit of an independent variable specifically impacts

LEARNING PROGRAMME - 40 Types of linear regression There are two types of linear regression: 1.Simple linear regression 2.Multiple linear regression 3.Simple linear regression compares two variables, assessing how the dependent affects the independent (as discussed) 4.Multiple linear regression is more complicated– this involves assessing the relationship of two variables, while taking account of the impact of other variables. 5.We will focus only on simple linear regression…

LEARNING PROGRAMME - 41 The mechanics of simple linear regression…put simply  Linear regression allows us to linearly model the relationship between two variables (in this case x and y), allowing us to predict how one variable would respond given changes in another  Linear regression actually fits the line that best shows the relationship between x and y and provides the equation for this line  Y = a + b x  Y= dependent variable  a= constant coefficient  b= independent variable coefficient  Using this equation we can predict changes in dependent variables, given changes in the independent variable

LEARNING PROGRAMME - 42 Simple linear regression To illustrate, lets return to the previous example of wealth index and FCS Here, the correlation coefficient (0.932) indicates that increases in wealth index are associated with increases in FCS. Conducting a linear regression would allow us to estimate specifically how FCS increases given increases in units of wealth index

LEARNING PROGRAMME - 43 Simple linear regression Regressing FCS by wealth index gives the following output: Using this output, we can build the regression equation… Y = a + b x Y= FCS a= b= x= wealth index

LEARNING PROGRAMME - 44 Compiling the equation…  FCS= (wealth index)  What if we wanted to predict the FCS of a households in this population who had an wealth index of 0.569?  FCS= (0.569)  FCS=  What would the predicted FCS of a household be if the wealth index is:  2.256?  ?

LEARNING PROGRAMME - 45 To calculate a linear regression in SPSS…  In SPSS, correlations are run using the following steps: 1.Click on “Analyze” drop down menu 2.Click on “Regression” 3.Click on “Linear…” 4.Move the independent and dependent variables into the proper boxes 5.Click “OK”

LEARNING PROGRAMME - 46  Now… practical exercise!