Presentation is loading. Please wait.

Presentation is loading. Please wait.

Hypothesis Testing and Statistical Significance

Similar presentations


Presentation on theme: "Hypothesis Testing and Statistical Significance"— Presentation transcript:

1 Hypothesis Testing and Statistical Significance http://xkcd.com/539/

2  Estimators and Correlation  Hypothesis Testing and Statistical Significance  labels and other questions 2

3 3 General definition, continuous and discrete variables: For discrete variables:

4 4  What is an estimator?  Often a trade-off between bias and variance

5 5 (Typeset equations courtesy http://en.wikipedia.org/wiki/Variance) Variance defined: Population variance: (have all obs 1…N) Two estimators of population variance:

6 6 vs. (Typeset equations courtesy http://en.wikipedia.org/wiki/Variance)

7 7

8  Standard Deviation  Spread of a list  Single variables have SD 8  Standard Error  Spread of a chance process  Sampling Distributions have SE Graphics: Wikipedia

9 9

10  Remember that a z-score tells us where a score is located within a distribution– specifically, how many standard deviation units the score is above or below the mean. 10

11  For example, if we find a particular difference that is x standard errors wide, how confident are we that the difference is not just due to chance?  So… we can use z-scores on a normal curve to interpret how likely a given outcome is (how likely is it due to chance?) 11

12  Example, you have a variable x with mean of 500 and S.D. of 15. How common is a score of 525?  Z = 525-500/15 = 1.67  If we look up the z-statistic of 1.67 in a z-score table, we find that the proportion of scores less than our value is.9525.  Or, a score of 525 exceeds.9525 of the population. (p <.05) 12

13  z is a test statistic  More generally: z = observed – expected SE  Z tells us how many standard errors an observed value is from its expected value. 13

14  A confidence interval is a range of scores above and below the mean.  The interval is in standard errors  It is the interval where we expect our value to be  A confidence coefficient is the likelihood that a given interval has the true value of the parameter  Sample value = true population value + error 14

15  One-tailed  Directional Hypothesis  Probability at one end of the curve  Two-tailed  Non-directional Hypothesis  Probability is both ends of the curve 15

16  Null Hypothesis:  H 0 : μ 1 = μ c ▪ μ 1 is the intervention population mean ▪ μ c is the control population mean 16 Alternative Hypotheses:  H 1 : μ 1 < μ c  H 1 : μ 1 > μ c

17  Null Hypothesis:  H 0 : μ 1 = μ c ▪ μ 1 is the intervention population mean ▪ μ c is the control population mean 17 Alternative Hypothesis:  H 1 : μ 1 ≠ μ c

18  Do Berkeley students read more or less than 8 hours a week?  H 0 : μ = 8 The mean for Berkeley students is equal to 8  H 1 : μ ≠ 8 The mean for Berkeley students is not equal to 8 18

19  Do Berkeley students read more than 8 hours a week (the average for students across the country)?  H 0 : μ = 8 There is no difference between Berkeley students and other students  H 1 : μ > 8 The mean for Berkeley students is higher than the mean for all students 19

20  A p-value is the observed significance level (more on this in a moment)  A test statistic depends on the data, as does p.  This chance assumes that the null hypothesis is correct. Thus, the smaller the chance (p-value), the morel likely that the null can be rejected.  The choice of a test statistic (e.g., z, t, F, Χ 2 ) depends on the model and they hypothesis being considered  The basic process is exactly the same, however. 20

21  When p value >.10 → the observed difference is “not significant”  When p value ≤.10 → the observed difference is “marginally significant” or “borderline significant”  When p value ≤.05 → the observed difference is “significant”  When p value ≤.01 → the observed difference is “highly significant” 21

22  We cannot hypothesize the null  As odd as it may seem at first, we reject or do not reject the null; a traditional hypothesis test tests against the null.  We never use the word proof with hypothesis testing and statistics, we reject or accept.  Prove has a specific meaning in mathematics and philosophy, but the term is misleading in statistics. 22

23  Type I Error: falsely rejecting a null hypothesis (false positive)  Type II Error: Failing to reject the null hypothesis when it is false (false negative) 23

24 (The auto data) 24


Download ppt "Hypothesis Testing and Statistical Significance"

Similar presentations


Ads by Google