Hypothesis Testing and Statistical Significance

Hypothesis Testing and Statistical Significance http://xkcd.com/539/

 Estimators and Correlation  Hypothesis Testing and Statistical Significance  labels and other questions 2

3 General definition, continuous and discrete variables: For discrete variables:

4  What is an estimator?  Often a trade-off between bias and variance

5 (Typeset equations courtesy http://en.wikipedia.org/wiki/Variance) Variance defined: Population variance: (have all obs 1…N) Two estimators of population variance:

6 vs. (Typeset equations courtesy http://en.wikipedia.org/wiki/Variance)

 Standard Deviation  Spread of a list  Single variables have SD 8  Standard Error  Spread of a chance process  Sampling Distributions have SE Graphics: Wikipedia

 Remember that a z-score tells us where a score is located within a distribution– specifically, how many standard deviation units the score is above or below the mean. 10

 For example, if we find a particular difference that is x standard errors wide, how confident are we that the difference is not just due to chance?  So… we can use z-scores on a normal curve to interpret how likely a given outcome is (how likely is it due to chance?) 11

 Example, you have a variable x with mean of 500 and S.D. of 15. How common is a score of 525?  Z = 525-500/15 = 1.67  If we look up the z-statistic of 1.67 in a z-score table, we find that the proportion of scores less than our value is.9525.  Or, a score of 525 exceeds.9525 of the population. (p <.05) 12

 z is a test statistic  More generally: z = observed – expected SE  Z tells us how many standard errors an observed value is from its expected value. 13

 A confidence interval is a range of scores above and below the mean.  The interval is in standard errors  It is the interval where we expect our value to be  A confidence coefficient is the likelihood that a given interval has the true value of the parameter  Sample value = true population value + error 14

 One-tailed  Directional Hypothesis  Probability at one end of the curve  Two-tailed  Non-directional Hypothesis  Probability is both ends of the curve 15

 Null Hypothesis:  H 0 : μ 1 = μ c ▪ μ 1 is the intervention population mean ▪ μ c is the control population mean 16 Alternative Hypotheses:  H 1 : μ 1 < μ c  H 1 : μ 1 > μ c

 Null Hypothesis:  H 0 : μ 1 = μ c ▪ μ 1 is the intervention population mean ▪ μ c is the control population mean 17 Alternative Hypothesis:  H 1 : μ 1 ≠ μ c

 Do Berkeley students read more or less than 8 hours a week?  H 0 : μ = 8 The mean for Berkeley students is equal to 8  H 1 : μ ≠ 8 The mean for Berkeley students is not equal to 8 18

 Do Berkeley students read more than 8 hours a week (the average for students across the country)?  H 0 : μ = 8 There is no difference between Berkeley students and other students  H 1 : μ > 8 The mean for Berkeley students is higher than the mean for all students 19

 A p-value is the observed significance level (more on this in a moment)  A test statistic depends on the data, as does p.  This chance assumes that the null hypothesis is correct. Thus, the smaller the chance (p-value), the morel likely that the null can be rejected.  The choice of a test statistic (e.g., z, t, F, Χ 2 ) depends on the model and they hypothesis being considered  The basic process is exactly the same, however. 20

 When p value >.10 → the observed difference is “not significant”  When p value ≤.10 → the observed difference is “marginally significant” or “borderline significant”  When p value ≤.05 → the observed difference is “significant”  When p value ≤.01 → the observed difference is “highly significant” 21

 We cannot hypothesize the null  As odd as it may seem at first, we reject or do not reject the null; a traditional hypothesis test tests against the null.  We never use the word proof with hypothesis testing and statistics, we reject or accept.  Prove has a specific meaning in mathematics and philosophy, but the term is misleading in statistics. 22

 Type I Error: falsely rejecting a null hypothesis (false positive)  Type II Error: Failing to reject the null hypothesis when it is false (false negative) 23

(The auto data) 24

Hypothesis Testing and Statistical Significance

Similar presentations

Presentation on theme: "Hypothesis Testing and Statistical Significance"— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

Hypothesis Testing and Statistical Significance

Similar presentations

Presentation on theme: "Hypothesis Testing and Statistical Significance"— Presentation transcript:

Similar presentations

About project

Feedback