Download presentation
Presentation is loading. Please wait.
Published byMaximillian Baker Modified over 8 years ago
1
Hypothesis Testing and Statistical Significance http://xkcd.com/539/
2
Estimators and Correlation Hypothesis Testing and Statistical Significance labels and other questions 2
3
3 General definition, continuous and discrete variables: For discrete variables:
4
4 What is an estimator? Often a trade-off between bias and variance
5
5 (Typeset equations courtesy http://en.wikipedia.org/wiki/Variance) Variance defined: Population variance: (have all obs 1…N) Two estimators of population variance:
6
6 vs. (Typeset equations courtesy http://en.wikipedia.org/wiki/Variance)
7
7
8
Standard Deviation Spread of a list Single variables have SD 8 Standard Error Spread of a chance process Sampling Distributions have SE Graphics: Wikipedia
9
9
10
Remember that a z-score tells us where a score is located within a distribution– specifically, how many standard deviation units the score is above or below the mean. 10
11
For example, if we find a particular difference that is x standard errors wide, how confident are we that the difference is not just due to chance? So… we can use z-scores on a normal curve to interpret how likely a given outcome is (how likely is it due to chance?) 11
12
Example, you have a variable x with mean of 500 and S.D. of 15. How common is a score of 525? Z = 525-500/15 = 1.67 If we look up the z-statistic of 1.67 in a z-score table, we find that the proportion of scores less than our value is.9525. Or, a score of 525 exceeds.9525 of the population. (p <.05) 12
13
z is a test statistic More generally: z = observed – expected SE Z tells us how many standard errors an observed value is from its expected value. 13
14
A confidence interval is a range of scores above and below the mean. The interval is in standard errors It is the interval where we expect our value to be A confidence coefficient is the likelihood that a given interval has the true value of the parameter Sample value = true population value + error 14
15
One-tailed Directional Hypothesis Probability at one end of the curve Two-tailed Non-directional Hypothesis Probability is both ends of the curve 15
16
Null Hypothesis: H 0 : μ 1 = μ c ▪ μ 1 is the intervention population mean ▪ μ c is the control population mean 16 Alternative Hypotheses: H 1 : μ 1 < μ c H 1 : μ 1 > μ c
17
Null Hypothesis: H 0 : μ 1 = μ c ▪ μ 1 is the intervention population mean ▪ μ c is the control population mean 17 Alternative Hypothesis: H 1 : μ 1 ≠ μ c
18
Do Berkeley students read more or less than 8 hours a week? H 0 : μ = 8 The mean for Berkeley students is equal to 8 H 1 : μ ≠ 8 The mean for Berkeley students is not equal to 8 18
19
Do Berkeley students read more than 8 hours a week (the average for students across the country)? H 0 : μ = 8 There is no difference between Berkeley students and other students H 1 : μ > 8 The mean for Berkeley students is higher than the mean for all students 19
20
A p-value is the observed significance level (more on this in a moment) A test statistic depends on the data, as does p. This chance assumes that the null hypothesis is correct. Thus, the smaller the chance (p-value), the morel likely that the null can be rejected. The choice of a test statistic (e.g., z, t, F, Χ 2 ) depends on the model and they hypothesis being considered The basic process is exactly the same, however. 20
21
When p value >.10 → the observed difference is “not significant” When p value ≤.10 → the observed difference is “marginally significant” or “borderline significant” When p value ≤.05 → the observed difference is “significant” When p value ≤.01 → the observed difference is “highly significant” 21
22
We cannot hypothesize the null As odd as it may seem at first, we reject or do not reject the null; a traditional hypothesis test tests against the null. We never use the word proof with hypothesis testing and statistics, we reject or accept. Prove has a specific meaning in mathematics and philosophy, but the term is misleading in statistics. 22
23
Type I Error: falsely rejecting a null hypothesis (false positive) Type II Error: Failing to reject the null hypothesis when it is false (false negative) 23
24
(The auto data) 24
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.