Significance Tests …and their significance. Significance Tests Remember how a sampling distribution of means is created? Take a sample of size 500 from.

Slides:



Advertisements
Similar presentations
Statistics Review.
Advertisements

1 COMM 301: Empirical Research in Communication Lecture 15 – Hypothesis Testing Kwan M Lee.
Comparing One Sample to its Population
Chapter 12: Testing hypotheses about single means (z and t) Example: Suppose you have the hypothesis that UW undergrads have higher than the average IQ.
Statistics.  Statistically significant– When the P-value falls below the alpha level, we say that the tests is “statistically significant” at the alpha.
Hypothesis Testing A hypothesis is a claim or statement about a property of a population (in our case, about the mean or a proportion of the population)
Inference Sampling distributions Hypothesis testing.
Comparing Two Groups’ Means or Proportions Independent Samples t-tests.
1. Estimation ESTIMATION.
Review: What influences confidence intervals?
Chapter 8 Hypothesis Testing I. Significant Differences  Hypothesis testing is designed to detect significant differences: differences that did not occur.
T-tests Computing a t-test  the t statistic  the t distribution Measures of Effect Size  Confidence Intervals  Cohen’s d.
Business Statistics - QBM117
1 Hypothesis Testing In this section I want to review a few things and then introduce hypothesis testing.
Chapter Sampling Distributions and Hypothesis Testing.
BCOR 1020 Business Statistics Lecture 21 – April 8, 2008.
Inference about a Mean Part II
8-4 Testing a Claim About a Mean
Chapter 9 Hypothesis Testing.
Hypothesis Testing: Two Sample Test for Means and Proportions
Getting Started with Hypothesis Testing The Single Sample.
Probability Population:
Inferential Statistics
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
Chapter 5For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 Suppose we wish to know whether children who grow up in homes without access to.
Comparing Two Groups’ Means or Proportions
Statistics 11 Hypothesis Testing Discover the relationships that exist between events/things Accomplished by: Asking questions Getting answers In accord.
Hypothesis Testing:.
Overview Definition Hypothesis
Copyright © 2012 by Nelson Education Limited. Chapter 8 Hypothesis Testing II: The Two-Sample Case 8-1.
Chapter 8 Hypothesis testing 1. ▪Along with estimation, hypothesis testing is one of the major fields of statistical inference ▪In estimation, we: –don’t.
Statistical Analysis Statistical Analysis
Section 10.1 ~ t Distribution for Inferences about a Mean Introduction to Probability and Statistics Ms. Young.
1 Today Null and alternative hypotheses 1- and 2-tailed tests Regions of rejection Sampling distributions The Central Limit Theorem Standard errors z-tests.
Estimation Statistics with Confidence. Estimation Before we collect our sample, we know:  -3z -2z -1z 0z 1z 2z 3z Repeated sampling sample means would.
A Sampling Distribution
Hypothesis Testing: One Sample Cases. Outline: – The logic of hypothesis testing – The Five-Step Model – Hypothesis testing for single sample means (z.
Chapter 9 Hypothesis Testing II: two samples Test of significance for sample means (large samples) The difference between “statistical significance” and.
Copyright © 2012 by Nelson Education Limited. Chapter 7 Hypothesis Testing I: The One-Sample Case 7-1.
One Sample Inf-1 If sample came from a normal distribution, t has a t-distribution with n-1 degrees of freedom. 1)Symmetric about 0. 2)Looks like a standard.
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 10. Hypothesis Testing II: Single-Sample Hypothesis Tests: Establishing the Representativeness.
Inference We want to know how often students in a medium-size college go to the mall in a given year. We interview an SRS of n = 10. If we interviewed.
1 Psych 5500/6500 The t Test for a Single Group Mean (Part 1): Two-tail Tests & Confidence Intervals Fall, 2008.
Copyright © 2012 Pearson Education. All rights reserved © 2010 Pearson Education Copyright © 2012 Pearson Education. All rights reserved. Chapter.
Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?
5.1 Chapter 5 Inference in the Simple Regression Model In this chapter we study how to construct confidence intervals and how to conduct hypothesis tests.
Copyright © 2010, 2007, 2004 Pearson Education, Inc Section 8-2 Basics of Hypothesis Testing.
Introduction to the Practice of Statistics Fifth Edition Chapter 6: Introduction to Inference Copyright © 2005 by W. H. Freeman and Company David S. Moore.
Chapter 8 Parameter Estimates and Hypothesis Testing.
Fall 2002Biostat Statistical Inference - Confidence Intervals General (1 -  ) Confidence Intervals: a random interval that will include a fixed.
KNR 445 Statistics t-tests Slide 1 Introduction to Hypothesis Testing The z-test.
Chapter 12 Confidence Intervals and Hypothesis Tests for Means © 2010 Pearson Education 1.
Review - Confidence Interval Most variables used in social science research (e.g., age, officer cynicism) are normally distributed, meaning that their.
The Single-Sample t Test Chapter 9. t distributions >Sometimes, we do not have the population standard deviation. (that’s actually really common). >So.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
Review I A student researcher obtains a random sample of UMD students and finds that 55% report using an illegally obtained stimulant to study in the past.
Math 4030 – 9a Introduction to Hypothesis Testing
Welcome to MM570 Psychological Statistics
Chapter Eight: Using Statistics to Answer Questions.
© Copyright McGraw-Hill 2004
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
Hypothesis Testing and Statistical Significance
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Copyright © 2009 Pearson Education, Inc. 9.2 Hypothesis Tests for Population Means LEARNING GOAL Understand and interpret one- and two-tailed hypothesis.
Copyright © 2009 Pearson Education, Inc t LEARNING GOAL Understand when it is appropriate to use the Student t distribution rather than the normal.
Chapter 9 Introduction to the t Statistic
9.3 Hypothesis Tests for Population Proportions
Hypothesis Testing: One Sample Cases
Review: What influences confidence intervals?
Presentation transcript:

Significance Tests …and their significance

Significance Tests Remember how a sampling distribution of means is created? Take a sample of size 500 from the US. Record the mean self-esteem. If the mean should be 25, you might get this. Self-esteem

Significance Tests Remember how a sampling distribution of means is created? Take a sample of size 500 from the US. Record the mean self-esteem. If the mean should be 25, you might get this. Self-esteem

Significance Tests Remember how a sampling distribution of means is created? Take a sample of size 500 from the US. Record the mean self-esteem. If the mean should be 25, you might get this. Self-esteem

Significance Tests Remember how a sampling distribution of means is created? Take a sample of size 500 from the US. Record the mean self-esteem. If the mean should be 25, you might get this. Self-esteem

Significance Tests Remember how a sampling distribution of means is created? Take a sample of size 500 from the US. Record the mean self-esteem. If the mean should be 25, you might get this. Self-esteem

Significance Tests Remember how a sampling distribution of means is created? Take a sample of size 500 from the US. Record the mean self-esteem. If the mean should be 25, you might get this. Self-esteem

Significance Tests Remember how a sampling distribution of means is created? Take a sample of size 500 from the US. Record the mean self-esteem. If the mean should be 25, you might get this. Self-esteem

Significance Tests Remember how a sampling distribution of means is created? Take a sample of size 500 from the US. Record the mean self-esteem. If the mean should be 25, you might get this. Self-esteem

Significance Tests Remember how a sampling distribution of means is created? Take a sample of size 500 from the US. Record the mean self-esteem. If the mean should be 25, you might get this. Self-esteem

Significance Tests Remember how a sampling distribution of means is created? Take a sample of size 500 from the US. Record the mean self-esteem. If the mean should be 25, you might get this. The sample means would stack up in a normal curve. A normal sampling distribution. z Self-esteem

Significance Tests Remember how a sampling distribution of means is created? Take a sample of size 500 from the US. Record the mean self-esteem. If the mean should be 25, you might get this. The sample means would stack up in a normal curve. A normal sampling distribution. z Self-esteem %

Significance Tests The sample size affects the sampling distribution: Standard error = population standard deviation / square root of sample size  Y-bar = /n s.e. = pop. Sd./ n But in fact we use our sample’s standard deviation as an estimate of the population’s.

Σ(Y – Y-bar) 2 n - 1

Significance Tests And if we increase our sample size (n)… Our repeated sample means will be closer to the true mean: z % Z

Significance Tests Means will be closer to the true mean, and our standard error of the sampling distribution is smaller: z % Z

Significance Tests The range of particular middle percentages gets smaller: z Z % Range Self-esteem

Significance Tests  68% % 68% 1.96z 1z 3z 99.99% We use that measuring stick to say two things: 1.If my sample is in the middle specified percent, the population’s mean is within this range. (Confidence Interval) 2.Besides construct a confidence interval, we can also do a significance test.

Significance Tests  68% % 68% 1.96z 1z 3z 99.99% We use that measuring stick to say two things: 1.If my sample is in the middle specified percent, the population’s mean is within this range. (Confidence Interval) 2.If the population mean is the same as a guess of mine, then my sample’s mean would have to fall within this range to have been drawn from the middle specified percent. (Significance Test)

Significance Tests  68% % 1z We know that if you have your sampling distribution centered on the population mean: 16% of samples’ means would be larger than  + 1z and 16% would be smaller than  - 1z, for a total of 32% outside that range.

Significance Tests  % 1.96z We know that if you have your sampling distribution centered on the population mean: 2.5% of samples’ means would be larger than  z and 2.5% would be smaller than  z, for a total of 5% outside that range.

Significance Tests  z 99.99% We know that if you have your sampling distribution centered on the population mean: 0.005% of samples’ means would be larger than  + 3z and 0.005% would be smaller than  - 3z, for a total of 0.01% outside that range.

Significance Tests But you remember that we don’t normally know the actual mean for the population. But what if we guessed? What if we specified a value that might be the population mean?

Significance Tests  If we guessed a mean… If our guess is correct, our sample’s mean should be among the common samples that would have been drawn from a population with that guessed mean. If it is not, it is likely that the sample did not come from such a population. guess What if my sample’s mean were here?

Significance Tests  One way to tell whether our sample’s mean was generated by such a population is to place our sampling distribution over the guessed mean to see if the sample mean is among the middle 99% or 95% of samples that would be generated by such a mean. guess What if my sample’s mean were here? It is among the rare 5% of possible means. 95% 1.96z

Significance Tests  Essentially, a significance test for a mean tells you what the odds are that your sample mean could have come from a population whose mean equals your guess. guess What if my sample’s mean were here? It is among the rare 5% of possible means. 95% 1.96z

Significance Tests  Essentially, a significance test for a mean tells you what the odds are that your sample mean could have come a population with a particular mean. guess 95% Sample mean What you do is figure out what your sample’s z- score is relative to your guessed mean. If z is larger than 1.96 or smaller than -1.96, you have less than a 5% chance than your sample came from such a “guess population” —reject the guess!

Significance Tests For example:If our guess was that self-doubt scores in the population averaged 20 on a scale from 1 – 50, we’d place a guess as below. Self-doubt

Significance Tests We guess 20, but our sample of size 100 has a mean of 25 and a standard deviation of 10. Guess,  Sample, Y-bar Self-doubt

Significance Tests Let’s build a sampling distribution around our guess, 20: sample of size 100; s.d. = 10. s.e. = 10/100 = 10/10 = 1 Self-doubt Z: Sample, Y-bar

Significance Tests Our sample appears to be larger than a critical value of 1.96 (outer 5% of samples) or even 2.58 (outer 1% of samples). s.e. = 10/100 = 10/10 = 1 Self-doubt Z: Sample, Y-bar

Significance Tests How many z’s is our sample mean away from our guess? Z = Y-bar –  / s.e. Z = 25 – 20 / 1 z = 5 s.e. = 10/100 = 10/10 = 1 Self-doubt Z: Sample, Y-bar

Significance Tests Indeed, our sample z-score is 5, well above 1.96 or Reject the guess! Looking in Appendix B… Our sample has a % chance of having come from a population whose mean is 20! s.e. = 10/100 = 10/10 = 1 Self-doubt Z: Sample, Y-bar

Significance Tests Conducting a Test of Significance for the Mean By slapping the sampling distribution for the mean over a guess of the mean, H o, we can find out whether our sample could have been drawn from a population where the mean is equal to our guess. 1.Decide -level ( =.05) and nature of test (two-tailed vs. one-tailed) 2.Set critical z (z = +/- 1.96) or t 3.Make guess or null hypothesis, H o :  = 0 H a :   0 4.Collect and analyze data 5.Calculate Z or t: z = Y-bar -  s.e. 6.Make a decision about the null hypothesis (reject or fail to reject) 7.Find the P-value

Significance Tests Guess, µ o Sampling distribution of sample means, s.e. calculated by s/√n 1. Decide -level ( =.05) and nature of test (two-tailed vs. one-tailed). -level refers to how unlikely a sample’s mean would have to be before you’d reject your guess. The scientific standard is typically.05 probability, or a 5% chance that your sample came from a population whose mean is what you guessed. If your sample’s mean has less than 5% chance of having come from a population with your guess, you’d reject the guess (the null hypothesis). -level could be set at.10,.01, etc. What if my sample mean were here?

Significance Tests Guess, µ o Sampling distribution of sample means, s.e. calculated by s/√n 1. Decide -level ( =.05) and nature of test (two-tailed vs. one-tailed). One- or two-tailed test refers to the rejection region in your sampling distribution. If your -level were.05, in a two-tailed test your rejection region would be the outer 2.5% of each tail. A two-tailed test implies a directionless null hypothesis such as µ o = 0. What if my sample mean were here? 2.5% of sampling distribution.

Significance Tests Guess, µ o Sampling distribution of sample means, s.e. calculated by s/√n 1. Decide -level ( =.05) and nature of test (two-tailed vs. one-tailed). One- or two-tailed test refers to the rejection region in your sampling distribution. If your -level were.05, in a one-tailed test your rejection region would be the outer 5% of one of the tails. A one-tailed test implies a directional null hypothesis such as µ o ≤ 0 or µ o ≥ 0. The idea: If I have good reason to think a parameter would be above a particular value, then I only need to set the guess at that value or less (µ o ≤ 0) and look to see if the sample statistic is in the rare 5% of possible samples above the null. If it is in the extreme low end, I won’t reject the null! What if my sample mean were here or there? 5% of sampling distribution.

Significance Tests Guess, µ o Sampling distribution of sample means, s.e. calculated by s/√n 2. Set critical z (z = +/- 1.96) or t -level refers to how unlikely a sample’s mean would have to be before you’d reject your guess. There is a z- or t-score that corresponds with that  proportion of the area in the tails of the curve (area in the tails of the sampling distribution). For example, ?? in the right tail corresponds with z = ?? What if my sample mean were here?

Significance Tests We use t instead of z to be more accurate: t curves are symmetric and bell-shaped like the normal distribution. However, the spread is more than that of the standard normal distribution—the tails are fatter. Tea Tests? df = 1, 2, 3, and so on, approaching normal as df exceeds 120.

Significance Tests The reason for using t is due to the fact that we use sample standard deviation (s) rather than population standard deviation (σ) to calculate standard error. Since s, standard deviations, will vary from sample to sample, the variability in the sampling distribution ought to be greater than in the normal curve. t has a larger spread, more accurately reflecting the likelihood of extreme samples, especially when sample size is small. The larger the degrees of freedom (n – 1), the closer the t curve is to the normal curve. This reflects the fact that the standard deviation s approaches σ for large sample size n. Even though z-scores based on the normal curve will work for larger samples (n > 120) SPSS uses t for all tests because it works for small samples and large samples alike. Tea Tests?

Significance Tests Guess, µ o Sampling distribution of sample means, s.e. calculated by s/√n 3. Make guess or null hypothesis: Ho:  = 0 Ha:   0 The guess refers to the value that you will feel comfortable with declaring is true for the population unless your sample evidence suggests otherwise. In science, we wouldn’t want to assert something based on a sample unless we had extremely good evidence. The null is a default assumption, such as saying previous research says the mean is . In more advanced statistics, we will typically use null hypotheses that declare “no difference between groups” or “no relationship between variables.” The alternative is typically consistent with your research hypothesis or expectations. What if my sample mean were here?

Significance Tests Guess, µ o Sampling distribution of sample means, s.e. calculated by s/√n 3. Make guess or null hypothesis: Ho:  = 0 (or some other value) Ha:   0 The hypotheses above refer to a two-tailed test. Hypotheses for one-tailed tests would be like this: Ho:  ≤ 0 (or some other value) Ha:  > 0 Ho:  ≥ 0 Ha:  < 0 What if my sample mean were here?

Significance Tests Guess, µ o Sampling distribution of sample means, s.e. calculated by s/√n 4. Collect and analyze data. Once you’ve established your assumptions and what you are testing for, you can get into data analysis. Note that this ordering of steps helps prevent you from “peaking into” the data to establish your assumptions and tests. Basing tests on sample information sets up predetermined outcomes—bad! If calculating inferential statistics by hand, you would need to find your mean and standard deviation for each variable. What if my sample mean were here?

Significance Tests Guess, µ o Sampling distribution of sample means, s.e. calculated by s/√n 5. Calculate Z or t: z or t = Y-bar -  s.e. s.e. for z = σ/√n s.e. for t = s/√n Calculating the test statistic will tell you how many standard errors away from the null hypothesis your sample statistic is. Corresponding with the z or t value is an area under the curve that tells what proportion or percentage of sample means would have been that far away if your null hypothesis were correct. What if my sample mean were here?

Significance Tests Guess, µ o Sampling distribution of sample means, s.e. calculated by s/√n 6. Make a decision about the null hypothesis. Is your sample statistic more standard errors away from your guess or null hypothesized value than your critical z or t? If it is farther out: It meets the criteria for implausibly rare that you established from the outset. You would reject the null, saying it is unlikely your sample could have come from a population where that null value is true If it is not more extreme than your critical z or t: It is not an unlikely occurrence as established from the outset. You would fail to reject the null, saying that your guess is likely true and your sample has a good chance of having come from a population with that null value. What if my sample mean were here? - Z? -

Significance Tests Guess, µ o Sampling distribution of sample means, s.e. calculated by s/√n 7. Find the p-value. The p-value will tell you the actual likelihood that you’d get a sample with your statistic that is as far away from the null value or the guess if your null or guess were true for your population. To find p, look in a z or t table to find the proportion of the area in the tails of the curve that corresponds with the z or t that you calculated for your sample statistic. Remember to be sure you keep track of whether you are doing a two-tailed (p * 2) or one-tailed (p) test. What if my sample mean were here? p?

Significance Tests Guess,.50 Sampling distribution of sample means, s.e. calculated by s/√n Another Example of a Significance Test of the mean or proportion. An administrator read that “snob universities” have over 50% of student GPAs over 3.5. He wants to determine whether SJSU is a “snob university.” He decides: 1.To use an alpha-level of.05 with a one-tailed test. 2.That the critical z or t will be Thinking SJSU is a “snob university” he sets his null as: Ho: Π ≤.5; Ha: Π >.5 5%

Significance Tests Guess,.50 Our sample. Sampling distribution of sample means, s.e. calculated by s/√n Another Example of a Significance Test of the mean or proportion. An administrator read that “snob universities” have over 50% of student GPAs over 3.5. He wants to determine whether SJSU is a “best university.” He decides: 1.To use an alpha-level of.05 with a one-tailed test. 2.That the critical z or t will be Thinking SJSU is a “snob university” he sets his null as: Ho: Π ≤.5; Ha: Π >.5 4.He collects data from 500 randomly selected SJSU students and finds that.40 have GPAs above He calculates z s.e.= √(p)(1-p)/N z = p – Π o / s.e. s.e. = √.4(.6) = √.24 = z =.4 -.5/.022 = %

Significance Tests Guess,.50 Our sample. Sampling distribution of sample means, s.e. calculated by s/√n Another Example of a Significance Test of the mean or proportion. An administrator read that “snob universities” have over 50% of student GPAs over 3.5. He wants to determine whether SJSU is a “snob university.” He decides: 5.He calculates z s.e.= √(p)(1-p)/N z = p – p o / s.e. s.e. = √.4(.6) = √.24 = z =.4 -.5/.022 = Making a decision about the null is easy. He sees that his sample proportion is lower than the null of.5 and within the null of less than.5. He fails to reject the null. 7.In finding the p-value, he sees that if the population value were.5, he’d have less than.ooo1 chance of getting a GPA that low. He has good evidence that SJSU is not a “snob univeristy.” 5%

Significance Tests One final note: –The tests we typically use in sociology have assumptions of large sample sizes. –When conducting tests with small sample sizes, some restrictions apply: When working with means, we typically have to assume the population values are normally distributed. When working with proportions, we must use a binomial probability distribution.