Chapter 6-7-8 Sampling Distributions and Hypothesis Testing.

Slides:



Advertisements
Similar presentations
Introduction to Hypothesis Testing
Advertisements

Chapter 16 Inferential Statistics
1 COMM 301: Empirical Research in Communication Lecture 15 – Hypothesis Testing Kwan M Lee.
Inference Sampling distributions Hypothesis testing.
Statistical Significance What is Statistical Significance? What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant?
1. Estimation ESTIMATION.
Review: What influences confidence intervals?
HYPOTHESIS TESTING Four Steps Statistical Significance Outcomes Sampling Distributions.
DATA ANALYSIS I MKT525. Plan of analysis What decision must be made? What are research objectives? What do you have to know to reach those objectives?
Cal State Northridge  320 Ainsworth Sampling Distributions and Hypothesis Testing.
Statistical Significance What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant? How Do We Know Whether a Result.
1/55 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 10 Hypothesis Testing.
PROBABILITY AND SAMPLES: THE DISTRIBUTION OF SAMPLE MEANS.
IENG 486 Statistical Quality & Process Control
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 5 Chicago School of Professional Psychology.
Chapter 7 Probability and Samples: The Distribution of Sample Means
Probability Population:
Chapter 11: Random Sampling and Sampling Distributions
Chapter 9 Hypothesis Testing II. Chapter Outline  Introduction  Hypothesis Testing with Sample Means (Large Samples)  Hypothesis Testing with Sample.
Inferential Statistics
Probability and the Sampling Distribution Quantitative Methods in HPELS 440:210.
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
Hypothesis Testing:.
Overview of Statistical Hypothesis Testing: The z-Test
Overview Definition Hypothesis
1 © Lecture note 3 Hypothesis Testing MAKE HYPOTHESIS ©
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 9. Hypothesis Testing I: The Six Steps of Statistical Inference.
Descriptive statistics Inferential statistics
Introduction to Hypothesis Testing for μ Research Problem: Infant Touch Intervention Designed to increase child growth/weight Weight at age 2: Known population:
Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Fundamentals of Hypothesis Testing: One-Sample Tests
Chapter 8 Introduction to Hypothesis Testing
Significance Tests …and their significance. Significance Tests Remember how a sampling distribution of means is created? Take a sample of size 500 from.
Go to Index Analysis of Means Farrokh Alemi, Ph.D. Kashif Haqqi M.D.
1 Today Null and alternative hypotheses 1- and 2-tailed tests Regions of rejection Sampling distributions The Central Limit Theorem Standard errors z-tests.
EDUC 200C Friday, October 26, Goals for today Homework Midterm exam Null Hypothesis Sampling distributions Hypothesis testing Mid-quarter evaluations.
1 Statistical Inference Greg C Elvers. 2 Why Use Statistical Inference Whenever we collect data, we want our results to be true for the entire population.
Copyright © 2012 by Nelson Education Limited. Chapter 7 Hypothesis Testing I: The One-Sample Case 7-1.
Chapter 8 Introduction to Hypothesis Testing
Introduction to Hypothesis Testing: One Population Value Chapter 8 Handout.
Making decisions about distributions: Introduction to the Null Hypothesis 47:269: Research Methods I Dr. Leonard April 14, 2010.
Education Research 250:205 Writing Chapter 3. Objectives Subjects Instrumentation Procedures Experimental Design Statistical Analysis  Displaying data.
Chapter 20 Testing hypotheses about proportions
Hypothesis Testing A procedure for determining which of two (or more) mutually exclusive statements is more likely true We classify hypothesis tests in.
Inference and Inferential Statistics Methods of Educational Research EDU 660.
1 Chapter 10: Introduction to Inference. 2 Inference Inference is the statistical process by which we use information collected from a sample to infer.
Chapter 7 Probability and Samples: The Distribution of Sample Means
Introduction to Inferential Statistics Statistical analyses are initially divided into: Descriptive Statistics or Inferential Statistics. Descriptive Statistics.
Chapter 9 Probability. 2 More Statistical Notation  Chance is expressed as a percentage  Probability is expressed as a decimal  The symbol for probability.
Chapter 7 Probability and Samples: The Distribution of Sample Means.
Lecture 2 Review Probabilities Probability Distributions Normal probability distributions Sampling distributions and estimation.
Statistical Inference Statistical Inference involves estimating a population parameter (mean) from a sample that is taken from the population. Inference.
1 Chapter 8 Introduction to Hypothesis Testing. 2 Name of the game… Hypothesis testing Statistical method that uses sample data to evaluate a hypothesis.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
PSY 307 – Statistics for the Behavioral Sciences Chapter 9 – Sampling Distribution of the Mean.
Education 793 Class Notes Decisions, Error and Power Presentation 8.
Chapter 10: Introduction to Statistical Inference.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
Review I A student researcher obtains a random sample of UMD students and finds that 55% report using an illegally obtained stimulant to study in the past.
Hypothesis Testing Introduction to Statistics Chapter 8 Feb 24-26, 2009 Classes #12-13.
Education 793 Class Notes Inference and Hypothesis Testing Using the Normal Distribution 8 October 2003.
Distributions of Sample Means. z-scores for Samples  What do I mean by a “z-score” for a sample? This score would describe how a specific sample is.
Review: Stages in Research Process Formulate Problem Determine Research Design Determine Data Collection Method Design Data Collection Forms Design Sample.
Chapter 8: Introduction to Hypothesis Testing. Hypothesis Testing A hypothesis test is a statistical method that uses sample data to evaluate a hypothesis.
Uncertainty and confidence Although the sample mean,, is a unique number for any particular sample, if you pick a different sample you will probably get.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
PCB 3043L - General Ecology Data Analysis Organizing an ecological study What is the aim of the study? What is the main question being asked? What are.
Chapter 9 Introduction to the t Statistic
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Test Review: Ch. 7-9
Hypothesis Testing.
Presentation transcript:

Chapter Sampling Distributions and Hypothesis Testing

 When we have a frequency distribution, or histogram, we can determine probabilities. Look at the M&M example.  What is one of the most common shapes of frequency distributions??

The normal distribution.  Again, all normal distributions are characterized by the mean and the standard deviation. There are an infinite number of normal distributions.  But some are very special to us, like the Standardized Normal Distribution. –ALL normal distributions can be standardized. –All scores are put in terms of Standard Deviation units from the mean. –SO, we know proportions, and hence, probabilities associated with scores that fall in a normal distribution. We just did that in Chapter 5.

100% of our observations appear in the normal distribution.  Proportions and probabilities are the same.  What proportion of scores fall above a z- score of 1?  What is the probability that a randomly chosen z-score will be 1 or higher?  What is the probability that a randomly chosen z-score will fall between 0 and.5?  There is a.05 probability (or a 5% chance) of a z-score being this high or higher?

More  We can also look at specific scores (X), convert them into z-score, and find the probability of getting a score that high or higher, lower than that score, and so on. –Given sigma = 100 and the mean = 500, what is the probability of getting a 600 or higher? –1) Convert to z; ( )/100 = 1. –2) What proportion of the distribution falls at or above a z-score of 1?

The past  What we have been doing is descriptive statistics.  We have come up with distributions, measures of central tendency and measures of variability, all of which describe a population or a sample.  We can use these, as we have found out, to find the probability of a score, or range of scores, etc.  But statistics, z-scores, probabilities, etc., can be used for more interesting purposes.

The future  Inferential statistics – Estimate population parameters from a sample, or determine if two samples are different –Hypothesis testing – Is the population parameter equal to some specific value? –Ex. This class (random sample) takes a study skills course: Seating, classroom tips, study habits –G. P. A. – Is the G.P.A. of this class now different than MSU students generally (population)?

Well, let’s think about this.  Of course, if we were to randomly sample 50 MSU students and get their mean GPA, it would be a little different than the actual population mean GPA.  There will always be a little error, the sample mean will probably not equal the population mean until all of the members in the population are in our sample.  The quantification of this discrepancy is called Sampling Error –  The discrepancy, or amount or error, between a sample statistic and its corresponding parameter.

Well, let’s think about this.  Also, we can take numerous samples. For example, the next day I can get the GPAs of 40 different students. The mean GPA for this sample will also be a little different than the true population mean. ALSO, this second sample will have a mean that is slightly different from our first sample mean. –In fact, we could take a huge number of samples, and get a huge number of sample means.  So, how do we use a given sample to estimate the population if every sample will be a little different?

Sampling Distribution  To answer this we have to create a sampling Distribution of a statistic (mean, median)  In particular, we will use a Sampling Distribution of Sample Means = –This is the collection of sample means for all the possible random samples of a particular size (n) that could be obtained from a population.  OR –The distribution of a statistic (the mean) over repeated sampling from a specified population.  Sampling distribution of sample means : (Most common), G.P.A.: Say MSU population mean is 2.74,  distribution of means of an infinity of random samples.

 We have been looking at distributions of SCORES, now we are doing to look at distributions of all possible SAMPLE MEANS.  We are dealing with particular type of sampling distribution = a distribution of statistics (e.g., mean) obtained by selecting all the possible samples of a specific size from a population

DRAW SAMPLING DISTRIBUTION OF MEANS: N = 50  Distribution of means if we sample 50 students and assume the population mean is 2.74:  Sample 1: 2.77  Sample 2: 2.91  Sample 3: 2.55  Sample 4: 3.77

 NOTE: This is similar to what we were doing with z scores. We were looking at where a z score falls in a distribution of scores. Now we are looking at where a sample statistic (in this case the mean) falls among a distribution of samples.  If close to the middle of the distribution we retain null hypothesis (no difference)  If far from the middle – sample unlikely, reject hypothesis.

 Sampling Error: Variability of a statistic from sample to sample. Due to chance.  Standard Error: The standard deviation of a sampling distribution from the population. (sigma/ sqrt n)

 As usual, n = sample size, which should be taken into account when calculating standard deviations.  Obviously, the larger the sample, the closer the sample means will be to the population mean (i.e., less error). So, we have to take sample size into account.  Law of large numbers = the larger the sample size, the more probable it is that the sample mean will be close to the population mean.

 When n = 1, se = sd  As n increases, the standard error should decrease. The equation takes this into account.  There is this great mathematical Theorem that allows us to know the general properties of our sampling distribution as our samples (and population) get larger and larger.

Central Limit Theorem:  Central Limit Theorem:  From the book: For any population with a mean (mu) and a standard deviation (sigma), the distribution of sample means for sample size n will have a mean or mu and a standard deviation of sigma/sqrt n and will approach a normal distribution as n approaches infinity. –So what is this saying?  As N increases, sample means and standard deviations approach those of the population. –With a sample size of 30+, the distribution of sample means is practically normal. –So, we have a clue about the mean of the sampling distribution, the standard deviation, and its shape (normal). What can we do with this information???

So what is this saying?  As N increases, sample means and standard deviations approach those of the population.  With a sample size of 30+, the distribution of sample means is practically normal.  So, we have a clue about the mean of the sampling distribution, the standard deviation, and its shape (normal). What can we do with this information???  This allows us to know the distribution of sample means for any population, regardless of the mean and SD, and even if the population distribution is not normal.

Back to our example:  MSU Mean: 2.53  Class Mean: 3.02  There may be no relationship between this class (the intervention) and G.P.A.

Goal:  Determine whether this difference is due to chance (sampling error)  Can determine with probabilities how likely/unlikely it is that this difference is due to chance.  If this class is different, then we can classify it as a different population with different population parameters (higher mean)  A statistical test will answer this question for us:

HYPOTHESIS TESTING!  A hypothesis test = a statistical procedure that uses sample data to evaluate hypotheses about a population parameter.  General steps. –1) generate a hypothesis about the population mean. –2) So, we hypothesize that our sample mean will be close to this guess regarding the population mean. –3) Obtain a sample and sample mean –4) Compare the sample and population means.

1) Set up Null Hypothesis:  The null hypothesis always says the opposite of that in which we are interested: –We can never prove something is true; We can only prove that it is false  In other words: –There is no difference between our groups or: –If we are only interested in whether our group is better:  Null Hypothesis would say our group is equal to or worse than other. –We are usually working to reject the null hypothesis –Note:Assuming the null is true, we create our sampling distribution. In this case the sampling distribution of means. –M class = 2.53

2. Set up the “Alternative hypothesis” (What we want to find)  M class ne 2.53  Doing this before we collect our data. Mean could be higher or lower. Maybe our class hurts people G.P.A.

3. Set a criterion level for our Decision:  How far away does the mean have to be for us to reasonably doubt that this sample came from the same population?  When are we going to say this sample is the same as the population (just sampling error) or when we are going to say this sample is different from the population.

3. Set a criterion level for our Decision:  When are we going to say this sample is the same as the population (just sampling error) or when we are going to say this sample is different from the population.  Significance level – Predetermined probability that represents a sample result that is so rare or unusual that is cast doubt on the accuracy of Ho: alpha –The probability with which we are willing to reject Ho when it is correct. –Rejection region: the set of outcomes from an experiment that will lead to a rejection of Ho.  Typically: –Choose : alpha = 5%