The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.

Slides:



Advertisements
Similar presentations
“Students” t-test.
Advertisements

Hypothesis Testing. To define a statistical Test we 1.Choose a statistic (called the test statistic) 2.Divide the range of possible values for the test.
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Is it statistically significant?
Topic 6: Introduction to Hypothesis Testing
Chapter Seventeen HYPOTHESIS TESTING
1 MF-852 Financial Econometrics Lecture 4 Probability Distributions and Intro. to Hypothesis Tests Roy J. Epstein Fall 2003.
PSY 307 – Statistics for the Behavioral Sciences
Elementary hypothesis testing
MARE 250 Dr. Jason Turner Hypothesis Testing II. To ASSUME is to make an… Four assumptions for t-test hypothesis testing:
1/55 EF 507 QUANTITATIVE METHODS FOR ECONOMICS AND FINANCE FALL 2008 Chapter 10 Hypothesis Testing.
Topic 2: Statistical Concepts and Market Returns
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 9-1 Introduction to Statistics Chapter 10 Estimation and Hypothesis.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Basic Business Statistics.
A Decision-Making Approach
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 10-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Chapter 11: Inference for Distributions
BCOR 1020 Business Statistics Lecture 18 – March 20, 2008.
15-1 Introduction Most of the hypothesis-testing and confidence interval procedures discussed in previous chapters are based on the assumption that.
Hypothesis Testing.
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
Chapter 15 Nonparametric Statistics
The Neymann-Pearson Lemma Suppose that the data x 1, …, x n has joint density function f(x 1, …, x n ;  ) where  is either  1 or  2. Let g(x 1, …,
Hypothesis Testing and T-Tests. Hypothesis Tests Related to Differences Copyright © 2009 Pearson Education, Inc. Chapter Tests of Differences One.
Experimental Statistics - week 2
Overview of Statistical Hypothesis Testing: The z-Test
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 9-1 Chapter 9 Fundamentals of Hypothesis Testing: One-Sample Tests Business Statistics,
Fundamentals of Hypothesis Testing: One-Sample Tests
Chapter 9.3 (323) A Test of the Mean of a Normal Distribution: Population Variance Unknown Given a random sample of n observations from a normal population.
Education 793 Class Notes T-tests 29 October 2003.
NONPARAMETRIC STATISTICS
Two Sample Tests Nutan S. Mishra Department of Mathematics and Statistics University of South Alabama.
Hypothesis Testing. Steps for Hypothesis Testing Fig Draw Marketing Research Conclusion Formulate H 0 and H 1 Select Appropriate Test Choose Level.
Introduction to Hypothesis Testing: One Population Value Chapter 8 Handout.
Chapter 7 Hypothesis testing. §7.1 The basic concepts of hypothesis testing  1 An example Example 7.1 We selected 20 newborns randomly from a region.
Copyright © Cengage Learning. All rights reserved. 10 Inferences Involving Two Populations.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
Biostatistics Class 6 Hypothesis Testing: One-Sample Inference 2/29/2000.
Essential Question:  How do scientists use statistical analyses to draw meaningful conclusions from experimental results?
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests Statistics.
Experimental Design and Statistics. Scientific Method
Experimental Psychology PSY 433 Appendix B Statistics.
Ch11: Comparing 2 Samples 11.1: INTRO: This chapter deals with analyzing continuous measurements. Later, some experimental design ideas will be introduced.
Week111 The t distribution Suppose that a SRS of size n is drawn from a N(μ, σ) population. Then the one sample t statistic has a t distribution with n.
Nonparametric Statistical Methods. Definition When the data is generated from process (model) that is known except for finite number of unknown parameters.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
Ex St 801 Statistical Methods Inference about a Single Population Mean.
Inen 460 Lecture 2. Estimation (ch. 6,7) and Hypothesis Testing (ch.8) Two Important Aspects of Statistical Inference Point Estimation – Estimate an unknown.
T tests comparing two means t tests comparing two means.
© Copyright McGraw-Hill 2004
Statistical Inference Making decisions regarding the population base on a sample.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Logistic regression. Recall the simple linear regression model: y =  0 +  1 x +  where we are trying to predict a continuous dependent variable y from.
§2.The hypothesis testing of one normal population.
Nonparametric Statistical Methods. Definition When the data is generated from process (model) that is known except for finite number of unknown parameters.
Lecture 8 Estimation and Hypothesis Testing for Two Population Parameters.
Hypothesis Testing. A statistical Test is defined by 1.Choosing a statistic (called the test statistic) 2.Dividing the range of possible values for the.
1 Underlying population distribution is continuous. No other assumptions. Data need not be quantitative, but may be categorical or rank data. Very quick.
Hypothesis Testing. Steps for Hypothesis Testing Fig Draw Marketing Research Conclusion Formulate H 0 and H 1 Select Appropriate Test Choose Level.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Hypothesis Testing. Steps for Hypothesis Testing Fig Draw Marketing Research Conclusion Formulate H 0 and H 1 Select Appropriate Test Choose Level.
Chapter Nine Hypothesis Testing.
Estimation & Hypothesis Testing for Two Population Parameters
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
Hypothesis Testing: Hypotheses
Chapter 9 Hypothesis Testing.
Comparing Populations
What are their purposes? What kinds?
Hypothesis Testing: The Difference Between Two Population Means
The z-test for the Mean of a Normal Population
Presentation transcript:

The paired sample experiment The paired t test

Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable. The two treatments determine two different populations –Popn 1 cases treated with treatment 1. –Popn 2 cases treated with treatment 2 The response variable is assumed to have a normal distribution within each population differing possibly in the mean (and also possibly in the variance)

Two independent sample design A sample of size n cases are selected from population 1 (cases receiving treatment 1) and a second sample of size m cases are selected from population 2 (cases receiving treatment 2). The data –x 1, x 2, x 3, …, x n from population 1. –y 1, y 2, y 3, …, y m from population 2. The test that is used is the t-test for two independent samples

The test statistic (if equal variances are assumed): where

The matched pair experimental design (The paired sample experiment) Prior to assigning the treatments the subjects are grouped into pairs of similar subjects. Suppose that there are n such pairs (Total of 2n = n + n subjects or cases), The two treatments are then randomly assigned to each pair. One member of a pair will receive treatment 1, while the other receives treatment 2. The data collected is as follows: –(x 1, y 1 ), (x 2,y 2 ), (x 3,y 3 ),, …, (x n, y n ). x i = the response for the case in pair i that receives treatment 1. y i = the response for the case in pair i that receives treatment 2.

Let d i = y i - x i. Then d 1, d 2, d 3, …, d n Is a sample from a normal distribution with mean,  d =  2 –  1, and variance standard deviation Note if the x and y measurements are positively correlated (this will be true if the cases in the pair are matched effectively) than  d will be small.

To test H 0 :  1 =  2 is equivalent to testing H 0 :  d = 0. (we have converted the two sample problem into a single sample problem). The test statistic is the single sample t-test on the differences d 1, d 2, d 3, …, d n namely df = n - 1

Example We are interested in comparing the effectiveness of two method for reducing high cholesterol The methods 1.Use of a drug. 2.Control of diet. The 2n = 8 subjects were paired into 4 match pairs. In each matched pair one subject was given the drug treatment, the other subject was given the diet control treatment. Assignment of treatments was random.

The data reduction in cholesterol after 6 month period Pair Treatment1234 Drug treatment Diet control Treatment

Differences Pair Treatment1234 Drug treatment Diet control Treatment didi for df = n – 1 = 3, Hence we accept H 0.

Nonparametric Statistical Methods

Many statistical procedures make assumptions The t test, z test make the assumption that the populations being sampled are normally distributed. (True for both the one sample and the two sample test).

This assumption for large sample sizes is not critical. (Reason: The Central Limit Theorem) The sample mean, the statistic z will have approximately a normal distribution for large sample sizes even if the population is not normal.

For small sample sizes the departure from the assumption of normality could affect the performance of a statistical procedure that assumes normality. For testing, the probability of a type I error may not be the desired value of  = 0.05 or 0.01 For confidence intervals the probability of capturing the parameter may be the desired value (95% or 99%) but a value considerably smaller

Example: Consider the z-test For  = 0.05 we reject the hypothesized value of the mean if z 1.96 Suppose the population is an exponential population with parameter. (  = 1/ and  = 1/ )

Actual population Assumed population

Suppose the population is an exponential population with parameter. (  = 1/ and  = 1/ ) It can be shown that the sampling distribution of is the Gamma distribution with The distribution of is not the normal distribution with Use mgf’s

Sampling distribution of Actual distribution Distribution assuming normality n = 2

Sampling distribution of Actual distribution Distribution assuming normality n = 5

Sampling distribution of Actual distribution Distribution assuming normality n = 20

Definition When the data is generated from process (model) that is known except for finite number of unknown parameters the model is called a parametric model. Otherwise, the model is called a non- parametric model Statistical techniques that assume a non- parametric model are called non-parametric.

The sign test A nonparametric test for the central location of a distribution

We want to test: H 0 : median =  0 H A : median   0 against (or against a one-sided alternative)

The assumption will be only that the distribution of the observations is continuous. Note for symmetric distributions the mean and median are equal if the mean exists. For non-symmetric distribution, the median is probably a more appropriate measure of central location.

The Sign test: S = the number of observations that exceed  0 Comment: If H 0 : median =  0 is true we would expect 50% of the observations to be above  0, and 50% of the observations to be below  0, 1.The test statistic:

50% median =  0 If H 0 is true then S will have a binomial distribution with p = 0.50, n = sample size.

median If H 0 is not true then S will still have a binomial distribution. However p will not be equal to 00 p  0 > median p < 0.50

median 00 p  0 < median p > 0.50 p = the probability that an observation is greater than  0.

n = 10 Summarizing: If H 0 is true then S will have a binomial distribution with p = 0.50, n = sample size.

n = 10 The critical and acceptance region: Choose the critical region so that  is close to 0.05 or e. g. If critical region is {0,1,9,10} then  = =.0216

n = 10 e. g. If critical region is {0,1,2,8,9,10} then  = =.1094

If one can’t determine a fixed confidence region to achieve a fixed significance level , one then randomizes the choice of the critical region In the example with n = 10, if the critical region is {0,1,9,10} then  = =.0216 If the values 2 and 8 are added to the critical region the value of increases to (.0439) = = Note 0.05 = (.0878) Consider the following critical region 1.Reject H 0 if the test statistic is {0,1,9,10} 2.If the test statistic is {2,8} perform a success-failure experiment with p = P[success] = , If the experiment is a success Reject H o. 3.Otherwise we accept H 0.

Example Suppose that we are interested in determining if a new drug is effective in reducing cholesterol. Hence we administer the drug to n = 10 patients with high cholesterol and measure the reduction.

The data Let S = the number of negative reductions = 2

n = 10 If H 0 is true then S will have a binomial distribution with p = 0.50, n = 10. We would expect S to be small if H 0 is false.

Choosing the critical region to be {0, 1, 2} the probability of a type I error would be  = = Since S = 2 lies in this region, the Null hypothesis should be rejected. Conclusion: There is a significant positive reduction (  = ) in cholesterol.

If n is large we can use the Normal approximation to the Binomial. Namely S has a Binomial distribution with p = ½ and n = sample size. Hence for large n, S has approximately a Normal distribution with mean and standard deviation

Hence for large n,use as the test statistic (in place of S) Choose the critical region for z from the Standard Normal distribution. i.e. Reject H 0 if z z  /2 two tailed ( a one tailed test can also be set up.