ISMT253a Tutorial 1 By Kris PAN 2008-02-11.


Skewness: a measure of the asymmetry of the probability distribution of a real-valued random variable.
1) Positive skew: The right tail is longer; the mass of the distribution is concentrated on the left of the figure. The distribution is said to be right-skewed.
2) Negative skew: The left tail is longer; the mass of the distribution is concentrated on the right of the figure. The distribution is said to be left-skewed.
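The sign convention above can be checked numerically. The sketch below assumes the moment-based definition of skewness (third central moment divided by the cubed standard deviation); the data sets are hypothetical and chosen only to show the sign.

```python
def sample_skewness(values):
    """Moment-based skewness: third central moment / (second central moment)^1.5."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return m3 / m2 ** 1.5

# Hypothetical data, not from the tutorial.
right_skewed = [1, 1, 2, 2, 3, 10]          # long right tail
left_skewed = [-v for v in right_skewed]    # mirror image, long left tail
symmetric = [1, 2, 3, 4, 5]

print(sample_skewness(right_skewed))  # positive
print(sample_skewness(left_skewed))   # negative
print(sample_skewness(symmetric))     # zero
```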

Kurtosis: a measure of the "peakedness" of the probability distribution of a real-valued random variable. It is sometimes referred to as the "volatility of volatility."
A high kurtosis portrays a chart with fat tails and a low, even distribution, whereas a low kurtosis portrays a chart with skinny tails and a distribution concentrated toward the mean.

[Figure: distributions with kurtosis of infinity (red), 2 (blue), and 0 (black)]
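A small numerical illustration of the fat-tail idea follows. It assumes the figure's values are excess kurtosis (fourth standardized moment minus 3, so a normal distribution scores 0); both data sets are hypothetical.

```python
def excess_kurtosis(values):
    """Fourth central moment / (second central moment)^2, minus 3."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    return m4 / m2 ** 2 - 3

# Hypothetical data, not from the tutorial.
fat_tails = [-10, -1, -1, 0, 0, 0, 0, 1, 1, 10]   # most mass near 0, two extreme values
flat = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]            # evenly spread, no extreme values

print(excess_kurtosis(fat_tails))  # positive
print(excess_kurtosis(flat))       # negative
```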

2.8 Estimating the Difference Between Two Population Means
Here we have two samples and two sets of statistics:
Sample 1: $n_1$, $\bar{x}_1$, $s_1$
Sample 2: $n_2$, $\bar{x}_2$, $s_2$
and want to use them to estimate the difference between the two population means, $\mu_1$ and $\mu_2$.

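Estimate and Standard Error
A good estimate of the difference in means, $(\mu_1 - \mu_2)$, is the difference in sample means, $\bar{x}_1 - \bar{x}_2$.
If we know the population standard deviations, the standard error of $\bar{x}_1 - \bar{x}_2$ is:
$$\sigma_{\bar{x}_1 - \bar{x}_2} = \sqrt{\frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}}$$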

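Interval Estimate
If we are sampling from two normal populations, an interval estimate is:
$$(\bar{x}_1 - \bar{x}_2) \pm z_{\alpha/2} \sqrt{\frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}}$$
We can also use this as a good approximate interval if both sample sizes are large ($n_1 \ge 30$ and $n_2 \ge 30$).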
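A minimal sketch of the known-standard-deviation interval, using only the Python standard library. The function name and the input values are hypothetical and serve only to illustrate the arithmetic.

```python
from math import sqrt
from statistics import NormalDist

def z_interval_diff(xbar1, xbar2, sigma1, sigma2, n1, n2, confidence=0.95):
    """Interval for mu1 - mu2 when both population standard deviations are known."""
    se = sqrt(sigma1 ** 2 / n1 + sigma2 ** 2 / n2)
    # Two-sided multiplier, e.g. about 1.96 for 95% confidence.
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    diff = xbar1 - xbar2
    return diff - z * se, diff + z * se

# Hypothetical inputs, not taken from the tutorial.
low, high = z_interval_diff(10.0, 8.0, 3.0, 4.0, 36, 64)
print(round(low, 3), round(high, 3))
```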

Unknown $\sigma_1$ and $\sigma_2$
We can use this formula only if the population standard deviations are known.
If they are not, we can use the sample standard deviations and get:
$$s_{\bar{x}_1 - \bar{x}_2} = \sqrt{\frac{s_1^2}{n_1} + \frac{s_2^2}{n_2}}$$

The Approximate Interval
As before, use of the sample standard deviations means we use a t distribution for the multiplier.
In this case, the results are only approximate and the t distribution has $\nu$ degrees of freedom (see the text for how $\nu$ is computed).
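The tutorial defers the degrees-of-freedom formula to the text, which is not reproduced here. One widely used choice is the Welch-Satterthwaite approximation; the sketch below assumes that formula, and the text's version may differ (for example, by rounding the result down to an integer).

```python
def welch_df(s1, s2, n1, n2):
    """Welch-Satterthwaite approximate degrees of freedom (assumed formula)."""
    a = s1 ** 2 / n1
    b = s2 ** 2 / n2
    return (a + b) ** 2 / (a ** 2 / (n1 - 1) + b ** 2 / (n2 - 1))

# Hypothetical inputs: equal spreads and equal sizes give n1 + n2 - 2.
print(welch_df(2.0, 2.0, 10, 10))
# Very unequal spreads pull the value down toward the smaller sample's n - 1.
print(welch_df(1.0, 10.0, 10, 10))
```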

The Pooled Variance Estimate
In some cases, it may be reasonable to assume that $\sigma_1$ and $\sigma_2$ are approximately equal, in which case we need only estimate their common value.
For this purpose, we "pool" the two sample variances and get $s_p^2$, which is a weighted average of the two sample variances.
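The slide does not show the pooling formula. The sketch below assumes the usual form, in which each sample variance is weighted by its degrees of freedom; the inputs are hypothetical.

```python
def pooled_variance(s1, s2, n1, n2):
    """Weighted average of the two sample variances, weights (n - 1)."""
    return ((n1 - 1) * s1 ** 2 + (n2 - 1) * s2 ** 2) / (n1 + n2 - 2)

# Hypothetical inputs: variances 4 and 16, with the second sample twice as heavy.
sp2 = pooled_variance(2.0, 4.0, 11, 21)
print(sp2)  # lies between 4 and 16, closer to 16
```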

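The Exact (pooled sample) Interval
If this is the situation, we can compute an exact interval:
$$(\bar{x}_1 - \bar{x}_2) \pm t_{\alpha/2,\,n_1+n_2-2}\; s_p \sqrt{\frac{1}{n_1} + \frac{1}{n_2}}$$
Note that the pooling allows us to combine degrees of freedom:
df = $(n_1 - 1) + (n_2 - 1) = n_1 + n_2 - 2$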

What Should We Use?
If we know the two population variances are about equal, use the exact procedure.
If we think they differ a lot, we should use the approximate result.
If we do not really know, the approximate approach is probably best.

Example 2.10
For the 83 mutual funds we discussed earlier, we want to compare the five-year returns for load funds versus no-load funds.
The Minitab output for both procedures is on the next slide. The exact procedure output is on the lower half.

Minitab Two-Sample Output

Approximate (upper half):
Two-sample T for 5yr ret
LoadNoLo   N   Mean   StDev   SE Mean   [values not preserved in the transcript]
Difference = mu (0) - mu (1)
Estimate for difference: [not preserved]
95% CI for difference: (-1.54, 3.42)
T-Test of difference = 0 (vs not =): T-Value = 0.76  P-Value = [not preserved]  DF = 56

Exact (uses pooled SD, lower half):
Two-sample T for 5yr ret
LoadNoLo   N   Mean   StDev   SE Mean   [values not preserved in the transcript]
Difference = mu (0) - mu (1)
Estimate for difference: [not preserved]
95% CI for difference: (-1.41, 3.29)
T-Test of difference = 0 (vs not =): T-Value = 0.80  P-Value = [not preserved]  DF = 81
Both use Pooled StDev = 5.24
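As a rough consistency check on the figures that did survive, the point estimate, standard error, and t multiplier can be backed out of each reported interval and T-value. This is only a sketch: the printed numbers are rounded, so the results are approximate, and the helper name is hypothetical.

```python
def back_out(ci_low, ci_high, t_value):
    """Recover estimate, margin, standard error and multiplier from a CI and T-value."""
    estimate = (ci_low + ci_high) / 2   # interval is centred on the estimate
    margin = (ci_high - ci_low) / 2     # multiplier * SE
    se = estimate / t_value             # since T = estimate / SE when testing difference = 0
    multiplier = margin / se
    return estimate, margin, se, multiplier

approx = back_out(-1.54, 3.42, 0.76)   # approximate procedure, DF = 56
exact = back_out(-1.41, 3.29, 0.80)    # pooled procedure, DF = 81

for label, result in (("approximate", approx), ("exact", exact)):
    print(label, [round(x, 3) for x in result])
```

Both intervals are centred on the same difference, as expected, and both implied multipliers come out close to 2, which is in line with a two-sided 95% t multiplier for 56 or 81 degrees of freedom.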

Interpretation
Since we do not have information that the population variances are equal, it is best to use the approximate procedure.
The degrees of freedom are $\nu = 56$ and the interval estimate of $(\mu_{NoLoad} - \mu_{Load})$ is -1.54 to 3.42.
Because this interval contains zero, we cannot conclude that the average returns differ.

2.9 Hypothesis Tests About the Difference Between Two Population Means
Our test is of the form:
$H_0: \mu_1 = \mu_2$ (No difference)
$H_a: \mu_1 \ne \mu_2$ (One is higher)
which has an equivalent form:
$H_0: \mu_1 - \mu_2 = 0$ (Difference is zero)
$H_a: \mu_1 - \mu_2 \ne 0$ (Difference not zero)

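Test Statistic
For the hypothesis of zero difference, the test statistic is just:
$$t = \frac{\bar{x}_1 - \bar{x}_2}{SE}$$
The standard error (SE) is either:
$$\sqrt{\frac{s_1^2}{n_1} + \frac{s_2^2}{n_2}} \quad \text{(approximate procedure)}$$
or
$$s_p \sqrt{\frac{1}{n_1} + \frac{1}{n_2}} \quad \text{(pooled procedure)}$$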
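The two versions of the test statistic can be sketched in a single function. The `pooled` flag, the function name, and the input values are hypothetical; the pooled branch assumes the usual degrees-of-freedom-weighted pooled variance.

```python
from math import sqrt

def two_sample_t(xbar1, xbar2, s1, s2, n1, n2, pooled=False):
    """t statistic for H0: mu1 - mu2 = 0, with either choice of standard error."""
    if pooled:
        sp2 = ((n1 - 1) * s1 ** 2 + (n2 - 1) * s2 ** 2) / (n1 + n2 - 2)
        se = sqrt(sp2) * sqrt(1 / n1 + 1 / n2)
    else:
        se = sqrt(s1 ** 2 / n1 + s2 ** 2 / n2)
    return (xbar1 - xbar2) / se

# Hypothetical summary statistics, not from the tutorial.
t_unpooled = two_sample_t(12.0, 10.0, 3.0, 4.0, 36, 64)
t_pooled = two_sample_t(12.0, 10.0, 3.0, 4.0, 36, 64, pooled=True)
print(round(t_unpooled, 4), round(t_pooled, 4))
```

The two statistics differ because the sample sizes and spreads are unequal; with equal sample sizes the two standard errors coincide.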

Choice of Procedure
As before, we use the approximate procedure with $\nu$ degrees of freedom if we cannot assume $\sigma_1$ and $\sigma_2$ are equal to some common value.
If that is a reasonable assumption, we compute the pooled standard error and use the exact procedure with $(n_1 + n_2 - 2)$ degrees of freedom.

Example
To test the hypothesis that load and no-load funds have the same return, we write:
$H_0: \mu_N - \mu_L = 0$
$H_a: \mu_N - \mu_L \ne 0$
We do not know that the variances are equal, so we use the approximate procedure, which has $\nu = 56$ degrees of freedom.

Results
At a 5% level of significance, reject $H_0$ if $t > t_{.025,56} \approx 1.96$ ** or $t < -1.96$.
Minitab gives us t = 0.76, so we fail to reject $H_0$ and conclude there is no evidence of a difference in average return.
** The correct value for $t_{.025,56}$ is about 2.003.
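The two-sided decision rule can be written as a short function that takes the critical value as an input, for example one read from a t table. The value 2.003 used below for 56 degrees of freedom is an approximate table value supplied here as an assumption; the function name is hypothetical.

```python
def two_sided_decision(t_value, critical_value):
    """Reject H0 when the t statistic falls in either tail beyond the critical value."""
    if abs(t_value) > critical_value:
        return "reject H0"
    return "fail to reject H0"

# t = 0.76 is the Minitab value for the approximate procedure.
print(two_sided_decision(0.76, 1.96))    # normal approximation to the critical value
print(two_sided_decision(0.76, 2.003))   # assumed t-table value for 56 df
```

Either critical value leads to the same conclusion here, because 0.76 is well inside both cutoffs.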