Chapter 13: Inferences about Comparing Two Populations Lecture 8b Date: 15 th November 2015 Instructor: Naveen Abedin.

Slides:



Advertisements
Similar presentations
“Students” t-test.
Advertisements

Chapter 10 Estimation and Hypothesis Testing II: Independent and Paired Sample T-Test.
Lecture 3 Outline: Thurs, Sept 11 Chapters Probability model for 2-group randomized experiment Randomization test p-value Probability model for.
Inference for Regression
Chapter 14 Comparing two groups Dr Richard Bußmann.
Objectives (BPS chapter 15)
Inference for distributions: - Comparing two means IPS chapter 7.2 © 2006 W.H. Freeman and Company.
MARE 250 Dr. Jason Turner Hypothesis Testing II. To ASSUME is to make an… Four assumptions for t-test hypothesis testing:
ANOVA Determining Which Means Differ in Single Factor Models Determining Which Means Differ in Single Factor Models.
Today Today: Chapter 10 Sections from Chapter 10: Recommended Questions: 10.1, 10.2, 10-8, 10-10, 10.17,
BCOR 1020 Business Statistics
1 Inference about Comparing Two Populations Chapter 13.
Announcements Homework 2 will be posted on the web by tonight.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 13 Inference About Comparing Two Populations.
Announcements Extra office hours this week: Thursday, 12-12:45. The midterm will cover through Section I will spend half of Thursday’s class going.
Lecture Inference for a population mean when the stdev is unknown; one more example 12.3 Testing a population variance 12.4 Testing a population.
1 Inference about Comparing Two Populations Chapter 13.
Copyright © 2010 Pearson Education, Inc. Chapter 24 Comparing Means.
1 Inference About a Population Variance Sometimes we are interested in making inference about the variability of processes. Examples: –Investors use variance.
Chapter 9 Hypothesis Testing.
Economics 173 Business Statistics Lecture 9 Fall, 2001 Professor J. Petry
Economics 173 Business Statistics Lecture 8 Fall, 2001 Professor J. Petry
Chapter 24: Comparing Means.
Review of Statistical Inference Prepared by Vera Tabakova, East Carolina University ECON 4550 Econometrics Memorial University of Newfoundland.
Linear Regression Inference
Ch 10 Comparing Two Proportions Target Goal: I can determine the significance of a two sample proportion. 10.1b h.w: pg 623: 15, 17, 21, 23.
The paired sample experiment The paired t test. Frequently one is interested in comparing the effects of two treatments (drugs, etc…) on a response variable.
More About Significance Tests
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Statistical Inferences Based on Two Samples Chapter 9.
Confidence Intervals and Hypothesis Tests for the Difference between Two Population Means µ 1 - µ 2 : Independent Samples Inference for  1  2 1.
QBM117 Business Statistics Estimating the population mean , when the population variance  2, is known.
Inference for distributions: - Comparing two means IPS chapter 7.2 © 2006 W.H. Freeman and Company.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
One-sample In the previous cases we had one sample and were comparing its mean to a hypothesized population mean However in many situations we will use.
1 Psych 5500/6500 t Test for Two Independent Means Fall, 2008.
STATISTICAL INFERENCE PART VIII HYPOTHESIS TESTING - APPLICATIONS – TWO POPULATION TESTS 1.
Maximum Likelihood Estimator of Proportion Let {s 1,s 2,…,s n } be a set of independent outcomes from a Bernoulli experiment with unknown probability.
Economics 173 Business Statistics Lecture 7 Fall, 2001 Professor J. Petry
Chapter 13 Inference About Comparing Two Populations.
1 Inference about Two Populations Chapter Introduction Variety of techniques are presented to compare two populations. We are interested in:
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc Chapter 12 Inference About A Population.
Chapter 13 Inference About Comparing Two Populations.
Lecture 16 Section 8.1 Objectives: Testing Statistical Hypotheses − Stating hypotheses statements − Type I and II errors − Conducting a hypothesis test.
ANOVA Assumptions 1.Normality (sampling distribution of the mean) 2.Homogeneity of Variance 3.Independence of Observations - reason for random assignment.
1 Inference about Two Populations Chapter Introduction Variety of techniques are presented whose objective is to compare two populations. We.
AP Statistics Chapter 24 Comparing Means.
1 Economics 173 Business Statistics Lectures 5 & 6 Summer, 2001 Professor J. Petry.
STATISTICAL INFERENCE PART VI HYPOTHESIS TESTING 1.
Comparing Means Chapter 24. Plot the Data The natural display for comparing two groups is boxplots of the data for the two groups, placed side-by-side.
Learning Objectives After this section, you should be able to: The Practice of Statistics, 5 th Edition1 DESCRIBE the shape, center, and spread of the.
Chapter 14: Analysis of Variance One-way ANOVA Lecture 9a Instructor: Naveen Abedin Date: 24 th November 2015.
Hypothesis Tests for 1-Proportion Presentation 9.
Chapter 10: The t Test For Two Independent Samples.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
STA248 week 121 Bootstrap Test for Pairs of Means of a Non-Normal Population – small samples Suppose X 1, …, X n are iid from some distribution independent.
Chapter 9 Introduction to the t Statistic
Comparing Means: Confidence Intervals and Hypotheses Tests for the Difference between Two Population Means µ 1 - µ 2 Chapter 22 Part 2 CI HT for m 1 -
Hypothesis Tests l Chapter 7 l 7.1 Developing Null and Alternative
Chapter 23 Comparing Means.
ECO 173 Chapter 10: Introduction to Estimation Lecture 5a
Inference about Two Populations
Inference about Comparing Two Populations
ECO 173 Chapter 10: Introduction to Estimation Lecture 5a
Towson University - J. Jung
Chapter 14: Analysis of Variance One-way ANOVA Lecture 8
Chapter 11: Introduction to Hypothesis Testing Lecture 5b
Chapter 13: Inferences about Comparing Two Populations Lecture 7a
Chapter 24 Comparing Means Copyright © 2009 Pearson Education, Inc.
IE 355: Quality and Applied Statistics I Confidence Intervals
Chapter 13: Inferences about Comparing Two Populations
Presentation transcript:

Chapter 13: Inferences about Comparing Two Populations Lecture 8b Date: 15 th November 2015 Instructor: Naveen Abedin

Recap To conduct hypothesis tests for “difference in two population means”, a t-statistic must be used when population means and variances are all unknown. There are two types of t-statistics: i.Equal-variances t-statistic ii.Unequal-variances t-statistic Since population variances are unknown, we have to conduct a “difference in two population variances” hypothesis test to understand whether the population variances are statistically equal or not. Based on the results, we use the appropriate t-statistic to carry out the “difference of means” hypothesis test.

Example 1’s statistics

Example 1: Confidence Interval Confidence interval estimate in a two population test estimates the difference between the two means. Here, since population 1 is returns from direct purchase and population 2 is returns from broker purchase, this confidence interval is trying to estimate the difference in returns.

Example 2’s statistics

Some pointers to note Both the equal-variances test statistic and the unequal variances test statistic requires the population to be normally distributed. We can check whether these requirements have been satisfied through visual representation of the data in the form of a histogram. If the histogram assumes a normal (or somewhat normal shape), then it is appropriate to use the t-statistic. (look in pg of textbook for histograms)

Some pointers to note (cont.) As seen before, the test statistic to be used for a “difference in population means” test depends on whether or not the population variances are statistically equal or unequal. It is, however, more advantageous to pool sample data under the assumption that the two samples were drawn from populations with a common variance. Combining the samples increases the accuracy of the estimate. This is especially desirable in situations when one or both of the samples are not large enough. The pooled variance estimator is said to be an exact procedure, meaning the underlying calculations are perfectly accurate if the assumptions are true. It is consistent with other statistical procedures like ANOVA and regression, meaning is we use pooled variance the conclusions would be exactly the same as any one of these other procedures mentioned. Be warned: we can get very misleading results if used in the wrong situation.

Observational vs. Experimental data Observational study is a study where researchers simply collect data based on what is seen and heard, and infer based on the data collected. Researchers should not interfere with the subjects or variables in any way. In observational study, the variables of interest are simply observed (e.g. if a variable of interest is age, simply ask the people in your sample what their ages are). A researcher, also cannot add any extra information or assumptions. All information must be collected solely through observation.

Observational vs. Experimental data (cont.) An experiment is a method of applying treatments to a group and recording the effects. A good group experiment will have two basic elements: a control and a treatment. The control is the group that remains untreated throughout the duration of the experiment (i.e. they are not introduced to any artificial situation, like taking a new drug). The treatment is the group upon whom the experiment is conducted (they are the ones who take the new drug).

Observational vs. Experimental data (cont.) An observational study is conducted between two populations: Population 1: Consumers of high-fiber breakfast cereal Population 2: Non-consumers of high-fiber breakfast cereal. Theory to test: The study was generated to see if people who consumed high-fiber breakfast cereal necessarily consume fewer calories at lunch.

Observational vs. Experimental data (cont.) An unequal variances test was conducted for the difference in population means (mean calories consumed at lunch): p – value = < 0.05, so reject the null hypothesis. Therefore, the test shows that people who consume high-fiber cereal consume fewer calories at lunch and thus do not gain so much weight.

Observational vs. Experimental data (cont.) However, the test might be showing such favorable results towards high-fiber breakfast cereal for entirely different reasons that high- fiber dietary intake. People who eat fewer calories at lunch are probably more-health conscious than people who consume more calories, and the former kind of people are also more likely to consume high-fiber breakfast cereals as a part of a healthy breakfast.

Observational vs. Experimental data (cont.) To overcome this problem, it is better to conduct an experimental study. Suppose a 150 people are randomly selected to participate in an experiment where 75 of them are assigned to eat high-fiber cereal for breakfast and the other 75 eats something else. Then the number of calories consumed by each group are recorded. Since people have been randomly assigned to each group, on average both groups will be similar in all other dimensions, including health consciousness. (especially if the samples are large).

Observational vs. Experimental data (cont.) Experimental data however are usually more expensive than observational data,. In many cases it is also not possible to conduct experimental studies. Suppose you want to determine whether an undergraduate degree in engineering better prepared students for an MBA than an arts degree does. In a controlled experiment, one group of student would be assigned to graduate with an engineer degree and then enroll for MBA, and the other group would be assigned to study arts and then enroll for MBA. Of course, this kind of an experiment is impossible to conduct, as no experimental subjects will be willing to commit 5 to 6 years to an experiment.

Observational vs. Experimental data (cont.) Thus, we have no choice but to gather information through observational study to assess whether an engineering degree better prepares students for MBA than arts students. If it observed however that engineering students perform better than arts, the cause may not be the degree, but rather the profile of the student. It may be true that better students tend to choose engineering as their undergraduate major and better students achieve higher grades in all programs, including MBA programs.