Go to Index Analysis of Means Farrokh Alemi, Ph.D. Kashif Haqqi M.D.

Slides:



Advertisements
Similar presentations
Hypothesis testing Another judgment method of sampling data.
Advertisements

Sampling: Final and Initial Sample Size Determination
Hypothesis: It is an assumption of population parameter ( mean, proportion, variance) There are two types of hypothesis : 1) Simple hypothesis :A statistical.
Introduction to Statistics
Statistical Significance What is Statistical Significance? What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant?
HYPOTHESIS TESTING Four Steps Statistical Significance Outcomes Sampling Distributions.
1 MF-852 Financial Econometrics Lecture 4 Probability Distributions and Intro. to Hypothesis Tests Roy J. Epstein Fall 2003.
DATA ANALYSIS I MKT525. Plan of analysis What decision must be made? What are research objectives? What do you have to know to reach those objectives?
Statistical Significance What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant? How Do We Know Whether a Result.
Inferences About Means of Single Samples Chapter 10 Homework: 1-6.
Probability & Statistics for Engineers & Scientists, by Walpole, Myers, Myers & Ye ~ Chapter 10 Notes Class notes for ISE 201 San Jose State University.
Inference about a Mean Part II
Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides
Business Research Methods William G. Zikmund Chapter 17: Determination of Sample Size.
Chapter 9 Hypothesis Testing II. Chapter Outline  Introduction  Hypothesis Testing with Sample Means (Large Samples)  Hypothesis Testing with Sample.
Standard error of estimate & Confidence interval.
Confidence Intervals for the Mean (σ Unknown) (Small Samples)
Statistical Analysis. Purpose of Statistical Analysis Determines whether the results found in an experiment are meaningful. Answers the question: –Does.
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
Overview of Statistical Hypothesis Testing: The z-Test
Chapter 13 – 1 Chapter 12: Testing Hypotheses Overview Research and null hypotheses One and two-tailed tests Errors Testing the difference between two.
Overview Definition Hypothesis
Statistical inference: confidence intervals and hypothesis testing.
Section #4 October 30 th Old: Review the Midterm & old concepts 1.New: Case II t-Tests (Chapter 11)
Chapter 6 Confidence Intervals.
Statistics Primer ORC Staff: Xin Xin (Cindy) Ryan Glaman Brett Kellerstedt 1.
7 Elementary Statistics Hypothesis Testing. Introduction to Hypothesis Testing Section 7.1.
From last lecture (Sampling Distribution): –The first important bit we need to know about sampling distribution is…? –What is the mean of the sampling.
1 Statistical Inference Greg C Elvers. 2 Why Use Statistical Inference Whenever we collect data, we want our results to be true for the entire population.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Chapter 9: Testing Hypotheses
Mid-Term Review Final Review Statistical for Business (1)(2)
Business Research Methods William G. Zikmund Chapter 17: Determination of Sample Size.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Measures of Dispersion CUMULATIVE FREQUENCIES INTER-QUARTILE RANGE RANGE MEAN DEVIATION VARIANCE and STANDARD DEVIATION STATISTICS: DESCRIBING VARIABILITY.
Go to index Two Sample Inference for Means Farrokh Alemi Ph.D Kashif Haqqi M.D.
Inference and Inferential Statistics Methods of Educational Research EDU 660.
Chapter 9 Tests of Hypothesis Single Sample Tests The Beginnings – concepts and techniques Chapter 9A.
Introduction to Inferential Statistics Statistical analyses are initially divided into: Descriptive Statistics or Inferential Statistics. Descriptive Statistics.
The Central Limit Theorem and the Normal Distribution.
Determination of Sample Size: A Review of Statistical Theory
Statistical Inference Statistical Inference involves estimating a population parameter (mean) from a sample that is taken from the population. Inference.
What does Statistics Mean? Descriptive statistics –Number of people –Trends in employment –Data Inferential statistics –Make an inference about a population.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Interval Estimation and Hypothesis Testing Prepared by Vera Tabakova, East Carolina University.
Statistical Inference for the Mean Objectives: (Chapter 9, DeCoursey) -To understand the terms: Null Hypothesis, Rejection Region, and Type I and II errors.
Chapter 8 Parameter Estimates and Hypothesis Testing.
Confidence Interval Estimation For statistical inference in decision making:
Confidence Intervals for the Mean (Small Samples) 1 Larson/Farber 4th ed.
Inferences from sample data Confidence Intervals Hypothesis Testing Regression Model.
Ex St 801 Statistical Methods Inference about a Single Population Mean.
Chapter 9: Testing Hypotheses Overview Research and null hypotheses One and two-tailed tests Type I and II Errors Testing the difference between two means.
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
: An alternative representation of level of significance. - normal distribution applies. - α level of significance (e.g. 5% in two tails) determines the.
Chapter Eight: Using Statistics to Answer Questions.
Chapter 11: Estimation of Population Means. We’ll examine two types of estimates: point estimates and interval estimates.
© Copyright McGraw-Hill 2004
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Section 6.2 Confidence Intervals for the Mean (Small Samples) Larson/Farber 4th ed.
Chapter Eleven Sample Size Determination Chapter Eleven.
Hypothesis Testing and Statistical Significance
CHAPTER 7: TESTING HYPOTHESES Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for a Diverse Society.
Statistical Inference for the Mean Objectives: (Chapter 8&9, DeCoursey) -To understand the terms variance and standard error of a sample mean, Null Hypothesis,
Section 6.2 Confidence Intervals for the Mean (Small Samples) © 2012 Pearson Education, Inc. All rights reserved. 1 of 83.
Statistical Decision Making. Almost all problems in statistics can be formulated as a problem of making a decision. That is given some data observed from.
Chapter 9 Introduction to the t Statistic
Outline Sampling Measurement Descriptive Statistics:
Chapter Nine Part 1 (Sections 9.1 & 9.2) Hypothesis Testing
Chapter 6 Confidence Intervals.
Statistical Inference for the Mean: t-test
Presentation transcript:

Go to Index Analysis of Means Farrokh Alemi, Ph.D. Kashif Haqqi M.D.

Go to Index Table of Content Review Objectives Definitions Expected Value Normal Distribution Distribution of Mean Central Limit Theorem Standard Normal DistributionStandard Normal Distribution Use of Z Values Confidence Interval Hypothesis Two Types of Error One-tailed Tests Steps in Testing a HypothesisSteps in Testing a Hypothesis When to Assume Normal Distribution for MeansWhen to Assume Normal Distribution for Means Use t-distribution

Go to Index Review Frequency distribution Mean, median, and mode Standard deviation and range Statistics is the art of making sense of distributions.

Go to Index Objectives Describe different distributions, including normal, and t-distributions. Calculate and interpret confidence intervals using normal distributions. Understand types of errors that occurs with hypothesis testing. Hypothesis testing using t-distribution.

Go to Index Example You Should Be Able to Answer at the End Is it important to ask these types of questions? The cost of rehabilitation in the industry is $25,000, with a standard deviation of Assume that the average cost in our hospital is $30,000. With 95% confidence, would you say that our cost is different than the industry?

Go to Index Definitions A random variable is a variable whose values are determined by chance. A probability distribution is the probability with which values of a random variable can or are observed. Probability of a value is the frequency of occurrence of that value divided by the frequency of occurrences of all values.

Go to Index Example of Probability Estimates We examined the waiting time of 50 people at our emergency room and found that 10 people waited up to 5 minutes, 20 people waited to 10 minutes, 13 people waited to 15 minutes and 7 people waited to 20 minutes. What is the probability of waiting 5 minutes? What is the probability of waiting up to 10 minutes? Distributions help us make probability estimates about observed values.

Go to Index Example of Probability Estimates (Continued) The probability of waiting up to 5 minutes is the number of times people waited up to 5 minutes divided by the total number of people: 10/50=.20. The probability of waiting up to 10 minutes is the number of people who waited up to 10 minutes divided by the total number of people: (10+20)/50=0.6.

Go to Index Expected Value Expected value of a distribution is the mean of the distribution. It represents our long run expectations about the distribution. The expected value of X is given by summing the product of each value of X, referred to as “i”, times its probability of occurring, referred to as p(X=i). Expected value = mean =  p(X=i) * i.

Go to Index Example Calculation of Expected Value or Mean We examined the waiting time of 50 people at our emergency room and found that 10 people waited up to 5 minutes, 20 people waited 6 to 10 minutes, 13 people waited 11 to 15 minutes and 7 people waited minutes. What is the mean waiting time at our emergency room?

Go to Index Example Calculation of Expected Value or Mean (Continued) Do this in Excel

Go to Index Normal Distribution A symmetric distribution, meaning that data are evenly distributed about the mean. Mean, median and mode are the same value. It has one mode and looks like a bell shaped curve.

Go to Index Normal Distribution Continued The curve is continuous, there are no gaps or holes. The curve never touches the X-axis as any value is possible but with infinitely small probabilities. 99.7% of values are within 3 standard deviations of mean.

Go to Index Distribution of Mean If you take a repeated sample of some observations and average them, then you have a distribution for the mean. The distribution of the mean has the same mean as the distribution of the observations. Standard deviation of the mean = Standard error = Standard deviation of the observations / Square root of the sample size.

Go to Index Example What is the mean, standard deviation and standard error for the following data: 4, 5, 6? Mean = 5 Standard deviation = 1 Standard error = 1 / 1.7 = 0.58

Go to Index Central Limit Theorem For any distribution of n observations with mean of  and standard deviation . As n increases, the sample means will have a Normal distribution of mean  and standard deviation  / square root (n). The theorem is important because it helps us ignore questions about the shape of distribution and focus on the mean and standard deviation of it. Do this in Excel

Go to Index Standard Normal Distribution A Normal distribution. Mean of zero. Standard deviation of 1. Z = (Observed value – mean) / standard deviation of average. Where standard deviation of mean = standard error = standard deviation of observations divided by square root of sample size.

Go to Index Example Calculation of Z What is the Z value for the observed mean of 16, if the average mean is 10 and the standard error is 2? Z = (16-10) / 2 = 3.

Go to Index Another Example What is the Z value for the mean 16 of 4 observations, if the average of repeated sample of means is 10 and the standard deviation of the observations is 2? Standard deviation of mean = 2 / 4^0.5 = 2/2 =1 Z value for 16 = (16-10)/1 = 6

Go to Index Use of Z Values 99.7% of data are between z=3 and z=-3. Z is the number of standard deviations that X is away from the mean. 0.15% of data are below z= % of data are above z=3.

Go to Index Use of Z Value (Continued) 95% of data are within z=1.96 and z= % are outside z=1.96 and z= % of data are below z= % of data are above z=1.96

Go to Index Confidence Interval For Normal distributions, the 95% two tailed confidence interval corresponds to observations where z=1.96 and z=-1.96.

Go to Index Example What is the 95% confidence interval for mean of 10 and standard deviation of 2? Lower limit = *2 = Upper limit = *2 = At 13.92, Z value is ( )/2=1.96. At 6.08, Z value is ( ) / 2= % of data fall within these limits.

Go to Index Two Tailed Confidence Interval What percentage of data are between z=1.96 and Z= Answer: 95%. Often referred to as two-tailed confidence interval. What percentage of data are below z=1.96? Answer = Often referred to as one tailed- confidence interval. What percentage of data are above Z= Answer =97.5. Often referred to as one tailed confidence interval.

Go to Index Hypothesis A statistical hypothesis is a conjecture about population parameter. The null hypothesis is that there is no difference between the parameter and a value. The alternative hypothesis states there is a specific difference. Experimental data can only reject a hypothesis not accept it.

Go to Index Possible Outcomes of Hypothesis Test There are four possible outcomes: 1.We reject a hypothesis that is true. 2.We reject a hypothesis that is false. 3.We do not reject a hypothesis that is true. 4.We do not reject a hypothesis that is false.

Go to Index Two Types of Error Hypothesis is true Hypothesis is false We reject hypothesis Type one errorCorrect We do not reject hypothesis CorrectType two error

Go to Index Type 1 Error The level of significance is the maximum probability of type 1 error, symbolized by alpha, . When we base our decision on 95% confidence intervals, 5% of the data are ignored at the two tails of the distribution. Therefore, there is 5% chance that we will reject a hypothesis that is true. Type one error= 5%,  = 0.05.

Go to Index One-tailed Tests In a two-tailed test, the hypothesis is rejected when the value is above higher limit and below the lower limit. In a one-tailed test that a parameter is larger than a particular value, the hypothesis is rejected when the value is above higher limit.

Go to Index One-tailed Tests (Continued) When we base our decision on 95% confidence intervals, 2.5% of the data are ignored at one tail of the distribution. Therefore, there is 2.5% chance that we will reject a hypothesis that is true.  =0.025.

Go to Index Steps in Testing a Hypothesis 1.State the null hypothesis. 2.Identify the alternative hypothesis. 3.Is this a one tailed or two tailed test? 4.Decide the critical Z value above or below which the hypothesis is rejected, usually Calculate the Z value corresponding to the observation. 6.Reject or do not reject the hypothesis by comparing the calculated Z to the critical values.

Go to Index Example The cost of rehabilitation in the industry is $25,000, with a standard deviation of In our hospital, the average cost is $30,000. With 95% confidence, would you say that our cost is different than the industry? Do this in Excel

Go to Index Steps in Testing Example Hypothesis 1.The null hypothesis: Our cost is higher or lower than average. 2.Alternative hypothesis: Our costs are the same as the industry. 3.This is a two tailed test. 4.The critical Z is or – Observed Z = ( )/3000 = Do not reject the hypothesis.

Go to Index When to Assume Normal Distribution for Means When the population variance is known and observations have a Normal distribution. When the population variance is unknown and there are more than 30 observations. Otherwise use t-distribution an approximation for Normal distribution.

Go to Index Use t-distribution If the values in the population is Normal. If we have less than 30 observations. If we have to estimate the standard deviation from the sample and variance of the population is not known. The t-distribution is used as an approximation for near Normal data.

Go to Index Calculating t Statistic t= (observed average – mean) / standard deviation of the average. Critical value of t depends on sample size. For one tail test of alpha = and two tailed test of alpha =0.05. The critical t value for sample size of 10 is 2.22 and for sample size of 20 is 2.08.

Go to Index Calculating t Statistic (Continued) If we are examining sample size of 10, 95% of data are within t=2.22 and t= If we are examining sample size of 10, 97.5% of data are below t=2.22.

Go to Index Testing With t-distribution 1.State the null hypothesis. 2.Identify the alternative hypothesis. 3.Is this a one tailed or two tailed test? 4.Decide the critical t value above or below which the hypothesis is rejected, the value depends on sample size. 5.Calculate the t value corresponding to the observation. 6.Reject or do not reject the hypothesis by comparing the calculated t to the critical values.

Go to Index Example Data

Go to Index Selecting Data Analysis

Go to Index Select Descriptive Statistics

Go to Index Enter Data Range

Go to Index Result Confidence interval is the mean plus or minus the confidence level. If it does not include $30,000, then our hospital has a different cost structure than other hospitals in our database