Presentation is loading. Please wait.

Presentation is loading. Please wait.

Sampling and Statistical Analysis for Decision Making A. A. Elimam College of Business San Francisco State University.

Similar presentations


Presentation on theme: "Sampling and Statistical Analysis for Decision Making A. A. Elimam College of Business San Francisco State University."— Presentation transcript:

1 Sampling and Statistical Analysis for Decision Making A. A. Elimam College of Business San Francisco State University

2 Chapter Topics Sampling: Design and Methods Estimation: Confidence Interval Estimation for the Mean (  Known) Confidence Interval Estimation for the Mean (  Unknown) Confidence Interval Estimation for the Proportion

3 Chapter Topics The Situation of Finite Populations Student’s t distribution Sample Size Estimation Hypothesis Testing Significance Levels ANOVA

4 Statistical Sampling Sampling: Valuable tool Population: Too large to deal with effectively or practically Impossible or too expensive to obtain all data Collect sample data to draw conclusions about unknown population

5 Sample design Representative Samples of the population Sampling Plan: Approach to obtain samples Sampling Plan: States Objectives Target population Population frame Method of sampling Data collection procedure Statistical analysis tools

6 Objectives Estimate population parameters such as a mean, proportion or standard deviation Identify if significant difference exists between two populations Population Frame List of all members of the target population

7 Sampling Methods Subjective Sampling: Judgment: select the sample (best customers) Convenience: ease of sampling Probabilistic Sampling: Simple Random Sampling Replacement Without Replacement

8 Sampling Methods Systematic Sampling: Selects items periodically from population. First item randomly selected - may produce bias Example: pick one sample every 7 days Stratified Sampling: Populations divided into natural strata Allocates proper proportion of samples to each stratum Each stratum weighed by its size – cost or significance of certain strata might suggest different allocation Example: sampling of political districts - wards

9 Sampling Methods Cluster Sampling: Populations divided into clusters then random sample each Items within each cluster become members of the sample Example: segment customers for each geographical location Sampling Using Excel: Population listed in spreadsheet Periodic Random

10 Sampling Methods: Selection Systematic Sampling: Population is large – considerable effort to randomly select Stratified Sampling: Items in each stratum homogeneous - Low variances Relatively smaller sample size than simple random sampling Cluster Sampling: Items in each cluster are heterogeneous Clusters are representative of the entire Population Requires larger sample

11 Sampling Errors Sample does not represent target population (e. g. selecting inappropriate sampling method) Inherent error:samples only subset of population Depends on size of Sample relative to population Accuracy of estimates Trade-off: cost/time versus accuracy

12 Sampling From Finite Populations Finite without replacement (R) Statistical theory assumes: samples selected with R When n <.05 N – difference is insignificant Otherwise need a correction factor Standard error of the mean

13 Statistical Analysis of Sample Data Estimation of population parameters (PP) Development of confidence intervals for PP Probability that the interval correctly estimates true population parameter Means to compare alternative decisions/process (comparing transmission production processes) Hypothesis testing: validate differences among PP

14 Mean, , is unknown PopulationRandom Sample I am 95% confident that  is between 40 & 60. Mean X = 50 Estimation Process Sample

15 Mean  Proportion pp s Variances 2 Population Parameters Estimated  2 X _ Point Estimate Population Parameter Std. Dev.  s

16 Provides Range of Values  Based on Observations from Sample Gives Information about Closeness to Unknown Population Parameter Stated in terms of Probability Never 100% Sure Confidence Interval Estimation

17 Confidence Interval Sample Statistic Confidence Limit (Lower) Confidence Limit (Upper) A Probability That the Population Parameter Falls Somewhere Within the Interval. Elements of Confidence Interval Estimation

18 Example: 90 % CI for the mean is 10 ± 2. Point Estimate = 10 Margin of Error = 2 CI = [8,12] Level of Confidence = 1 -  = 0.9 Probability that true PP is not in this CI = 0.1 Example of Confidence Interval Estimation

19 Parameter = Statistic ± Its Error Confidence Limits for Population Mean Error = Error = Error

20 90% Samples 95% Samples  x _ Confidence Intervals 99% Samples X _

21 Probability that the unknown population parameter falls within the interval Denoted (1 -  ) % = level of confidence e.g. 90%, 95%, 99%   Is Probability That the Parameter Is Not Within the Interval Level of Confidence

22 Confidence Intervals Intervals Extend from (1 -  ) % of Intervals Contain .  % Do Not. 1 -   /2  X _  x _ Intervals & Level of Confidence Sampling Distribution of the Mean to

23 Data Variation measured by  Sample Size Level of Confidence (1 -  ) Intervals Extend from Factors Affecting Interval Width X - Z  to X + Z  xx

24 Mean  Unknown Confidence Intervals Proportion Finite Population  Known Confidence Interval Estimates

25 Assumptions  Population Standard Deviation is Known  Population is Normally Distributed  If Not Normal, use large samples Confidence Interval Estimate Confidence Intervals (  Known)

26 Mean  Unknown Confidence Intervals Proportion Finite Population  Known Confidence Interval Estimates

27 Assumptions  Population Standard Deviation is Unknown  Population Must Be Normally Distributed Use Student’s t Distribution Confidence Interval Estimate Confidence Intervals (  Unknown)

28 Shape similar to Normal Distribution Different t distributions based on df Has a larger variance than Normal Larger Sample size: t approaches Normal At n = 120 - virtually the same For any sample size true distribution of Sample mean is the student’s t For unknown  and when in doubt use t Student’s t Distribution

29 Standard Normal Z t 0 t (df = 5) t (df = 13) Bell-Shaped Symmetric ‘Fatter’ Tails Student’s t Distribution

30 Number of Observations that Are Free to Vary After Sample Mean Has Been Calculated Example  Mean of 3 Numbers Is 2 X 1 = 1 (or Any Number) X 2 = 2 (or Any Number) X 3 = 3 (Cannot Vary) Mean = 2 degrees of freedom = n -1 = 3 -1 = 2 Degrees of Freedom (df)

31 Upper Tail Area df.25.10.05 11.0003.0786.314 2 0.8171.886 2.920 30.7651.6382.353 t 0 Assume: n = 3 df = n - 1 = 2  =.10  /2 =.05 2.920 t Values.05 Student’s t Table

32 A random sample of n = 25 has = 50 and s = 8. Set up a 95% confidence interval estimate for .  .. 46695330 Example: Interval Estimation  Unknown

33 Sample of n = 30, S = 45.4 - Find a 99 % CI for, , the mean of each transmission system process. Therefore  =.01 and  =.005   266.75312.45 Example: Tracway Transmission

34 Mean  Unknown Confidence Intervals Proportion Finite Population  Known Confidence Interval Estimates

35 Assumptions  Sample Is Large Relative to Population n / N >.05 Use Finite Population Correction Factor Confidence Interval (Mean,  X Unknown) X  Estimation for Finite Populations

36 Mean  Unknown Confidence Intervals Proportion Finite Population  Known Confidence Interval Estimates

37 Assumptions  Two Categorical Outcomes  Population Follows Binomial Distribution  Normal Approximation Can Be Used  n·p  5 & n·(1 - p)  5 Confidence Interval Estimate Confidence Interval Estimate Proportion

38 A random sample of 1000 Voters showed 51% voted for Candidate A. Set up a 90% confidence interval estimate for p. p .484.536 Example: Estimating Proportion

39 Sample Size Too Big: Requires too much resources Too Small: Won’t do the job

40 What sample size is needed to be 90% confident of being correct within ± 5? A pilot study suggested that the standard deviation is 45. n Z Error    2 2 2 22 2 164545 5 2192220 .. Example: Sample Size for Mean Round Up

41 What sample size is needed to be within ± 5 with 90% confidence? Out of a population of 1,000, we randomly selected 100 of which 30 were defective. Example: Sample Size for Proportion Round Up 228 

42 Hypothesis Testing Draw inferences about two contrasting propositions (hypothesis) Determine whether two means are equal: 1.Formulate the hypothesis to test 2.Select a level of significance 3.Determine a decision rule as a base to conclusion 4.Collect data and calculate a test statistic 5.Apply the decision rule to draw conclusion

43 Hypothesis Formulation Null hypothesis: H 0 representing status quo Alternative hypothesis: H 1 Assumes that H 0 is true Sample evidence is obtained to determine whether H 1 is more likely to be true

44 Test Accept Reject Significance Level False True Type II Error Type I Error Probability of making Type I error  = level of significance Confidence Coefficient = 1-  Probability of making Type II error  = level of significance Power of the test = 1- 

45 Decision Rules Sampling Distribution: Normal or t distribution Rejection Region Non Rejection Region Two-tailed test,  /2 One-tailed test,  P-Values

46 Hypothesis Testing: Cases Two-Sample Means F-Test for Variances Proportions ANOVA: Differences of several means Chi-square for independence

47 Chapter Summary Sampling: Design and Methods Estimation: Confidence Interval Estimation for Mean (  Known) Confidence Interval Estimation for Mean (  Unknown) Confidence Interval Estimation for Proportion

48 Chapter Summary Finite Populations Student’s t distribution Sample Size Estimation Hypothesis Testing Significance Levels: Type I/II errors ANOVA


Download ppt "Sampling and Statistical Analysis for Decision Making A. A. Elimam College of Business San Francisco State University."

Similar presentations


Ads by Google