DECS 430-A Business Analytics I: Class 5

Slides:



Advertisements
Similar presentations
SADC Course in Statistics General approaches to sample size determinations (Session 12)
Advertisements

Managerial Statistics Why are we all here? In a classroom, near the beginning of a two-year professional program in management, getting ready to start.
Mean, Proportion, CLT Bootstrap
Sampling A population is the total collection of units or elements you want to analyze. Whether the units you are talking about are residents of Nebraska,
Sampling Methods.
Confidence Intervals for Proportions
1-1 Copyright © 2015, 2010, 2007 Pearson Education, Inc. Chapter 18, Slide 1 Chapter 18 Confidence Intervals for Proportions.
Who and How And How to Mess It up
Sampling and Sample Size Determination
Aaker, Kumar, Day Ninth Edition Instructor’s Presentation Slides
CHAPTER twelve Basic Sampling Issues Copyright © 2002
Copyright © 2010 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
Chapter 12 Sample Surveys
The Excel NORMDIST Function Computes the cumulative probability to the value X Business Statistics: A First Course, 5e © 2009 Prentice-Hall, Inc
Chapter 7 Selecting Samples
Sampling Methods.
HL2 MARKETING THEORY: QUANTITATIVE MARKET RESEARCH IB BUSINESS & MANAGEMENT A COURSE COMPANION.
Marketing Research Aaker, Kumar, Day Seventh Edition Instructor’s Presentation Slides.
Chapter 19: Confidence Intervals for Proportions
Managerial Statistics Why are we all here? In a classroom, partway through an executive M.B.A. program in management, getting ready to start a course on.
Chapter 3 Goals After completing this chapter, you should be able to: Describe key data collection methods Know key definitions:  Population vs. Sample.
Sampling UAPP 702 Research Methods for Urban & Public Policy
Copyright © 2011 Pearson Education, Inc. Samples and Surveys Chapter 13.
Chapter 12: AP Statistics
COLLECTING QUANTITATIVE DATA: Sampling and Data collection
McGraw-Hill/Irwin McGraw-Hill/Irwin Copyright © 2009 by The McGraw-Hill Companies, Inc. All rights reserved.
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT OSMAN BIN SAIF Session 13.
Sampling. Concerns 1)Representativeness of the Sample: Does the sample accurately portray the population from which it is drawn 2)Time and Change: Was.
1 1 Slide © 2005 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
IB Business and Management
Sampling: Theory and Methods
Sampling Distribution
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 1 Section 3 – Slide 1 of 28 Chapter 1 Section 3 Other Effective Sampling Methods.
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
Surveys and Questionnaires Government agencies, news organizations, and marketing companies often conduct surveys. The results can be factual or subjective.
Section 1.2 ~ Sampling Introduction to Probability and Statistics Ms. Young.
Sampling Design and Analysis MTH 494 LECTURE-12 Ossam Chohan Assistant Professor CIIT Abbottabad.
Non-Experimental designs: Surveys Psych 231: Research Methods in Psychology.
Tahir Mahmood Lecturer Department of Statistics. Outlines: E xplain the role of sampling in the research process D istinguish between probability and.
Copyright © 2012 Pearson Education. All rights reserved © 2010 Pearson Education Copyright © 2012 Pearson Education. All rights reserved. Chapter.
1. Population and Sampling  Probability Sampling  Non-probability Sampling 2.
Learning Objectives Explain the role of sampling in the research process Distinguish between probability and nonprobability sampling Understand the factors.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Copyright © 2010 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
Chapter Eleven The entire group of people about whom information is needed; also called the universe or population of interest. The process of obtaining.
 The point estimators of population parameters ( and in our case) are random variables and they follow a normal distribution. Their expected values are.
Chapter 19 Confidence intervals for proportions
Chapter 10 Sampling: Theories, Designs and Plans.
Bangor Transfer Abroad Programme Marketing Research SAMPLING (Zikmund, Chapter 12)
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
Chapter 3 Surveys and Sampling © 2010 Pearson Education 1.
Probability Sampling. Simple Random Sample (SRS) Stratified Random Sampling Cluster Sampling The only way to ensure a representative sample is to obtain.
1 Data Collection and Sampling ST Methods of Collecting Data The reliability and accuracy of the data affect the validity of the results of a statistical.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
Sampling Design and Analysis MTH 494 LECTURE-11 Ossam Chohan Assistant Professor CIIT Abbottabad.
The inference and accuracy We learned how to estimate the probability that the percentage of some subjects in the sample would be in a given interval by.
Statistics 19 Confidence Intervals for Proportions.
Using Surveys to Design and Evaluate Watershed Education and Outreach Day 5 Methodologies for Implementing Mailed Surveys Alternatives to Mailed Surveys.
Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 3 – Slide 1 of 28 Chapter 1 Section 3 Other Effective Sampling.
Chapter 10 Confidence Intervals for Proportions © 2010 Pearson Education 1.
AC 1.2 present the survey methodology and sampling frame used
Marketing Research Aaker, Kumar, Leone and Day Eleventh Edition
SAMPLING (Zikmund, Chapter 12.
Sampling: Theory and Methods
Estimating Means and Proportions
Polling If the individuals in the population differ in some qualitative way, we often wish to estimate the proportion / fraction / percentage of the population.
SAMPLING (Zikmund, Chapter 12).
Confidence Intervals for Proportions
Confidence Intervals for Proportions
Presentation transcript:

DECS 430-A Business Analytics I: Class 5 Sampling Polling, and estimating proportions Choosing a sample size Sampling methods Stratified sampling, cluster sampling Sampling problems Non-response bias, measurement bias Optimization (Excel’s “Solver”) Adverse selection

Polling If the individuals in the population differ in some qualitative way, we often wish to estimate the proportion / fraction / percentage of the population with some given property. For example: We track the sex of purchasers of our product, and find that, across 400 recent purchasers, 240 were female. What do we estimate to be the proportion of all purchasers who are female, and how much do we trust our estimate?

First, the Estimate Let Obviously, this will be our estimate for the population proportion. But how much can this estimate be trusted?

And Now, the Trick Imagine that each woman is represented by a “1”, and each man by a “0”. Then the proportion (of the sample or population) which is female is just the mean of these numeric values, and so estimating a proportion is just a special case of what we’ve already done!

The Result Estimating a mean: Estimating a proportion: The example: [When all of the numeric values are either 0 or 1, s takes the special form shown above.] The example:

Multiple-Choice Questions If the Republican Party’s candidate were to be chosen today, which one would you most prefer? Romney, Cain, Bachman, Perry, Gingrich, Santorum, Paul, Huntsman, none The results are reported as if 9 separate “yes/no” questions had been asked. If the Republican Party’s candidate were to be chosen today, which of these would have your approval? The same reporting method is used.

Choice of Sample Size Set a “target” margin of error for your estimate, based on your judgment as to how small will be small enough for those who will be using the estimate to make decisions. There’s no magic formula here, even though this is a very important choice: Too large, and your study is useless; too small, and you’re wasting money.

Estimating a Proportion: Polling Pick the target margin of error. Why do news organizations always use 3% or 4% during the election season? Because that’s the largest they can get away with. So, for example, n=400 (resp., 625, or 1112) assures a margin of error of no more than 5% (resp., 4%, or 3%).

Estimating a Mean: Choice of Sample Size Set the target margin of error. Solve From whence comes s? From historical data (previous studies) or from a pilot study (small initial survey). target = $25. s  $180. Set n = 207.

The “Square-Root” Effect : Choice of Sample Size after an Initial Study Given the results of a study, to cut the margin of error in half requires roughly 4 times the original sample size. And generally, the sample size required to achieve a desired margin of error =

How to Read Presidential-Race Polls When reading political polls, remember that the margin of error in an estimate of the “gap” between the two leading candidates is roughly twice as large as the poll's reported margin of error. The margin of error in the estimated “change in the gap” from one poll to the next is nearly three times as large as the poll's reported margin of error.

Summary Whenever you give an estimate or prediction to someone, or accept an estimate or prediction from someone, in order to facilitate risk analysis be sure the estimate is accompanied by its margin of error: A95%-confidence interval is If you’re estimating a mean using simple random sampling: If you’re estimating a proportion using simple random sampling: (one standard-deviation’s-worth of uncertainty inherent in the way the estimate was made) (your estimate) ± (~2) ·

How Will the Data be Collected? Primary Goals: No bias High precision Low cost Simple random sampling with replacement Typically implemented via systematic sampling Simple random sampling without replacement Typically done if a population list is available Stratified sampling Done if the population consists of subgroups with relative within-group homogeneity Cluster sampling Done if the population consists of (typically geographic) subgroups with substantial within-group heterogeneity Specialized approaches (e.g., tagging the U-Haul fleet)

Non-Response Bias One of the difficulties in surveying people (whether by mail, telephone, or direct approach) is that some choose not to respond. Assume that you have decided to conduct a study which requires a sample size of 100. If you only expect 10% of those surveyed to respond to your questionnaire, what should you do? A naïve answer is, "Simply send out 1000 questionnaires!" Unfortunately, the demographics of respondents and nonrespondents may differ substantially. To base estimates for the entire population merely on the data collected from respondents therefore might leave you exposed to substantial sampling bias.

Non-Response Bias A form of stratified sampling is typically used to overcome non- response bias. An initial mass mailing of questionnaires takes place, with identifying codes placed on each questionnaire (or its return envelope). When the submission deadline for responses is reached, estimates can be made for the stratum of "people who respond to the initial mailing." Crossing these people off the mailing list (by cross-referencing the codes on their responses) leaves a list of people all of whom are now known to be in the other "people who don't respond" stratum. The initial response rate is used to estimate the relative sizes of the two strata. A sample of those who didn't respond is now recontacted, using a more expensive approach designed to obtain responses from everyone. (The expense is typically related to an incentive of some kind.) Their data provides estimates for the second stratum, and the study can then be completed. See “Nonresponse_Bias.xls” for an example.

Measurement Bias Asking sensitive questions People will lie Software piracy Sexual activities Tax fraud People will lie Allow them to hide behind a mask of randomness

Randomized Response Surveys Larger samples are required for the same precision … But the bias can be completely eliminated. See Sampling.xls for details.

Using Excel’s “Solver” add-in Optimization Using Excel’s “Solver” add-in

Take My Car. Please! Have I got a deal for you! I've got this great used car, and I might be willing to sell. The actual value of the car depends on how well it has been maintained, and this is of course only known to me: Expressed in terms of the car's value to me, you believe it to be equally likely to be worth any amount between $0 and $5000. You, who would utilize the car to a greater extent than I, would derive 50% more value from ownership (e.g., if it's worth $3000 to me, then it's worth $4500 to you). How much are you willing to offer me? (I'll interpret your offer as "take-it-or-leave-it.")

Adverse Selection You are subject to adverse selection whenever You offer to engage in a transaction with another party, and that party can either accept or refuse your offer. The other party holds information not yet available to you concerning the value to you of the transaction. The other party is most likely to accept the offer (i.e., to select to do the deal) when the information is "bad news" (i.e., adverse) to you.

Adverse Selection: Dealing with It We need to be able to compute E[ V | V  v] . For normally-distributed uncertainty, this can be done analytically. (See Adverse_Selection_plus.xls)

Adverse Selection: Examples Making a buyout offer Setting an insurance premium getting (forcing) healthy young people to carry insurance is critical to the ACA Giving bid/ask quotes Auctions with objective value uncertainty contracting (unknown costs) natural resource sales (unknown supply) the “Winner’s Curse” debt auctions (unknown post-auction market price) Here’s another Saturday night … mothers teach daughters to avoid giving bad signals

Course Finale We’ve covered … Enough probability to get you started in FINC-430, OPNS-430, and other courses dealing with risk. Enough statistics to begin DECS-431, on regression analysis. Enough warning to provide a bit of protection against common errors. Good luck, and bon voyage!