Creating User Interfaces Review midterm Sampling Homework: User observation reports due next week.



Sampling Basic technique when it is impossible or too expensive to measure everything/everybody. Premise: it is possible to get a random sample, meaning every member of the whole population is equally likely to be in the sample. NOTE: not a substitute for directly monitoring activity on / with the interface.

Source The Cartoon Guide to Statistics by Larry Gonick and Woollcott Smith, HarperResource. Procedures (formulas) presented without proof, though, hopefully, motivated.

Task Want to know the percentage (proportion) of some large group –adults in USA –television viewers –web users For a particular thing –think the president is doing a good job –watched specific program –viewed specific commercial –visited specific website

Strategy: Sampling Ask a small group –phone –solicitation at a mall –Follow-up or prelude to access to webpage –other? Monitor actions of a small group, group defined for this purpose Monitor actions of a panel chosen ahead of time ALL THESE: make assumption that those in group are similar to the whole population.

Two approaches Estimation: give a confidence interval for the proportion p in the general population, at a chosen confidence level, based on the proportion p hat in the sample. Hypothesis testing: H0 (null hypothesis) p = p0 versus Ha p > p0

Estimation process Construct a sample of size n and determine p hat –Ask who they are voting for (for now, let this be binomial choice) Use this as estimate for actual proportion p. … but the estimate has a margin of error. This means : The actual value is within a range centered at p hat …UNLESS the sample was really strange. The confidence value specifies what the chances are of the sample being that strange.

Statement I'm 95% sure that the actual proportion is in the following range…. p hat – m <= p <= p hat + m Notice: if you want to claim more confidence, you need to make the margin bigger.

Image from Cartoon book You are standing behind a target. An arrow is shot at the target, at a specific point in the target. The arrow comes through to your side. You draw a circle (more complex than +/- error) and say Chances are: the target point is in this circle unless shooter was 'way off'. Shooter would only be way off X percent of the time. (Typically X is 5% or 1%.)

Mathematical basis The p hat values from repeated samples are themselves (approximately) normally distributed… –if sample and p satisfy certain conditions. Most samples produce p hat values that are close to the p value of the whole population. Only a small number of samples produce values that are way off. –Think of outliers of normal distribution

Actual (mathematical) process Can use these techniques when n*p>=5 and n*(1-p)>=5 (sample size must be this big). The p hat values are distributed close to a normal distribution with mean p and standard deviation sd(p hat) = square_root(p*(1-p)/n). Can estimate this using p hat in place of p in the formula! Choose the level of confidence you want (again, typically 95% or 99%, that is, a 5% or 1% chance of being wrong). For 95% confident (5%), look up (or learn by heart) the value 1.96: this is the number of standard deviations such that 95% of values fall within that distance of the mean. So .95 is P(-1.96 <= (p - p hat)/sd(p hat) <= 1.96)
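A minimal Python sketch of the rule-of-thumb check and the standard deviation formula on this slide. The function names normal_approx_ok and sd_p_hat are illustrative, not from the slides or the book, and the example plugs in p hat where the true p is unknown, as the slide suggests.

```python
import math

def normal_approx_ok(n, p):
    """Rule of thumb from the slide: n*p >= 5 and n*(1-p) >= 5."""
    return n * p >= 5 and n * (1 - p) >= 5

def sd_p_hat(p, n):
    """Standard deviation of the sample proportion: sqrt(p*(1-p)/n)."""
    return math.sqrt(p * (1 - p) / n)

# Illustrative inputs (assumptions): a sample of 1000 with p hat = .55,
# and a sample of 20 with p hat = .1 that fails the check.
print(normal_approx_ok(1000, 0.55))
print(normal_approx_ok(20, 0.1))
print(round(sd_p_hat(0.55, 1000), 4))
```

The second call fails the check because 20 * .1 is only 2, so the normal approximation should not be trusted for that sample.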

Notes p is less than 1 so (1-p) is positive. Margin of error decreases as p moves away from .5 in either direction. (Check using Excel.) –if sample produces a very high (close to 1) or very low (close to 0) value, p * (1-p) gets smaller –(.9)*(.1) = .09; (.8)*(.2) = .16; (.6)*(.4) = .24; (.5)*(.5) = .25

Notes Need to quadruple the n to halve the margin of error, because the margin is proportional to 1/square_root(n).
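A quick way to check the quadrupling claim, sketched in Python. The helper name margin and the sample sizes 250 and 1000 are chosen only for illustration; the formula is the margin of error z * sqrt(p hat * (1 - p hat) / n) used throughout these slides.

```python
import math

def margin(p_hat, n, z=1.96):
    """Margin of error for a sample proportion at the given z-score."""
    return z * math.sqrt(p_hat * (1 - p_hat) / n)

m_250 = margin(0.55, 250)
m_1000 = margin(0.55, 1000)   # four times the sample size

# The ratio is sqrt(1000 / 250) = 2, whatever p hat and z are.
ratio = m_250 / m_1000
print(round(ratio, 6))
```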

Formula m = z * square_root((p hat)*(1-p hat)/n) Use a value z called the z-score (z transform) –for 95% confidence, the value is 1.96

Level of confidence   1-leg or 2-leg (tail area)   Standard deviations (z-score)
80%                   .10 or .20                   1.28
90%                   .05 or .10                   1.645
95%                   .025 or .05                  1.96
99%                   .005 or .01                  2.58
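The z-scores for these confidence levels can also be computed instead of looked up. The sketch below uses NormalDist from Python's standard statistics module (Python 3.8 or later); the function name z_for_confidence is an assumption for illustration.

```python
from statistics import NormalDist

def z_for_confidence(level):
    """Two-sided z-score: the central `level` share of a standard
    normal distribution lies within +/- z standard deviations."""
    tail = (1 - level) / 2            # area left in each leg
    return NormalDist().inv_cdf(1 - tail)

for level in (0.80, 0.90, 0.95, 0.99):
    print(f"{level:.0%}  z = {z_for_confidence(level):.3f}")
```

The 95% and 99% rows should agree with the 1.96 and 2.58 values quoted on the slides, up to rounding.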

Mechanics Process is: gather data (get p hat and n); choose confidence level; using table, calculate margin of error. Book example: 55% (.55 of sample of 1000) said they backed the politician. sd(p hat) = square_root((.55)*(.45)/1000) = .0157 Multiply by z-score (e.g., 1.96 for 95% confidence) to get margin of error. So p is within the range .550 – (1.96)*(.0157) to .550 + (1.96)*(.0157), that is, .519 to .581 or 51.9% to 58.1%
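The book example can be reproduced in a few lines of Python. This is only a sketch of the steps listed above; proportion_interval is a made-up name, and the z-score is passed in by hand rather than read from a table.

```python
import math

def proportion_interval(p_hat, n, z=1.96):
    """Return (lower, upper) bounds of the confidence interval for p."""
    sd = math.sqrt(p_hat * (1 - p_hat) / n)   # sd(p hat), with p hat in place of p
    m = z * sd                                # margin of error
    return p_hat - m, p_hat + m

# Book example: 55% of a sample of 1000, 95% confidence.
low, high = proportion_interval(0.55, 1000)
print(f"{low:.3f} to {high:.3f}")
```

Calling the same function with n = 300, or with z = 2.58 for 99% confidence, shows how the interval widens.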

Example, continued 51.9% to 58.1% may round to 52% to 58% or may say 55% plus or minus 3 percent. What is typically left out is that there is a 1/20 chance that the actual value is NOT in this range.

95% confident means 95/100 probability that this is true, 5/100 chance that this is not true. 5/100 is the same as 1/20, so there is only a 1/20 chance that this is not true. Only 1 in 20 truly random samples would give an answer that deviated more than the margin from the real value –ASSUMING NO INTRINSIC QUALITY PROBLEMS –ASSUMING IT IS RANDOMLY CHOSEN
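One way to see the "1 in 20" claim is to simulate it. The sketch below assumes a true proportion of .55, draws many truly random samples of 1000, and counts how often the 95% interval around p hat contains the true value. The function name coverage, the number of trials, and the seed are all arbitrary choices for illustration.

```python
import math
import random

def coverage(p=0.55, n=1000, trials=2000, z=1.96, seed=1):
    """Share of simulated samples whose interval contains the true p."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        # Each respondent says "yes" with probability p.
        yes = sum(rng.random() < p for _ in range(n))
        p_hat = yes / n
        m = z * math.sqrt(p_hat * (1 - p_hat) / n)
        if p_hat - m <= p <= p_hat + m:
            hits += 1
    return hits / trials

rate = coverage()
print(rate)
```

The printed share should land near .95, so roughly 1 sample in 20 misses. The simulation says nothing about samples that are not truly random.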

99% confidence means [Give fraction positive] [Give fraction negative]

Why Confidence intervals given mainly for 95% and 99%?? History, tradition, doing others required more computing….

Let's ask a question How many of you watched the last Super Bowl? –Sample is whole class How many registered to vote? –Sample size is number in class 18 and older ????

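Excel: columns A & B (labels in A, values or formulas in B)
Row 1   students              (entered)
Row 2   watchers              (entered)
Row 3   psample               =B2/B1
Row 4   sd                    =SQRT(B3*(1-B3)/B1)
Row 5   Ztransform for 95%    1.96
Row 6   margin                =B5*B4
Row 7   lower                 =MAX(0,B3-B6)
Row 8   upper                 =MIN(B3+B6,1)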

Variation of book problem Say sample was 300 (not 1000). sd(p hat) = square_root((.55)*(.45)/300) = .0287 Bigger number, because the divisor is smaller. The circle around the arrow is larger: the margin is larger because it was based on a smaller sample. Multiplying by 1.96 gives .056; subtracting from and adding to .55 gives .494 to .606. You/we are 95% sure that the true value is in this range. Oops: may be better, but may be worse. The fact that the lower end is below .5 is significant for an election!

Exercise size of sample is n; proportion in sample is p hat; confidence level produces factor called the z-score –Can be anything, but common values are (80%), 90%, 95%, 99% –Use table. For example, 95% value is 1.96; 99% is 2.58. Calculate margin of error m –m = zscore * sqrt((p hat)*(1-p hat)/n) Actual value is >= p hat – m and <= p hat + m

Opportunity sample Common situation –people assigned/asked to have a meter attached to their TVs –people asked/voluntarily sign up to have a meter (software) installed in their computers –people asked during a Web session to participate in survey –students in a specific class! Practice is to determine categories (demographics) and project the sample results for each subpopulation onto the whole population –For example, if actual population was 52% female and 48% male, and sample (panel) is 60% male and 40% female, use proportions to adjust result… But maybe this fact hides a problem with the sample. Has negative features of any opportunity sample –Are these folks different than others in their (sub)population?

Requirements Model / Categories must be well-defined and valid –Hispanic versus (Cuban, others) in Florida in 2000 Need independent analysis of each subpopulation's representation in the general population. The sample sizes are the individual Ns, making the margins of error larger.

Adjustment from panel data Panel of 10: 6 females, 4 males. Population is 52% female and 48% male. Female panelists: 5 liked interface, 1 didn't. Male panelists: 2 liked interface, 2 didn't. Estimate for whole population (size P): (5/6)*.52*P + (2/4)*.48*P, which is about .673*P
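The panel adjustment above is a weighted average: each category's "liked" rate is weighted by that category's share of the real population, not its share of the panel. A small Python sketch, in which the function name weighted_estimate and the tuple layout are assumptions made for illustration:

```python
def weighted_estimate(groups):
    """groups: (population_share, liked, panel_size) for each category.
    Returns the estimated proportion of the whole population."""
    return sum(share * liked / size for share, liked, size in groups)

panel = [
    (0.52, 5, 6),   # females: 52% of population, 5 of 6 panelists liked it
    (0.48, 2, 4),   # males: 48% of population, 2 of 4 panelists liked it
]

estimate = weighted_estimate(panel)
raw = (5 + 2) / 10   # unadjusted panel result, for comparison
print(round(estimate, 3), raw)
```

Multiplying the estimate by the population size P gives the expected number of people who like the interface. With only 6 and 4 panelists per category, the margin of error on each rate is very large.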

Critical part of surveys and survey analysis: Understand the exact wording of question. Understand definition of categories of population. Don't make assumptions… Admire Michelle Obama example Belief in Holocaust example

Usability research Often aims for qualitative, not quantitative results –Ideas, critical factors Note: there are fields of study –Non-numeric statistics –Qualitative research Still necessary to be systematic. AD: consider taking Statistics!

Homework Continue work on user observation studies –This is qualitative work