Stat 155, Section 2, Last Time Producing Data How to Sample? –History of Presidential Election Polls Random Sampling Designed Experiments –Treatments &

Slides:



Advertisements
Similar presentations
SADC Course in Statistics Revision using CAST (Session 04)
Advertisements

One-Sample t-Tests. A Story Problem An educational consulting firm claims its new reading curriculum improves the mean reading score of children. Last.
12.1 – Experiments, Surveys, & Observational Studies.
Chapter 8: Estimating with Confidence
Section 9.3 Inferences About Two Means (Independent)
GATHERING DATA Chapter Experiment or Observe?
MA 102 Statistical Controversies Monday, January 28, 2002 Today: Discussion of Statistical Thought, First Ideas About Data, and Exercises. Introduction.
Today Today: Chapter 9 Assignment: Recommended Questions: 9.1, 9.8, 9.20, 9.23, 9.25.
12.The Chi-square Test and the Analysis of the Contingency Tables 12.1Contingency Table 12.2A Words of Caution about Chi-Square Test.
Chapter 9 Hypothesis Testing 9.4 Testing a Hypothesis about a Population Proportion.
MA 102 Statistical Controversies Wednesday, February 13, 2002 Today: Discuss of data ethics, and exercises Measuring: Turning ideas to numbers Exercise.
Active Learning Lecture Slides
MA 102 Statistical Controversies Monday, February 18, 2002 Today: Discuss Chapter 9 concepts and exercises Review Part 1 of the text Test 1 question types.
Today Today: Chapter 8, start Chapter 9 Assignment: Recommended Questions: 9.1, 9.8, 9.20, 9.23, 9.25.
Statistics Pooled Examples.
Statistics 3502/6304 Prof. Eric A. Suess Chapter 2.
4.2 Statistics Notes What are Good Ways and Bad Ways to Sample?
Apr. 8 Stat 100. To do Read Chapter 21, try problems 1-6 Skim Chapter 22.
AP Statistics Chapter 9 Notes.
June 10, 2008Stat Lecture 9 - Proportions1 Introduction to Inference Sampling Distributions for Counts and Proportions Statistics Lecture 9.
Estimation: Sampling Distribution
1 Chapter 1. Section 1-4. Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION E LEMENTARY.
+ Warm-Up4/8/13. + Warm-Up Solutions + Quiz You have 15 minutes to finish your quiz. When you finish, turn it in, pick up a guided notes sheet, and wait.
Confidence Intervals A confidence interval is an interval that a statistician hopes will contain the true parameter value. A level C confidence interval.
Stat 31, Section 1, Last Time Correlation Linear Regression –Idea – graphics –Computation –Interpretation.
Bias and Variability Lecture 38 Section 8.3 Wed, Mar 31, 2004.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Unit 5: Estimating with Confidence Section 10.1 Confidence Intervals: The Basics.
Math IIIWarm Up 4/3/14 1.What are the differences between the 4 types of sampling? 2.What are the 2 types of studies and how are they defined?
Do What Needs to Be Done Today. The secret of happy successful living is to do what needs to be done now, and not worry about the past or the future.
Slide 4- 1 © 2012 Pearson Education, Inc. Active Learning Lecture Slides For use with Classroom Response Systems Introductory Statistics: Exploring the.
Government October 12, Warm up—What is a political party? 2. Chapter 5: Political Parties (p Main Ideas 1-16; Critical Thinking 1-3) DUE.
Section A Confidence Interval for the Difference of Two Proportions Objectives: 1.To find the mean and standard error of the sampling distribution.
Suppose we wanted to estimate the proportion of registered voters who are more enthusiastic about voting in this election compared to other years? Suppose.
Section 2.1: Taking a Good Sample. Sampling Design  Design of a sample refers to the method used to collect the data.  A proper sampling design must.
Section 6-3 Estimating a Population Mean: σ Known.
Statistics – OR 155 Section 1 J. S. Marron, Professor Department of Statistics and Operations Research.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
Warm Up In May 2006, the Gallup Poll asked 510 randomly sampled adults the question “Generally speaking, do you believe the death penalty is applied fairly.
Chapter 22 Comparing Two Proportions. Comparing 2 Proportions How do the two groups differ? Did a treatment work better than the placebo control? Are.
Plan for Today: Chapter 1: Where Do Data Come From? Chapter 2: Samples, Good and Bad Chapter 3: What Do Samples Tell US? Chapter 4: Sample Surveys in the.
10.1 Estimating with Confidence Chapter 10 Introduction to Inference.
Chapter 9 Day 2. Warm-up  If students picked numbers completely at random from the numbers 1 to 20, the proportion of times that the number 7 would be.
Essential Statistics Chapter 191 Comparing Two Proportions.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Spring 2016 Room 150 Harvill.
Collecting Sample Data Chapter 1 Section 4 Part 2.
 A national opinion poll recently estimated that 44% (p-hat =.44) of all adults agree that parents of school-age children should be given vouchers good.
Course Extra for Part II. Quizzes  Any questions from the quizzes?
Hypothesis Testing Involving One Population Chapter 11.
Inference: Conclusion with Confidence
HW Page 23 Have HW out to be checked.
Chapter 9 Roadmap Where are we going?.
Introduction to Statistics for the Social Sciences SBS200 - Lecture Section 001, Spring 2017 Room 150 Harvill Building 9:00 - 9:50 Mondays, Wednesdays.
Introduction to Hypothesis Test – Part 2
STAT 312 Chapter 7 - Statistical Intervals Based on a Single Sample
Probability and Statistics
Inference: Conclusion with Confidence
Statistics 350 Lecture 4.
American History Chapter 7 Sections 1,2 and 3.
Chapter 9 Hypothesis Testing
الأستاذ المساعد بقسم المناهج وطرق التدريس
Hypothesis tests for the difference between two means: Independent samples Section 11.1.
Introduction to Inference
Select and draw conclusions from samples

Confidence Intervals: The Basics
Probability and Statistics
WARM – UP.
CHAPTER- 3.1 ERROR ANALYSIS.
The Powers of the Presidency
Inference for Proportions
Presentation transcript:

Stat 155, Section 2, Last Time Producing Data How to Sample? –History of Presidential Election Polls Random Sampling Designed Experiments –Treatments & Levels –Controls –Randomization

Reading In Textbook Approximate Reading for Today’s Material: Pages , Approximate Reading for Next Class: Pages ,

Chapter 3: Producing Data (how this is done is critical to conclusions) Section 3.1: Statistical Settings 2 Main Types: I.Observational Study II.Designed Experiment

Random Sampling Key Idea: “random error” is smaller than “unintentional bias”, for large enough sample sizes How large? Current sample sizes: ~1, ,000 Note: now << 50,000 used in So surveys are much cheaper (thus many more done now….)

Random Sampling How Accurate? Can (& will) calculate using “probability” Justifies term “scientific sampling” 2 nd improvement over quota sampling

Controlled Experiments Common Type: compare “treatment” with “placebo”, a “sham treatment” that controls for psychological effects (think you are better, just because you are treated, so you are better…) Called a “blind” experiment

Controlled Experiments Further Refinement: “Double Blind” experiment means neither patient, nor doctor knows is real or not Eliminates possible doctor bias

Design of Experiments 2.Randomization Useful method for choosing groups above (e.g. Treatment and Control) Recall: Different from “just choose some”, instead means “make each equally likely”

Design of Experiments 2.Randomization Big Plus: Eliminates biases, i.e. effects of “lurking variables” (same as random choice of samples, again pay price of added variability, but well worth it)

Example of an Experiment (to tie above ideas together) Gastric Freezing: Treatment for stomach ulcers –Anesthetize patient –Put balloon in stomach –Fill with freezing coolant

Gastric Freezing Initial Experiment, patients, all cured Became popular, and better than surgery But there were some skeptics…. Was it a Placebo Effect??? I.e. was fact of “some type of treatment” enough for “cure”

Gastric Freezing Approach, 1963: (i)Controlled Experiment (some treated others not, shows who gets better with no treatment) (ii)Randomize: Eliminates other sources of bias, i.e. lurking variables (randomly choose: treated or not)

Gastric Freezing (iii)“Blind” Patient doesn’t know if treated (Got a balloon in stomach or not? Both groups got that, but only Treatment group got freezing coolant) (iv)“Double Blind”: Doctor doesn’t know if treated or not. (somebody else controls freezing coolant) Important: since doctor decides if “cured”

Gastric Freezing Results: Treatment Group: 82 Control Group: 78 Initially: Treatment Control No Symptoms: 29% 29% Improved: 47% 39% After 24 Months: Relapse: 45% 39%

Gastric Freezing Results: No strong effect of treatment over control is apparent. All placebo effect? Analysis: Will build tools to show: “Difference within natural variation, assuming there is no difference”

Gastric Freezing Historical Notes: Famous case for eliminating “ineffective treatments” Showed importance of double blind controlled experiments That are commonly used today Stomach ulcers currently very effectively treated with drugs

And Now for Something Completely Different Pepsi Challenge: Try a blind taste test of Pepsi vs. Coca-Cola Successful ad campaign Most thought Pepsi tasted better Many were surprised by blind test result

Class Experiment Pepsi Challenge Do a careful “double blind approach” –Taste test Pepsi vs. Coke –Where “taster” doesn’t know which is which (“blind” part of experiment) –And same for “giver” (“double blind” part)

Pepsi Challenge Approach: Groups of 3 Each does each job once: Pourer (put your name and others on slip, pour cups outside room, and stays out) Giver (also puts names on slip, goes to get cups after pouring) Taster (always inside room) Giver: record results on their slip

Pepsi Challenge Ideas: Create “double blind”, i.e. “no knowledge of doctor” by pourer filling cups in room, so that giver does not see Avoid “color association” by randomizing Pourer does not watch tasting (no telegraphing with big grin….) After tasting: compare notes, check forms Will report, and analyze results later

Sec. 3.4: Basics of “Inference” Idea: Build foundation for statistical inference, i.e. quantitative analysis (of uncertainty and variability) Fundamental Concepts: Population described by parameters e.g. mean, SD. Unknown, but can get information from…

Fundamental Concepts Last page: Population, here think about parameters:, Sample (usually random), described by corresponding “statistics” e.g. mean, SD. (Will become important to keep these apart)

Population vs. Sample E.g. 1: Political Polls Population is “all voters” Parameter of interest is: = % in population for Candidate A (bigger than 50% or not?) Sample is “voters asked by pollsters” Statistic is = % in sample for A (careful to keep these straight!)

Population vs. Sample E.g. 1: Political Polls Notes – is an “estimate” of –Variability is critical –Will construct models of variability –Possible when sample is random –Recall random sampling also reduces bias

Population vs. Sample E.g. 2: Measurement Error (seemingly quite different…) Population is “all possible measurem’ts” (a thought experiment only) Parameters of interest are: = population mean = population SD

Population vs. Sample E.g. 2: Measurement Error Sample is “measurem’ts actually made” Statistics are: = mean of measurements = SD of measurements

Population vs. Sample E.g. 2: Measurement Error Notes: – estimates –Again will model variability –“Randomness” is just a model for measurement error

Population vs. Sample HW:

Basic Mathematical Model Sampling Distribution Idea: Model for “possible values” of statistic E.g. 1: Distribution of in “repeated samplings” (thought experiment only) E.g. 2: Distribution of in “repeated samplings” (again thought experiment)

Basic Mathematical Model Sampling Distribution Tools Can study these with: Histograms  “shape”: often Normal Mean  Gives measure of “bias” SD  Gives measure of “variation”

Bias and Variation Graphical Illustration Scanned from text: Fig. 3.12

Bias and Variation Class Example: Results from previous class on “Estimate % of males at UNC” Recall several approaches to estimation (3 bad, one sensible)

E.g. % Males at UNC At top: –Counts –Corresponding proportions (on [0,1] scale) –Bin Grid (for histograms on [0,1] numbers) Next Part: –Summarize mean of each –Summarize SD (spread) of each Histograms (appear next)

E.g. % Males at UNC Recall 4 way to collect data: Q1:Sample from class Q2:Stand at door and tally –Q1 “less spread and to left”? Q3:Make up names in head –Q3 “more to right”? Q4:Random Sample –Supposed to be best, can we see it?

E.g. % Males at UNC Better comparison: Q4 vs. each other one Use “interleaved histograms” Q1 & Q4: –Q1 has smaller center: –i.e. “biased”, since Class Population –And less spread: –since “drawn from smaller pool”

E.g. % Males at UNC Q2 & Q4: –Centers have Q2 bigger: –Reflects bias in door choice –And Q2 is “more spread” : –Reflects “spread in doors chosen” + “sampling spread”

E.g. % Males at UNC Q3 & Q4: –Center for Q3 is bigger: –Reflects “more people think of males”? –And Q3 is “more spread” : –Reflects “more variation in human choice”

E.g. % Males at UNC A look under the hood: Highlight an interleaved Chart Click Chart Wizard Note Bar (and interleaved subtype) Different colors are in “series” Computed earlier on left Using Tools  Data Anal.  Histo’m

E.g. % Males at UNC Interesting question: What is “natural variation”? Will model this soon. This is “binomial” part of this example, which we will study later.

Bias and Variation HW: 3.66 (Hi bias – hi var, lo bias – lo var, lo bias – hi var, hi bias – lo var) 3.69

And Now for Something Completely Different Cool movie suggested by Sander Buitelaar Street scene in Amsterdam Photography conveys situation “Plein” = Plaza “football dribble” = …

Chapter 4: Probability Goal: quantify (get numerical) uncertainty Key to answering questions above (e.g. what is “natural variation” in a random sample?) (e.g. which effects are “significant”) Idea: Represent “how likely” something is by a number

Simple Probability E.g. (will use for a while, since simplicity gives easy insights) Roll a die (6 sided cube, faces 1,2,…,6) 1 of 6 faces is a “4” So say “chances of a 4” are: “1 out of 6”. What does that number mean? How do we find such for harder problems?

Simple Probability A way to make this precise: “Frequentist Approach” In many replications (repeat of die roll), expect about of total will be 4s Terminology (attach buzzwords to ideas): Think about “outcomes” from an “experiment” e.g. #s on die e.g. roll die, observe #

Simple Probability Quantify “how likely” outcomes are by assigning “probabilities” I.e. a number between 0 and 1, to each outcome, reflecting “how likely”: Intuition: 0 means “can’t happen” ½ means “happens half the time” 1 means “must happen”

Simple Probability HW: C9: Match one of the probabilities: 0, 0.01, 0.3, 0.6, 0.99, 1 with each statement about an event: a.Impossible, can’t occur. b.Certain, will happen on every trial. c.Very unlikely, but will occur once in a long while. d.Event will occur more often than not.