Chapter 3 Producing Data.

Slides:



Advertisements
Similar presentations
Aim: What must we understand before designing an study? HW#2: complete the assignment on the last slide (must be typed and article must be included when.
Advertisements

Designing Experiments
Chapter 7: Data for Decisions Lesson Plan
GATHERING DATA Chapter Experiment or Observe?
Chapter 5 Producing Data
AP Statistics Chapter 5 Notes.
The Practice of Statistics
Inference for proportions - Comparing 2 proportions IPS chapter 8.2 © 2006 W.H. Freeman and Company.
Inference for proportions - Comparing 2 proportions IPS chapter 8.2 © 2006 W.H. Freeman and Company.
5.1 Designing Samples.  Differentiate between an observational study and an experiment  Learn different types of sampling techniques  Use a random.
Chapter 5 Data Production
Chapter 1: Introduction to Statistics
BPS - 5th Ed. Chapter 81 Producing Data: Sampling.
CHAPTER 8 Producing Data: Sampling BPS - 5TH ED.CHAPTER 8 1.
STT215: CHAPTER 3 PRODUCING DATA Dr. Cuixian Chen Chapter 3: Producing Data STT215 1.
Section 5.2 Designing Experiments AP Statistics
Aim: Review Session 2 for Final Randomization and Experimental Designs HW: Worksheet Final Exam 6/9 and 6/11.
AP Statistics.  Observational study: We observe individuals and measure variables of interest but do not attempt to influence responses.  Experiment:
Part III Gathering Data.
Collection of Data Chapter 4. Three Types of Studies Survey Survey Observational Study Observational Study Controlled Experiment Controlled Experiment.
Chapter 5: Producing Data “An approximate answer to the right question is worth a good deal more than the exact answer to an approximate question.’ John.
Chapter 7: Data for Decisions Lesson Plan Sampling Bad Sampling Methods Simple Random Samples Cautions About Sample Surveys Experiments Thinking About.
Deciding what and how to measure
Designing Samples Chapter 5 – Producing Data YMS – 5.1.
Conducting A Study Designing Sample Designing Experiments Simulating Experiments Designing Sample Designing Experiments Simulating Experiments.
Aim: How can we assign experimental units to treatments in a way that is fair to all of the treatments?
C HAPTER 5: P RODUCING D ATA Section 5.1 – Designing Samples.
BY: Nyshad Thatikonda Alex Tran Miguel Suarez. How to use this power point 1) Click on the box with the number. Best to click on the black part and not.
AP STATISTICS LESSON AP STATISTICS LESSON DESIGNING DATA.
AP STATISTICS Section 5.1 Designing Samples. Objective: To be able to identify and use different sampling techniques. Observational Study: individuals.
EXTRA PRACTICE WITH ANSWERS
Observations vs. Experiments Target Goals: I can distinguish between an observational study and an experiment. I can explain how a lurking variable in.
 Observations that you or someone else records  Data is more than numbers; it is numbers in context  Data isn’t only numbers; it is also the story.
Chapter 5.2 Designing experiments. Terminology The individuals on which the experiment is done are the experiment units. When the units are human beings.
Chapter 3 Producing Data. Observational study: observes individuals and measures variables of interest but does not attempt to influence the responses.
1. What is one method of data collection? 2. What is a truly random way to survey/sample people?
1 Chapter 11 Understanding Randomness. 2 Why Random? What is it about chance outcomes being random that makes random selection seem fair? Two things:
Chapter 3 Generating Data. Introduction to Data Collection/Analysis Exploratory Data Analysis: Plots and Measures that describe a set of measurements.
Silent Do Now (5 minutes) *Before you begin, grab a new weekly sheet and take out your homework!  An opinion poll calls 2000 randomly chosen residential.
Section 5.2 Designing Experiments AP Statistics October 27 th, 2014.
Producing Data 1.
Experiments Textbook 4.2. Observational Study vs. Experiment Observational Studies observes individuals and measures variables of interest, but does not.
EXTRA PRACTICE WITH ANSWERS
Chapter 4: Designing Studies
Chapter 5 Data Production
CHAPTER 4 Designing Studies
Warmup 1. One study of cell phones and the risk of brain cancer looked at a group of 469 people who have brain cancer. The investigators matched each.
Producing Data, Randomization, and Experimental Design
Producing Data, Randomization, and Experimental Design
Producing Data Chapter 5.
Daniela Stan Raicu School of CTI, DePaul University
Day 1 Parameters, Statistics, and Sampling Methods
DRILL If you needed to select 5 students from a group of 6250, how could you use the table of random digits to carry out the selection process. Starting.
Chapter 5: Producing Data
Chapter 4: Designing Studies
Chapter 4: Designing Studies
Chapter 4: Designing Studies
Chapter 4: Designing Studies
Day 1 Parameters, Statistics, and Sampling Methods
Chapter 4: Designing Studies
Chapter 4: Designing Studies
Chapter 4: Designing Studies
Chapter 4: Designing Studies
Chapter 3 producing data
Chapter 4: Designing Studies
Chapter 4: Designing Studies
Chapter 4: Designing Studies
Chapter 4: Designing Studies
Designing Samples Section 5.1.
Chapter 4: Designing Studies
Presentation transcript:

Chapter 3 Producing Data

Types of data collected Anecdotal data – data collected haphazardly (not representative!!) Available data – existing data (examples: internet, library, census bureau,….) Gather own data (takes money and time to get own data)

Some terminology Population – the entire group of individuals or objects of interest (answers the question: Who?) Sample – subset of the population on which information is obtained. Census-sample is the entire population Variable – characteristics of interest

Observational study vs Experiment Observational study – A study that observes individuals and measures variables of interest but does not attempt to influence the response. Experiment – A study that imposes some treatment on individuals in order to record their response.

Types of variables Response variable – the outcome of the study. Explanatory variable – variable(s) that attempt to explain the changes in the response Examples: Smoking and lung cancer Running on a treadmill and heart rate

Classroom Examples One study of cell phones and the risk of brain cancer looked at a group of 469 people who have brain cancer. The investigators matched each cancer patient with a person of the same sex, age, and race who did not have brain cancer, then asked about use of cell phones. Result: “Our data suggest that use of handheld cellular telephones is not associated with the risk of brain cancer.” Is this an observational study or experiment? Why? What are the explanatory and response variables? A typical hour of prime-time television shows 3-5 violent acts. Linking family interviews and police records shows a clear association between time spent watching TV as a child and later aggressive behavior. Is this an observational study or experiment? What are the explanatory and response variables? Suggest some lurking variables that could explain the aggressive behavior. An educational software company wants to compare the effectiveness of its computer animation for teaching cell biology with that of a textbook presentation. The company tests the biological knowledge of each group of first year college students, then randomly divides them into two groups. One group uses the animation, and the other studies the text. The company retests all the students and compares the increase in understanding of cell biology in the two groups. Is this an observational study or experiment? What are the explanatory and response variables?

3.1 Design of Experiments Experimental units – individual on which experiment is done. Treatment – specific experimental condition Factors = explanatory variables Placebo – false treatment to control for psychological effects. Example: Gastric freezing is a clever treatment for ulcers in the upper intestine. The patient swallows a deflated balloon with tubes attached, then a refrigerated liquid is pumped through the balloon for an hour (cooling will reduce production of acid and relieve ulcers). An experiment reported in the Journal of the American Medical Association showed that gastric freezing did reduce acid production and relieve ulcer pain. Later experiment included a control group (34% of the treatment group improved…..38% of the placebo group improved). Joint effects – combination of levels of two or more factors. Example: A maker of fabric for clothing is setting up a new line to finish the raw fabric. The line will use either metal rollers or natural-bristle rollers to raise the surface of the fabric; a dyeing cycle time of either 30 minutes or 40 minutes and a temperature of either 150 or 175 degrees Celsius. Four specimens of fabric will be subjected to each treatment and scored for quality. What are the factors and the treatments? How many units (fabric specimens) does the experiment require?

Experiments continued Experiments provide good evidence for causation (able to control lurking variables) Confounded variables – variable(s) associated with the response, but are not of interest; effects cannot be separated from the effect of the explanatory explanatory variable on the response Bias – systematically favors certain outcomes.

Experiments continued Randomization is very important in experiments…helps to ensure groups are as similar as possible. The three principles of Experimental Design are Control Randomize Repeat How can we randomize? Draw names out of a hat, use table of random digits, computer software (calculator), phone-random digit dialing

Using R to randomize First, you need to set the seed > set.seed(put seed number in here) Then sample >sample(seq(1:n),sample size,replace=FALSE) Assign class to two groups

Completely Randomized Design (with one treatment group and one control group) Group 1-Treatment Compare results Random Assignment Group 2- Control

More on Experiments Single blind – individual receiving treatment does not know what treatment they are receiving. Double blind – individual getting treatment and individual recording outcome do not know which treatment was administered.

Block design One way to control for confounding variables is to block on them. A block design first breaks the experimental units into blocks according to the “blocking variable” (for example, if one is blocking on gender, first place units into female and male “blocks”).

Example of Block Design The progress of a type of cancer differs in women and men. A clinical experiment to compare three therapies for this cancer therefore treats sex as a blocking variable. Two separate randomizations are done, one assigning the female subjects to the treatment and the other assigning the male subjects. Draw this design.

Matched Pairs Design A special type of block design is called Matched pairs design. Can only compare two treatments (hence the “pairs”). Block usually consists of units as similar as possible (self, twins, husband and wife).

Example of Matched Pairs design Does talking on a hands-free cell phone distract drivers? Undergraduate students “drove” in a high-fidelity driving simulator equipped with hands-free cell phone. Each student drove once while talking on the cell phone and once without talking on the cell phone. The order for each student was randomly assigned. The car ahead breaks: how quickly does the subject respond?

3.2 Sampling Design Voluntary response sample (call-in polls, comment cards) are very biased…bad sampling design. Want to get a probability sample. A probability sample is a sample chosen by chance (will look at four of them in this course).

Different Types of Probability samples SRS (Simple Random Sample) – every sample of size n has the same chance of being selected. Stratified random sample – first divide into groups (strata), and then take a SRS from each stratum. Cluster sample – first divide into clusters, and then take a SRS of clusters (once a cluster is chosen, every unit in that cluster is in the sample).

Probability samples continued Multistage sampling design –at each stage, a probability sample is obtained. Problems with sample surveys Undercoverage Nonresponse Response bias

Towards statistical inference Use information from sample (known information) to infer about the population (unknown) Statistics – information from a sample Parameter – information from a population Sampling variability – information from a sample will differ from one sample to the next. Sample statistics will have a predictable pattern (referred to as sampling distribution)

Bias and variability Figure 3.14 © 2009 W.H. Freeman and Company Introduction to the Practice of Statistics, Sixth Edition © 2009 W.H. Freeman and Company

Figure 3.15 © 2009 W.H. Freeman and Company Introduction to the Practice of Statistics, Sixth Edition © 2009 W.H. Freeman and Company

3.4 Continued Want statistics that are unbiased and have low variability. How can we eliminate or at least reduce the bias? Use a random sample and good instruments. How to increase precision? Larger sample Population size does not effect precision!!! Sample size does.

Statistical Significance Definition, pg 184 Introduction to the Practice of Statistics, Sixth Edition © 2009 W.H. Freeman and Company

Ethics Definition, pg 225 © 2009 W.H. Freeman and Company Introduction to the Practice of Statistics, Sixth Edition © 2009 W.H. Freeman and Company