Producing Data http://www.cartoonstock.com/directory/d/data_gathering.asp.

Slides:



Advertisements
Similar presentations
DESIGNING EXPERIMENTS
Advertisements

Part 2 CHAPTER 6.1. DESIGNING AN EXPERIMENT The first goal in designing an experiment is to ensure that it will show us the effect of the explanatory.
Chapter 7: Data for Decisions Lesson Plan
Chapter 5 Producing Data
Chapter 3 Producing Data 1. During most of this semester we go about statistics as if we already have data to work with. This is okay, but a little misleading.
AP Statistics Chapter 5 Notes.
The Practice of Statistics
Section 5.1. Observational Study vs. Experiment  In an observational study, we observe individuals and measure variables of interest but do not attempt.
Chapter 5 Data Production
Chapter 1: Introduction to Statistics
AP Statistics.  Observational study: We observe individuals and measure variables of interest but do not attempt to influence responses.  Experiment:
Collection of Data Chapter 4. Three Types of Studies Survey Survey Observational Study Observational Study Controlled Experiment Controlled Experiment.
MATH 2400 Chapter 9 Notes. Observation vs. Experiment An observational study observes individuals and measures variables of interest but does not attempt.
Chapter 5: Producing Data “An approximate answer to the right question is worth a good deal more than the exact answer to an approximate question.’ John.
LT 4.2 Designing Experiments Thanks to James Jaszczak, American Nicaraguan School.
Chapter 7: Data for Decisions Lesson Plan Sampling Bad Sampling Methods Simple Random Samples Cautions About Sample Surveys Experiments Thinking About.
Section 5.1 Designing Samples Malboeuf AP Statistics, Section 5.1, Part 1 3 Observational vs. Experiment An observational study observes individuals.
Producing Data 1.
Producing Data 1.
Data Collection: Sample Design. Terminology Observational Study – observes individuals and measures variables of interest but does not impose treatment.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 4 Designing Studies 4.2Experiments.
Designing Samples Chapter 5 – Producing Data YMS – 5.1.
Objectives (BPS chapter 9) Producing data: experiments  Experiments  How to experiment badly  Randomized comparative experiments  The logic of randomized.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 4 Designing Studies 4.2Experiments.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
AP Review #4: Sampling & Experimental Design. Sampling Techniques Simple Random Sample – Each combination of individuals has an equal chance of being.
Conducting A Study Designing Sample Designing Experiments Simulating Experiments Designing Sample Designing Experiments Simulating Experiments.
Lecture # 6:Designing samples or sample survey Important vocabulary Experimental Unit: An individual person,animal object on which the variables of interest.
Section 5.1 Designing Samples AP Statistics
CHAPTER 9: Producing Data: Experiments. Chapter 9 Concepts 2  Observation vs. Experiment  Subjects, Factors, Treatments  How to Experiment Badly 
BY: Nyshad Thatikonda Alex Tran Miguel Suarez. How to use this power point 1) Click on the box with the number. Best to click on the black part and not.
AP STATISTICS LESSON AP STATISTICS LESSON DESIGNING DATA.
Section 5.2 Designing Experiments. Observational Study - Observes individuals and measures variables of interest but DOES NOT attempt to influence the.
AP STATISTICS Section 5.1 Designing Samples. Objective: To be able to identify and use different sampling techniques. Observational Study: individuals.
Producing Data (C11-13 BVD) C13: Experiments and Observational Studies.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 4 Designing Studies 4.2Experiments.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 4 Designing Studies 4.2Experiments.
Chapter 3 Producing Data. Observational study: observes individuals and measures variables of interest but does not attempt to influence the responses.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 4 Designing Studies 4.2Experiments.
Chapter 7 Data for Decisions. Population vs Sample A Population in a statistical study is the entire group of individuals about which we want information.
1. What is one method of data collection? 2. What is a truly random way to survey/sample people?
Producing Data: Experiments BPS - 5th Ed. Chapter 9 1.
Designing Studies In order to produce data that will truly answer the questions about a large group, the way a study is designed is important. 1)Decide.
CHAPTER 9: Producing Data Experiments ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
1 Chapter 11 Understanding Randomness. 2 Why Random? What is it about chance outcomes being random that makes random selection seem fair? Two things:
Chapter 3 Generating Data. Introduction to Data Collection/Analysis Exploratory Data Analysis: Plots and Measures that describe a set of measurements.
CHAPTER 9: Producing Data Experiments ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Ten things about Experimental Design AP Statistics, Second Semester Review.
Producing Data 1.
Experiments Textbook 4.2. Observational Study vs. Experiment Observational Studies observes individuals and measures variables of interest, but does not.
Chapter 5 Data Production
Sampling and Experimentation
Arrangements or patterns for producing data are called designs
CHAPTER 4 Designing Studies
Producing Data, Randomization, and Experimental Design
Producing Data, Randomization, and Experimental Design
Section 5.1 Designing Samples
Chapter 3 Producing Data
Arrangements or patterns for producing data are called designs
Producing Data Chapter 5.
Ten things about Experimental Design
Daniela Stan Raicu School of CTI, DePaul University
Chapter 4: Designing Studies
Day 1 Parameters, Statistics, and Sampling Methods
Statistical Reasoning December 8, 2015 Chapter 6.2
Chapter 5: Producing Data
Day 1 Parameters, Statistics, and Sampling Methods
Chapter 4: Designing Studies
Chapter 3 producing data
Designing Samples Section 5.1.
Presentation transcript:

Producing Data http://www.cartoonstock.com/directory/d/data_gathering.asp

3.1: Sources of Data - Goals Identify anecdotal data in a specific situation and explain why we should not use this type of data. Define what is meant by available data. Distinguish between experiments and observational studies.

Anecdotal Data Anecdotal data represent individual cases that often come to our attention because they are striking in some way. A woman who was deaf from birth was hit by lightning and regained her hearing. Does this mean that lightning is a cure for deafness?

Available Data Available data are data that were produced in the past for some other purpose but that may help answer a present question inexpensively. The library and the Internet are sources of available data.

Definitions Population Sample Census

Observational Studies and Experiments An experiment deliberately imposes some treatment on individuals to measure their responses. An observational study observes individuals and measures variables of interest but does not attempt to influence the responses.

Observational Data In an article published in the Journal of the American Veterinary Medical Association, Whitney and Mehlaff (1987) presented results on the injury rates of cats that had plummeted from buildings in New Your City according to the number of floors that they had fallen. The researches merely recorded the number of injuries from the cats that were brought into the vet. No cats were thrown from the windows – the cats did it to themselves! The Analysis of Biological Data, Whitlock, Schluter, 2009, Roberts and Company, p. 3

Causality A lurking variable is a variable that is not among the explanatory or response variables in a study but that may influence the response variable. Confounding occurs when two variables are associated in such a way that their effects on a response variable cannot be distinguished from each other.

Lurking Variables 1. For children, there is an extremely strong correlation between shoe size and math scores. 2. There is a very strong correlation between ice cream sales and number of deaths by drowning. 3. There is very strong correlation between number of churches in a town and number of bars in a town.

3.2: Design of Experiment - Goals In experiments, identify the units, subjects, treatments and outcomes. Identify a comparative experiment and explain why they are used. Identify bias in an experiment. Be able to apply the basic principles of experimental design: compare, randomize and repeat. Be able to identify matched pairs design and block design and explain why they are used.

Terms used in experiments Experimental units Treatments Outcome Statistically significant An observed effect so large that it would rarely occur by chance.

Example: Terminology of Experimental Design For each of the following, define the experimental unit, factor, levels, response variable and what would be statistically significant. In a study of cell phone usage by college students, we want to know how much time is being spend on different types of apps on the phones. In a study of sickle cell anemia, 150 patients were given the drug hydroxyurea, and 150 were given a placebo (dummy pill). The researchers counted the episodes of pain in each subject.

Principles of Experimental Design Control: Compare two or more treatments. Randomize: use chance to assign experimental units to treatments. Replication: Use enough experimental units in each group to reduce chance variation in the results.

Comparative Experiments Experimental units Treatment Measure response Control Measure response Compare results Experimental units Treatment Measure response

Bias The design of a study is biased if it systematically favors certain outcomes. An investigator is interested if the size of fish versus the size of the lake that they are in. A number of lakes are chosen in a particular area and the investigator uses nets to catch the fish.

Principles of Experimental Design Control: Compare two or more treatments, Randomize: use chance to assign experimental units to treatments. Replication: Use enough experimental units in each group to reduce chance variation in the results.

Randomized Experiments In a completely randomized design, the treatments are assigned to all the experimental units completely by chance. Group 1 Group 2 Treatment 1 Treatment 2 Compare results Experimental units Random assignment

Randomization Label each of the N individuals. Put the N numbers into a hat. Draw the numbers one at a time until you have n individuals.

Principles of Experimental Design Control: Compare two or more treatments, Randomize: use chance to assign experimental units to treatments. Replication: Use enough experimental units in each group to reduce chance variation in the results.

Cautions about Experimentation Bias Generalization

Lack of Realism Is studying the effects of eating red meat on cholesterol values in a group of middle aged men a realistic way to study factors affecting heart disease problems in humans? Is studying the effects of hair spray on rats a realistic way to determine what will happen to women with large amounts of hair?

Other Experimental Designs A matched pair design is when each experimental unit is matched with another one. A block is a group of experimental units that are similar. In a block design, the random assignment of experimental units to treatments is carried out within each block. Control what you can, block what you can’t control, and randomize to create comparable groups.

3.3: Sampling Design - Goals Be able to differentiate between a population and a sample. Be able to determine and explain when a probability sample, simple random sample, a stratified random sample, or a multistage random sample is the preferred method. Be able to state when there is a response bias with obtaining a sample: voluntary response, convenience sample, nonresponse

Population and Sample Population: the entire group of objects that we want information about. Sample: the part (subset) of the population that we actual examine.

Sampling Methods Probability Sample Simple Random Sample (SRS) Stratified Random Sample Multistage Random Sample

SRS SRS of size n consists of n objects from the population chosen in such a way that every set of n individuals has an equal chance to be the sample actually selected. Method Label every object from 1 to n. Generate random numbers to select the objects. You are done when you have selected n different objects.

Stratified Random Samples Procedure Divide the population into groups into strata. Choose a SRS fro each group.

Multistage Sampling Divide sample into primary sampling units (PSU) Use SRS to select n PSUs Divide each SRS into smaller units. In each smaller unit, use stratified sampling. Choose smaller units close to each other.

Response Bias Convenience sample Voluntary response Undercoverage Nonresponse Your random sample needs to be representative of the population.

3.4: Toward Statistical Inference- Goals State what is meant by statistical inference and its relationship to probability. Identify parameters and statistics and the relationships between them. Be able to define what a sampling distribution is. Identify bias in a statistic by examining its sampling distribution. Describe the relationship between the sample size and the variability of a statistic. Identify ways to reduce bias and variability of a statistic.

Statistical Inference Sampling

Parameter and statistic Parameter: number describing a characteristic of the population. Statistic: number describing a characteristic of the sample.

? Sampling Variability What would happen if we took many samples? Population Sample ?

Sampling Distribution The sampling distribution of a statistic consist of all possible values of the statistic and the relative frequency with which each value occurs.

Bias vs. Variability

Managing Bias and Variability To reduce bias, use random sampling. To reduce variability of a statistic from an SRS, use a larger sample

Statistical Inference Sample has to be representative of the population Randomize The experiment has to be performed in such a way that you can obtain the data that you are interested in. Perform the correct analysis.

Example 3.34: Provide All the Critical Information Papers reporting scientific research are supposed to be short, with no extra baggage. Brevity, however, can allow researchers to avoid complete honesty about their data. Did they choose their subjects in a biased way? Did they report data on only some of their subjects? Did they try several statistical analyses and report only the ones that looked best? The statistician John Bailar screened more than 4000 medical papers in more than a decade as consultant to the New England Journal of Medicine. He says, “When it came to the statistical review, it was often clear that critical information was lacking, and the gaps nearly always had the practical effect of making the authors’ conclusions look stronger than they should have.” The situation is no doubt worse in fields that screen published work less carefully.

In-class discussion Should we allow this personal information to be collected? A government agency takes a random sample of income tax returns to obtain information on the average income of people in different occupations. Only the incomes and occupations are recorded from the returns, not the names. A social psychologist attends public meetings of a religious group to study the behavior patterns of members, A social psychologist pretends to be converted to membership in a religious group and attends private meetings to study the behavior patterns of members.