Class One Before Class One Read To The Students: Statistical Thinking pages xx-xxv; Chapters 8, 9 & 10 For Class Two: Chapter 8: 34, 36, 38, 44, 46 Chapter.

Slides:



Advertisements
Similar presentations
Chapter 7: Data for Decisions Lesson Plan
Advertisements

GATHERING DATA Chapter Experiment or Observe?
CHAPTER 8: Producing Data: Sampling
Chapter 5 Producing Data
Chapter 3 Producing Data 1. During most of this semester we go about statistics as if we already have data to work with. This is okay, but a little misleading.
AP Statistics Chapter 5 Notes.
The Practice of Statistics
Section 5.1. Observational Study vs. Experiment  In an observational study, we observe individuals and measure variables of interest but do not attempt.
Chapter 1 Getting Started
Chapter 5 Data Production
Chapter 1: Introduction to Statistics
Experiments and Observational Studies. Observational Studies In an observational study, researchers don’t assign choices; they simply observe them. look.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 13 Experiments and Observational Studies.
BPS - 5th Ed. Chapter 81 Producing Data: Sampling.
Chapter 1: The Nature of Statistics
AP Statistics.  Observational study: We observe individuals and measure variables of interest but do not attempt to influence responses.  Experiment:
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 4 Designing Studies 4.1 Samples and Surveys.
CHAPTER 8: Producing Data: Sampling
Collection of Data Chapter 4. Three Types of Studies Survey Survey Observational Study Observational Study Controlled Experiment Controlled Experiment.
Chapter 5: Producing Data “An approximate answer to the right question is worth a good deal more than the exact answer to an approximate question.’ John.
Chapter 7: Data for Decisions Lesson Plan Sampling Bad Sampling Methods Simple Random Samples Cautions About Sample Surveys Experiments Thinking About.
Producing Data 1.
Data Collection: Sample Design. Terminology Observational Study – observes individuals and measures variables of interest but does not impose treatment.
CHAPTER 8: Producing Data Sampling ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 4 Designing Studies 4.2Experiments.
Designing Samples Chapter 5 – Producing Data YMS – 5.1.
Conducting A Study Designing Sample Designing Experiments Simulating Experiments Designing Sample Designing Experiments Simulating Experiments.
Lecture # 6:Designing samples or sample survey Important vocabulary Experimental Unit: An individual person,animal object on which the variables of interest.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
C HAPTER 5: P RODUCING D ATA Section 5.1 – Designing Samples.
Section 5.1 Designing Samples AP Statistics
AP STATISTICS LESSON AP STATISTICS LESSON DESIGNING DATA.
AP STATISTICS Section 5.1 Designing Samples. Objective: To be able to identify and use different sampling techniques. Observational Study: individuals.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 4 Designing Studies 4.2Experiments.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 4 Designing Studies 4.2Experiments.
 An observational study observes individuals and measures variable of interest but does not attempt to influence the responses.  Often fails due to.
Chapter 3 Producing Data. Observational study: observes individuals and measures variables of interest but does not attempt to influence the responses.
Chapter 7 Data for Decisions. Population vs Sample A Population in a statistical study is the entire group of individuals about which we want information.
1. What is one method of data collection? 2. What is a truly random way to survey/sample people?
Designing Studies In order to produce data that will truly answer the questions about a large group, the way a study is designed is important. 1)Decide.
1 Chapter 11 Understanding Randomness. 2 Why Random? What is it about chance outcomes being random that makes random selection seem fair? Two things:
Producing Data 1.
1.3 Experimental Design. What is the goal of every statistical Study?  Collect data  Use data to make a decision If the process to collect data is flawed,
Chapter 4 Designing Studies 4.1Samples and Surveys 4.2Experiments 4.3Using Studies Wisely.
Chapter 5 Data Production
CHAPTER 4 Designing Studies
CHAPTER 4 Designing Studies
CHAPTER 4 Designing Studies
Probability and Statistics
Section 5.1 Designing Samples
CHAPTER 4 Designing Studies
Producing Data Chapter 5.
CHAPTER 4 Designing Studies
Daniela Stan Raicu School of CTI, DePaul University
Day 1 Parameters, Statistics, and Sampling Methods
Chapter 5: Producing Data
CHAPTER 4 Designing Studies
Day 1 Parameters, Statistics, and Sampling Methods
CHAPTER 4 Designing Studies
CHAPTER 4 Designing Studies
CHAPTER 4 Designing Studies
Chapter 3 producing data
10/18/ B Samples and Surveys.
CHAPTER 4 Designing Studies
CHAPTER 4 Designing Studies
Designing Samples Section 5.1.
CHAPTER 4 Designing Studies
Probability and Statistics
Presentation transcript:

Class One Before Class One Read To The Students: Statistical Thinking pages xx-xxv; Chapters 8, 9 & 10 For Class Two: Chapter 8: 34, 36, 38, 44, 46 Chapter 9: 28, 48 Chapter 10: 32, 36 Read Chapters 1 & 2

Objectives for Class One Identify categorical and quantitative variables. Distinguish between a population and a sample. Discuss the advantages and disadvantages of various sampling techniques. Distinguish between an observational study and an experiment. Discuss the meaning of a random event and the probability of a given outcome. Discuss basic rules of probability.

Comments on the Course This course will focus on the interpretations of and uses for statistics not the calculation of statistics. –You need to understand enough about the calculations of numbers so that you are able to spot errors because technology is only as competent as the user. –The goal is to make you statistically literate not statisticians or actuaries. –I recommend that you use Microsoft Excel® for all of your calculations, diagrams and charts since this is probably the technology you will use in most workplaces.

“Mathematics is the language in which God has written the universe.” Galileo Galilei Mathematics and therefore statistics is the language we use to describe our sensory experiences of the universe. Mathematics is an axiomatic system i.e. a set of rules based on unproven assumptions. –Think of it as a game like monopoly. –You cannot prove the rules of monopoly true or false they just are. For example, you only get $200 for passing Go when playing the game. This rule has no meaning outside the game of Monopoly. Players who know and understand the rules well have an advantage and will beat players who are new to the game or unfamiliar with the rules. Unlike Monopoly though Mathematics can be used for other things than entertainment, but like Monopoly those who understand the rules have a distinct advantage in the game and understanding mathematics gives you a distinct advantage in the world.

The basic rules of statistics: data beats anecdotes: there is no such thing as proof by example beware the lurking variable: life is complicated where data comes from is important: beware the magician variation is everywhere: no two things are exactly alike conclusions are not certain: we are not God

When you plan a statistical study or evaluate someone else’s data ask yourself these questions. 1. Who? Who are the individuals described by the data and how many are there? 2. What? What variables (characteristics) and how many do the data contain? What are the units of measurement for these variables? 3.How? How did they collect the data? Did they follow the correct procedure to be able to draw conclusions from the data? 4.Will? Will this information allow us to draw the conclusion we want about the individuals? Can we answer the question Why? with the data we have created? 5.Why? Why were these individuals studied and or why did they collect information on these specific characteristics?

We will be studying these components of the game of statistics: data production: turning experiences into numbers with meaning; the Who?, What?, & How? of statistics data analysis: looking for patterns and trends; the Will? of statistics statistical inference: using the past and present to predict the future or using the past and present to make decisions; the Why? of statistics

Answering the question: Who? Everything starts with an Individual. Individuals: objects described by a set of data. Individuals can be people, objects or events. How you choose your individuals can make or break you. The process of choosing individuals is called sampling. Sampling involves two groups of individuals: –population: the entire group of individuals about whom we want to discover information. This group is usually so large that it is impractical or impossible to collect data from everyone. –sample: a group of people chosen to represent the entire population from whom we gather our information

Answering the question: Who? continued Sample Design: is a description of the process used to choose individuals to be included in the sample. –An effective sample design must eliminate bias which is the systematic favoring of an outcome. Bias taints your data and will weaken your ability to make accurate predictions and good decisions. –sample frame: the list of individuals from which a sample is actually selected Bad sample designs that always contain bias: –Convenience sample: a sample selected by taking the members of the population that are easiest to reach. Bias is created by the person taking the sample. –Voluntary response sample: consists of people who choose themselves by responding to a broad appeal. Bias is created by the individuals in the sample because only those with extreme opinions are likely to respond.

Answering the question: Who? continued If bias is the systematic favoring of one outcome then its opposite is when all outcomes are equally likely. This is the definition of a random event. Random events are governed by the impartial laws of mathematics and probability; therefore, they contain no bias and allow us to make accurate predictions and good decisions. Good sample designs that are random (probability samples): –SRS (Simple Random Sample): each individual in the sample frame is given a numerical label having the same number of digits. Either a table of random digits or software which can generate random digits is then used to select the individuals for the sample. –Systematic Random Sample: choose individuals from the sample frame at fixed intervals from a randomly chosen starting point –Stratified Random Sample: Individuals are first separated into groups (strata) of similar individuals, then and SRS is taken from each strata. –Multistage Sample: SRS’s are taken in succession to generate smaller and smaller subgroups of the initial sample frame.

Answering the question: Who? continued Problems can occur even with good sample designs: –undercoverage: some groups of the population are left out of the sampling process –nonresponse: individuals chosen for the sample can’t be contacted or refuse to participate –response bias: behavior of either the respondent or the interviewer can create a systematic error in the data. respondents may intentionally lie about behaviors that are unpopular or illegal respondents may unintentionally lie because the behavior occurred beyond their ability to remember accurately interviewer may word a question in such a way as to elicit a specific response (wording effect) interviewer may word a question in such a way that the respondent doesn’t understand the question or how to respond. (wording effect)

Answering the question: Who? continued Important points to remember: –sample results, no matter how good the sampling process, are only an estimate for the population –different samples will have different results because of variation –although we cannot know the exact values of the variables for the population because random sampling eliminates systematic bias and follows the laws of probability we can calculate the margin of error our results have giving us a range of probable values for the population –larger random samples give more accurate results than smaller samples

Answering the question: What? Attaching meaning to the numbers. Variable: a characteristic of an individual. Variables can take different values for different individuals. –Types of Variables categorical: places an individual into group by labeling them with an adjective. e.g. color, race, material, etc quantitative: associates the individual with a numerical value that we can perform mathematical operations with by adding, subtracting, multiplying or dividing. –Problems with variables: confounding: when the effects of two variables cannot be distinguished from each other. lurking: a variable which is not measured and for which there is no data but which has a significant impact on the outcome

Answering the question: How? Quantifying our experiences. There are two basic ways to collect data: –observational study: observes individuals and measures variables but does not attempt to influence responses cannot gauge the effect of an intervention purpose is to describe a group or situation –experiment: deliberately imposes treatments on individuals in order to observe their responses allows us to explore and understand cause and effect relationships good experimental designs will attempt to eliminate confounding There are two categories for our variables in studies and experiments: –response: measures the outcome of a study or experiment –explanatory: may explain or influence changes in the response variable.

Answering the question: How? A closer look at experiments. General Principles of experimental design: –control: minimizes the effects of confounding and lurking variables on the response. Methods of control include: comparison: measuring the differences in the response to several treatments randomization: uses chance to assign subjects to treatments creating similar groups experimental blindness: avoids unconscious bias by preventing either subject or experimenter from knowing the differences in treatments. Blindness is often used with a placebo treatment. –sample design: use enough subjects in each group to reduce chance variation in the results. The greater the number of individuals you look at the better your decisions and predictions will be. –statistical significance: an observed effect is so large that it would rarely occur by chance –realism: the subjects, treatments and setting of an experiment should as closely as possible duplicate the conditions that are being studied. There is always a trade off between realism and control.

Answering the question: How? A closer look at experiments. Elements of experimental designs: –subjects: the individuals studied in an experiment selected by our sample design –factors: the explanatory variables in an experiment –treatment: any specific experimental condition applied to the subjects or a combination of specific values for the factors Experimental outlines: is a flowchart summarizing the elements of the experiment and identifying the response variable. sample design experimental groups: size, number, description treatments: number and specific values of factors response variable

Answering the question: How? A closer look at experiments. Types of Experiments: –comparative experiments: compares the results of the experimental group undergoing the treatment with a control group who receives a placebo treatment as a method of controlling lurking variables –randomized comparative experiments: subjects are assigned to either experimental or control groups by the use of randomization not by characteristics of the subjects or judgments of the experimenters –double blind designs: neither the subjects nor the people who work with the subjects know which treatment the subjects are receiving –matched pairs designs: compares just two treatments on two subjects who are as closely matched as possible or the same treatment is given to one subject in a random order –block designs: the random assignment of subjects to treatments is carried out separately within each block, which is a group of individuals that are known before the experiment to be similar in some way that is expected to affect the response to the treatment

Probability: The mathematical foundation of the 5 questions. Random samples eliminate bias from the act of choosing a sample but they can still lead to incorrect results due to variability when we choose at random. Chance behavior is unpredictable in the short run but has a regular and predictable pattern in the long run. random outcome: individual outcomes are uncertain but there is nonetheless a regular distribution of outcomes in a large number of repetitions –the probability of a random phenomenon is the proportion of times the outcome would occur in a very long series of repetitions –independent trials: the outcome of one trial must not influence the outcome of any other

Probability Models sample space S: is the set of all possible outcomes event: is an outcome or set of outcomes of a random phenomenon or it is a subset of the sample space probability model: is a mathematical description of a random phenomenon consisting of two parts: a sample space S and a way of assigning probabilities to events –discrete: the random variable described has a finite list of possible outcomes –continuous: the random variable described can take on any value in an interval with probabilities given as areas under a density curve personal probability: is a number between 0 and 1 that expresses an individual’s judgment about how likely an outcome is

Rules of Probability the probability of an event must always be between 0 and 1 inclusive all possible outcomes, the sample space, together must have a probability of 1 the probability that an event does not occur, called its complementary event, is 1 minus the probability of the event or the probabilities of complementary events must add to 1 if two events have no outcomes in common(disjoint events), the probability that one or the other occurs, their union, is the sum of their individual probabilities because density curves have an area equal to 1 they can be used for assigning probabilities to events that they describe

Objectives for Class One Identify categorical and quantitative variables. Distinguish between a population and a sample. Discuss the advantages and disadvantages of various sampling techniques. Distinguish between an observational study and an experiment. Discuss the meaning of a random event and the probability of a given outcome. Discuss basic rules of probability.

Class One For Class Two: Chapter 8: 34, 36, 38, 44, 46 Chapter 9: 28, 48 Chapter 10: 32, 36 Read Chapters 1 & 2