Statistics Through Applications

Slides:



Advertisements
Similar presentations
© 2011 Pearson Education, Inc
Advertisements

Chapter 10: Estimating with Confidence
 These 100 seniors make up one possible sample. All seniors in Howard County make up the population.  The sample mean ( ) is and the sample standard.
Chapter 8: Estimating with Confidence
Section 2.2: What do samples tell us?.
 What makes a sample bad? Examples…  What makes a sample good? How can we ensure good samples?
Chapter 12 Sample Surveys.
Statistics : Statistical Inference Krishna.V.Palem Kenneth and Audrey Kennedy Professor of Computing Department of Computer Science, Rice University 1.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 13 Experiments and Observational Studies.
Chapter 3 Producing Data 1. During most of this semester we go about statistics as if we already have data to work with. This is okay, but a little misleading.
Sampling Distributions What is a sampling distribution?
N The Experimental procedure involves manipulating something called the Explanatory Variable and seeing the effect on something called the Outcome Variable.
Chapter 10: Estimating with Confidence
Experiments and Observational Studies.  A study at a high school in California compared academic performance of music students with that of non-music.
What Is the Probability? The chance that something will happen is called probability. For example, if you toss a penny, what is the probability that it.
Assessment Statements  The Internal Assessment (IA) Rubric IS the assessment statement.
Chapter 7 Sampling Distributions
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
Copyright © 2010 Pearson Education, Inc. Chapter 13 Experiments and Observational Studies.
Experiments and Observational Studies. Observational Studies In an observational study, researchers don’t assign choices; they simply observe them. look.
From Sample to Population Often we want to understand the attitudes, beliefs, opinions or behaviour of some population, but only have data on a sample.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 13 Experiments and Observational Studies.
Statistics The science of collecting, analyzing, and interpreting data. The Statistical Problem Solving Process: 1.Ask a question of interest 2.Produce.
Chapter 11: Estimation Estimation Defined Confidence Levels
Chapter 8 Introduction to Inference Target Goal: I can calculate the confidence interval for a population Estimating with Confidence 8.1a h.w: pg 481:
PARAMETRIC STATISTICAL INFERENCE
Basic Statistics Concepts Marketing Logistics. Basic Statistics Concepts Including: histograms, means, normal distributions, standard deviations.
Section 8.1 Estimating  When  is Known In this section, we develop techniques for estimating the population mean μ using sample data. We assume that.
+ Warm-Up4/8/13. + Warm-Up Solutions + Quiz You have 15 minutes to finish your quiz. When you finish, turn it in, pick up a guided notes sheet, and wait.
© 2010 Pearson Prentice Hall. All rights reserved Chapter Sampling Distributions 8.
© 2010 Pearson Prentice Hall. All rights reserved Chapter Sampling Distributions 8.
2 Types of Studies Observational Study Experimental Study.
Section 2 Part 2.   Population - entire group of people or items for which we are collecting data  Sample – selections of the population that is used.
Statistics Through Applications Chapter 1: How Do We Get “Good” Data? Copyright © 2004 by W. H. Freeman & Company.
Inductive Generalizations Induction is the basis for our commonsense beliefs about the world. In the most general sense, inductive reasoning, is that in.
Psychological Research Strategies Module 2. Why is Research Important? Gives us a reliable, systematic way to consider our questions Helps us to draw.
LT 4.2 Designing Experiments Thanks to James Jaszczak, American Nicaraguan School.
© 2010 Pearson Prentice Hall. All rights reserved 8-1 Objectives 1.Describe the distribution of the sample mean: samples from normal populations 2.Describe.
Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 7 Sampling Distributions.
Regression Analysis A statistical procedure used to find relations among a set of variables.
Gathering Useful Data. 2 Principle Idea: The knowledge of how the data were generated is one of the key ingredients for translating data intelligently.
Sampling Distributions and Inference for Proportions(C18-C22 BVD) C18: Sampling Distributions.
Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall 9-1 σ σ.
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a measure of the population. This value is typically unknown. (µ, σ, and now.
Introduction Suppose that some students from the junior class will be chosen to receive new laptop computers for free as part of a pilot program. You hear.
1 Module One: Measurements and Uncertainties No measurement can perfectly determine the value of the quantity being measured. The uncertainty of a measurement.
CONFIDENCE INTERVALS: THE BASICS Unit 8 Lesson 1.
Sampling Theory and Some Important Sampling Distributions.
Sampling Distributions: Suppose I randomly select 100 seniors in Anne Arundel County and record each one’s GPA
Uncertainty2 Types of Uncertainties Random Uncertainties: result from the randomness of measuring instruments. They can be dealt with by making repeated.
7.1 What is a Sampling Distribution? Objectives SWBAT: DISTINGUISH between a parameter and a statistic. USE the sampling distribution of a statistic to.
8.1 Confidence Intervals: The Basics Objectives SWBAT: DETERMINE the point estimate and margin of error from a confidence interval. INTERPRET a confidence.
1 Chapter 11 Understanding Randomness. 2 Why Random? What is it about chance outcomes being random that makes random selection seem fair? Two things:
Chapter 8: Estimating with Confidence
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a number that describes the population. In statistical practice, the value of.
Section Parameter v. Statistic 2 Example 3.
Bias and Variability Lecture 27 Section 8.3 Wed, Nov 3, 2004.
WHAT IS A SAMPLING DISTRIBUTION? Textbook Section 7.1.
Do Activity 5.2A. Statistic: * A number that describes a sample Parameter: A number that describes a population EXAMPLE: A polling agency takes a sample.
Intro to Probability and Statistics 1-1: How Can You Investigate Using Data? 1-2: We Learn about Populations Using Samples 1-3: What Role Do Computers.
Copyright ©2011 Brooks/Cole, Cengage Learning Gathering Useful Data for Examining Relationships Observation VS Experiment Chapter 6 1.
AP Statistics. Issues Interpreting Correlation and Regression  Limitations for r, r 2, and LSRL :  Can only be used to describe linear relationships.
Class Six Turn In: Chapter 15: 30, 32, 38, 44, 48, 50 Chapter 17: 28, 38, 44 For Class Seven: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 Read.
Sampling Distributions
Unit 8 Statistics Quiz Review
Day 6: Where Do Data Come From (Cont) Measuring
Combining Effect Sizes
1.) Come up with 10 examples of how statistics are used in the real life. Be specific and unique. 2.) Video.
Comparing Two Proportions
Comparing Two Proportions
Presentation transcript:

Statistics Through Applications How Do We Get “Good” Data?

Good Data isn't Based on an Anecdote Using anecdotal evidence is relying on an isolated example or experience to make a decision Good data should come from many varied examples and be non-partial Anecdotes usually appeal to our emotions and fool us into belief while statistics are dry but much more reliable

Good Data is Compared Fairly Often a rate expressed as a percent or fraction is a more valid measure than a simple count of occurrences Two schools both had 1900 students pass TAKS. One school has 2000 students and the other has 2500. Did they perform equally as well?

Good Data needs to be Communicated and Read Carefully An advertisement for a home security system says, “When you go on vacation, burglars go to work. According to FBI statistics, over 26% of home burglaries take place between Memorial Day and Labor Day. Beware - summertime is burglary time!” Only one in two cameras is actually in operation, but this could soon increase to as many as one in three.Watford Observer, 2 August 2002 Continental Airlines once advertised that it had “decreased lost baggage by 100% in the past six months.”

Results from a Gallup poll, taken May 29-31, 2009, with a 3% margin of error. Can we conclude that more Americans have a favorable impression of Dick Cheney than Nancy Pelosi? What can we conclude from these graphs?

Good Data is Valid, Unbiased & Reliable Valid – relevant and appropriate Unbiased – not consistently different from  actuality in one direction Reliable – as little variation as possible

Even Good Data Varies How Long is a Minute? How accurate are you and your classmates at knowing how long a minute is? Get a partner and a stopwatch. You will take turns timing and guessing. Using the stopwatch, the timer tells the guesser when to start. When the guesser believes that a minute has passed, he says “Stop.” At that point, the timer stops the stopwatch and records the time that passed to the nearest tenth of a second. Do not tell your partner how much time actually passed! Reset the stopwatch and switch roles. Continue timing and measuring until each person has been timed three times.

Analyzing How Long is a Minute? Was your data valid? Was either partner’s data biased? Which partner was more reliable? How about the class as a whole? Add your data (3 from each of you) to the class list and graph. Then figure the average of your 3 measurements and add it to the other graph.

All data varies, but we can use Averages to Improve Reliability No measuring process is perfectly reliable. The average of several repeated measurements of the same individual is more reliable (and less variable) than a single measurement.

Goal: The least amount of variability and bias possible!

How do we achieve our goal? To reduce bias, use random sampling. A random sample should represent the population, and therefore give unbiased results. To reduce variability, use a larger sample. Increasing the sample size will almost always give an average estimate that is close to the truth.

Good Data shouldn't be confounding Just because two variables have a relationship, that doesn’t mean one causes the other. There could be a confounding variable at play. A confounding variable is an additional variable that effects the response but isn't separated out. Confounding variables are most often found in observational studies comparing a characteristic of two groups or poorly designed experiments . Sometimes the media ignores confounding variables and misinterprets results from observational studies reporting “proven” links when in statistics we only have shown evidence of a relationship.

Consider this data – is there a relationship? Year Number of Methodist Ministers in New England Cuban Rum Imported to Boston (in barrels) 1860 63 8,376 1865 48 6,406 1870 53 7,005 1875 64 8,486 1880 72 9,595 1885 80 10,643 1890 85 11,265 1895 76 10,071 1900 10,547 1905 83 11,008 1910 105 13,885 1915 140 18,559 1920 175 23,024 1925 183 24,185 1930 192 25,434 1935 221 29,238 1940 262 34,705

Another confounding example A study sites that a group of children who had certain vaccinations were more likely to develop autism than a group of children who did not receive those same vaccinations.  Does this mean that vaccinations cause autism? No, since the children were not randomly assigned to get vaccinations or not, there could be another confounding variable at play.  Perhaps the parents that chose vaccinations also made some other choice that increased the risk of austism or were genetically inclined to have a higher risk of autistic children