MAT 1000 Mathematics in Today's World. Last Time 1.Collecting data with experiments 2.Practical problems with experiments.

Slides:



Advertisements
Similar presentations
The meaning of Reliability and Validity in psychological research
Advertisements

Authority 2. HW 8: AGAIN HW 8 I wanted to bring up a couple of issues from grading HW 8. Even people who got problem #1 exactly right didn’t think about.
Reliability and Validity
MAT 1000 Mathematics in Today's World. Last Time 1.What does a sample tell us about the population? 2.Practical problems in sample surveys.
5.1 day 2 Simulations! .
Chapter 12 Sample Surveys
Section 2.2: What do samples tell us?.
Analysis. Start with describing the features you see in the data.
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. Relationships Between Quantitative Variables Chapter 5.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 19 Confidence Intervals for Proportions.
Confidence Intervals for Proportions
Stat 512 – Lecture 14 Analysis of Variance (Ch. 12)
Section Decision Making with Data  NOT ALL DATA IS GOOD DATA!  “Do not put faith in what statisticians say until you have carefully considered.
MA 102 Statistical Controversies Wednesday, February 13, 2002 Today: Discuss of data ethics, and exercises Measuring: Turning ideas to numbers Exercise.
Copyright © 2010 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
FINAL REPORT: OUTLINE & OVERVIEW OF SURVEY ERRORS
Experiments and Observational Studies.  A study at a high school in California compared academic performance of music students with that of non-music.
MAT 1000 Mathematics in Today's World. Last Time We saw how to use the mean and standard deviation of a normal distribution to determine the percentile.
AP Statistics Overview and Basic Vocabulary. Key Ideas The Meaning of Statistics Quantitative vs. Qualitative Data Descriptive vs. Inferential Statistics.
Chapter 7 Sampling Distributions
1 Sampling Distributions Presentation 2 Sampling Distribution of sample proportions Sampling Distribution of sample means.
Sampling : Error and bias. Sampling definitions  Sampling universe  Sampling frame  Sampling unit  Basic sampling unit or elementary unit  Sampling.
Misleading Uses of Data. WOULD YOU BUY THIS PRODUCT? WHY MIGHT SOMEONE BUY IT?
Near East University Department of English Language Teaching Advanced Research Techniques Correlational Studies Abdalmonam H. Elkorbow.
From Sample to Population Often we want to understand the attitudes, beliefs, opinions or behaviour of some population, but only have data on a sample.
Statistics: Basic Concepts. Overview Survey objective: – Collect data from a smaller part of a larger group to learn something about the larger group.
Simulating a Sample Distribution
Scatterplots. Learning Objectives By the end of this lecture, you should be able to: – Describe what a scatterplot is – Be comfortable with the terms.
Chapter 7 Statistical Inference: Confidence Intervals
Chapter 8 Measuring Chapter 81. Measurement We measure a property of a person or thing when we assign a number to represent the property. We often use.
CSD 5100 Introduction to Research Methods in CSD Observation and Data Collection in CSD Research Strategies Measurement Issues.
By C. Kohn Waterford Agricultural Sciences.   A major concern in science is proving that what we have observed would occur again if we repeated the.
The Scientific Method/Process By Mr. Victor M. Calzada.
An Overview of Statistics
Chapter 6 Lecture 3 Sections: 6.4 – 6.5.
The What and the Why of Statistics The Research Process Asking a Research Question The Role of Theory Formulating the Hypotheses –Independent & Dependent.
ESSENTIAL QUESTION: HOW CAN WE USE MATH TO PREDICT THE FUTURE? 7.1 Fitting Data to a Line.
1  Specific number numerical measurement determined by a set of data Example: Twenty-three percent of people polled believed that there are too many polls.
Chapter 81 Measuring. Do Now 11/6/13 u Complete the three thought questions for Chapter 8 u Staple and submit your homework. –(Chapter 8, Prime numbered.
Psychological Research Strategies Module 2. Why is Research Important? Gives us a reliable, systematic way to consider our questions Helps us to draw.
Chapter 1: The What and the Why of Statistics  The Research Process  Asking a Research Question  The Role of Theory  Formulating the Hypotheses  Independent.
MAT 1000 Mathematics in Today's World. Last Time 1.Two types of observational study 2.Three methods for choosing a sample.
Uncertainty & Error “Science is what we have learned about how to keep from fooling ourselves.” ― Richard P. FeynmanRichard P. Feynman.
Sampling Distribution WELCOME to INFERENTIAL STATISTICS.
The z test statistic & two-sided tests Section
 Producing Data: Experiments Vs. Surveys Chapter 5.
Copyright © 2010 Pearson Education, Inc. Chapter 19 Confidence Intervals for Proportions.
Chapter 8 Parameter Estimates and Hypothesis Testing.
Part III – Gathering Data
Population size * Population size doesn’t matter as long as…
An Overview of Statistics NOTES Coach Bridges What you should learn: The definition of data and statistics How to distinguish between a population and.
Statistical Questioning Lesson After completing this lesson, you will be able to say: I can recognize and write a statistical question. I can recognize.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Chapter 7 Measuring of data Reliability of measuring instruments The reliability* of instrument is the consistency with which it measures the target attribute.
Chapter 6 Lecture 3 Sections: 6.4 – 6.5. Sampling Distributions and Estimators What we want to do is find out the sampling distribution of a statistic.
How Science Works Precision How small a measurement is. Millimetres are more precise than centimetres because they are smaller.
Intro to Inference & The Central Limit Theorem. Learning Objectives By the end of this lecture, you should be able to: – Describe what is meant by the.
8.1 Confidence Intervals: The Basics Objectives SWBAT: DETERMINE the point estimate and margin of error from a confidence interval. INTERPRET a confidence.
Chapter 31 What Do Samples Tell Us?. Chapter 32 Thought Question 1 During a medical exam, the doctor measures your cholesterol two times. Do you think.
Population Distributions vs. Sampling Distributions There are actually three distinct distributions involved when we sample repeatedly andmeasure a variable.
Stat 100 Mar. 27. Work to Do Read Ch. 3 and Ch. 4.
Probability and Statistics “Critical Thinking”. Flawed statistics  Statistics are often misused and flawed either because of: (1) Evil intent on the.
Choosing and using your statistic. Steps of hypothesis testing 1. Establish the null hypothesis, H 0. 2.Establish the alternate hypothesis: H 1. 3.Decide.
Introduction Data may be presented in a way that seems flawless, but upon further review, we might question conclusions that are drawn and assumptions.
Day 6: Where Do Data Come From (Cont) Measuring
Bellwork.
1.) Come up with 10 examples of how statistics are used in the real life. Be specific and unique. 2.) Video.
Confidence Intervals for Proportions
Confidence Intervals for Proportions
STAT 1301 Tests of Significance about Averages.
Presentation transcript:

MAT 1000 Mathematics in Today's World

Last Time 1.Collecting data with experiments 2.Practical problems with experiments

Today Measurement Validity Error Bias Variability

Measurement Recall structure of data: individuals and variables Measurement is the assignment of a value to a variable.

Measurement Example Unemployment rate Individuals: working age American adults Variable: employment status Values: employed or unemployed. To measure, ask an individual: “are you employed?”

Measurement Example Heights of MAT 1000 students Individuals: MAT 1000 students Variable: height Values: numeric To measure, use a measuring tape, yard stick, etc.

Measurement An instrument is a device or method for taking a measurement Measuring height: different possible instruments, like measuring tape or yard sticks Unemployment: the instrument is the question asked

Measurement Some instruments give measurements in “units.” Tape measure could give height measured in inches, or in centimeters.

Validity Are we really measuring the attribute we are interested in? Example University admission boards want to measure how prepared applicants are for college. One way this can be measured is with standardized test scores (SAT or ACT score)

Validity A valid measurement is an appropriate representation of the property being measured. An extreme example: measure college readiness by height. Clearly not valid.

Validity But are ACT test scores even a valid measurement of college readiness? Some people argue that they are not—that ACT scores don’t tell us if someone is ready for college.

Validity This is a common scenario. The question we are interested in is somewhat vague, and we have to try and find an appropriate (valid) way to measure. How else could we measure “college readiness”?

Predictive Validity Can we get evidence that a particular measurement like ACT scores is valid? Yes. We look if ACT scores can predict how well an applicant will do in college. If a measurement can predict success on a related task, it has predicitive validity.

Predictive Validity If applicants with ACT scores tend to do well in college (i.e. high GPA), this gives evidence that the ACT is a valid measurement of college readiness. Conclusion: one way to decide if a measurement is valid is to ask if it has predictive validity. How else can we decide?

Rates Measuring highway safety. One method is to count deaths from car accidents. In 1994 there were 40,676 traffic fatalities in the U.S. In 2003 there were 42,643. Was driving becoming less safe?

Rates What else changed from 1994 to 2004? Higher population: 260 million to 290 million More drivers: 175 million to 196 million More miles driven: 2,359 billion to 2,890 billion So is it valid to compare highway safety from 1993 to 2003 using the number of traffic fatalities? No.

Rates A more valid measurement is to use a rate, like traffic fatalities per number of drivers, or traffic fatalities per mile drive. 1993: fatalities per 100,000 drivers 2003: fatalities per 100,000 drivers 1993: 1.7 fatalities per 100,000,000 miles 2003: 1.48 fatalities per 100,000,000 miles

Rates 2012 (most recent year available): 33,561 fatalities, compared to 42,643 in Are we safer on the road? Yes, but we have to use rates to see this Even though the count is lower in 2012, that doesn’t make it a valid measurement.

Rates Rates from 2003 to 2012: 2003: fatalities per 100,000 drivers 2012: fatalities per 100,000 drivers 2003: 1.48 fatalities per 100,000,000 miles 2012: 1.13 fatalities per 100,000,000 miles

Validity Is a measurement valid? Does it have predictive validity? If we use a count to measure, would a rate perhaps be better? But, even if we believe our measurement is valid, there are still potential sources of error in the measurement process.

Sources of error An error in a measurement is a discrepancy between the measured value and the true value. “Error” does not mean mistake. Error is inevitable in any measurement.

Sources of error Two different sources of errors in measurements: 1.Bias 2.Variability

Sources of error Bias: comes from the way we measure. Systematically wrong in the same direction. Example Using police reports to measure public safety. Problem: not all crimes are reported to the police. Result: a biased measurement.

Sources of error How can we reduce bias? Bias comes from instruments or measuring procedures. Conclusion: the only way to reduce bias is to improve the instrument or use a better procedure.

Sources of error Variability: repeated measurements of the same individual give different values. Example Analog bathroom scale: each time you step on you may get a slightly different measurement of your weight.

Sources of error All measurements will have some variability. Example Measuring time with an atomic clock. Every 10 days the NIST reports the amount of error in time measurements. Some recent values: 6.0 ns, 5.9 ns, 5.5 ns, 5.2 ns Note that 1 ns = seconds

Sources of error If a measurement has small variability, then repeated measurements of the same individual will be close to each other. If a measurement has small variability, we say it is reliable.

Sources of error One way to reduce variability (make a measurement more reliable): Take several measurements, and then take the average. Variability causes measurements to be randomly scattered around the true value— some too small, some too large. Averaging several measurements will reduce the affect of this scattering.

Sources of error The two sources of error—bias and variability—are independent of each other. A measurement could have high bias but low variability, or vice versa. The following picture (which we have seen before) is helpful in understanding the two sources of error:

Sources of error Consider shooting arrows (measurements) at a target (the true value): Bias means the archer systematically misses in the same direction. Variability means that the arrows are scattered.

Sources of error Whenever we collect data, we make measurements. 1.Make sure measurements are valid (look for predictive validity, or use a rate instead of a count) 2.Measurement = true value + bias + variability 3.Averaging several measurements can reduce variability

Sample surveys are measurements A key example of a measurement is a sample survey. We attempt to measure a parameter using a statistic, but our measurement may have some bias, and will certainly have variability statistic = parameter + bias + variability

Bias, Variability, and Validity Keep in mind that bias and variability are both independent of validity. We can have a measurement that has very low bias and low variability, but if it isn’t valid, it’s useless. We can measure height very accurately, but that doesn’t make it a valid way to measure college readiness.