Quantitative vs. Categorical Data

Slides:



Advertisements
Similar presentations
Sampling A population is the total collection of units or elements you want to analyze. Whether the units you are talking about are residents of Nebraska,
Advertisements

Chapter 7: Data for Decisions Lesson Plan
Slide 1 Copyright © 2004 Pearson Education, Inc. Chapter 1 Introduction to Statistics 1-1 Overview 1-2 Types of Data 1-3 Critical Thinking 1-4 Design of.
Business Statistics for Managerial Decision
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 1 Section 2 – Slide 1 of 22 Chapter 1 Section 2 Observational Studies, Experiments, and.
Lesson Designing Samples. Knowledge Objectives Define population and sample. Explain how sampling differs from a census. Explain what is meant by.
Chapter 3 Producing Data 1. During most of this semester we go about statistics as if we already have data to work with. This is okay, but a little misleading.
Chapter 1 The Where, Why, and How of Data Collection
Introduction to Statistics
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 1-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Chapter 1 The Where, Why, and How of Data Collection
Section 5.1. Observational Study vs. Experiment  In an observational study, we observe individuals and measure variables of interest but do not attempt.
Chapter 3 Goals After completing this chapter, you should be able to: Describe key data collection methods Know key definitions:  Population vs. Sample.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 1-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
MATH1342 S08 – 7:00A-8:15A T/R BB218 SPRING 2014 Daryl Rupp.
Chapter 1 Getting Started
COLLECTING QUANTITATIVE DATA: Sampling and Data collection
Chapter 1: Introduction to Statistics
1.2: Statistical Thinking
CHAPTER 8 Producing Data: Sampling BPS - 5TH ED.CHAPTER 8 1.
Do Now: 1.Be sure to have picked up three papers upon entry. 2.Work with a partner to complete “The White House is not a Metronome Questions”
AP Statistics Section 5.1 Designing Samples. In an observational study, we simply observe individuals and measure variables, but we do not attempt to.
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
Copyright © 1998, Triola, Elementary Statistics Addison Wesley Longman 1 Elementary Statistics M A R I O F. T R I O L A Copyright © 1998, Triola, Elementary.
1.2 Key Terms Statistics:The collection, analysis, interpretation and presentation of data Population: A collection of persons, things or objects under.
Chapter 1: The Nature of Statistics
An Overview of Statistics
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
1  Specific number numerical measurement determined by a set of data Example: Twenty-three percent of people polled believed that there are too many polls.
Chapter 7: Data for Decisions Lesson Plan Sampling Bad Sampling Methods Simple Random Samples Cautions About Sample Surveys Experiments Thinking About.
Chapter 12 Sample Surveys
Section 5.1 Designing Samples Malboeuf AP Statistics, Section 5.1, Part 1 3 Observational vs. Experiment An observational study observes individuals.
MAT 1000 Mathematics in Today's World. Last Time 1.Two types of observational study 2.Three methods for choosing a sample.
Chapter 1 Getting Started 1.1 What is Statistics?.
Data Collection: Sample Design. Terminology Observational Study – observes individuals and measures variables of interest but does not impose treatment.
Designing Samples Chapter 5 – Producing Data YMS – 5.1.
Sampling Design and Analysis MTH 494 Ossam Chohan Assistant Professor CIIT Abbottabad.
Conducting A Study Designing Sample Designing Experiments Simulating Experiments Designing Sample Designing Experiments Simulating Experiments.
Lecture # 6:Designing samples or sample survey Important vocabulary Experimental Unit: An individual person,animal object on which the variables of interest.
BPS - 5th Ed. Chapter 81 Producing Data: Sampling.
An Overview of Statistics Section 1.1. Ch1 Larson/Farber 2 Statistics is the science of collecting, organizing, analyzing, and interpreting data in order.
C HAPTER 5: P RODUCING D ATA Section 5.1 – Designing Samples.
Section 5.1 Designing Samples AP Statistics
Slide 1 Copyright © 2004 Pearson Education, Inc..
Chapter 1 Introduction to Statistics 1-1 Overview 1-2 Types of Data 1-3 Critical Thinking 1-4 Design of Experiments.
AP STATISTICS LESSON AP STATISTICS LESSON DESIGNING DATA.
AP STATISTICS Section 5.1 Designing Samples. Objective: To be able to identify and use different sampling techniques. Observational Study: individuals.
Notes 1.3 (Part 1) An Overview of Statistics. What you will learn 1. How to design a statistical study 2. How to collect data by taking a census, using.
A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. Chap 1-1 A Course In Business Statistics 4 th Edition Chapter 1 The Where, Why, and How.
I can identify the difference between the population and a sample I can name and describe sampling designs I can name and describe types of bias I can.
 An observational study observes individuals and measures variable of interest but does not attempt to influence the responses.  Often fails due to.
Chapter 3 Surveys and Sampling © 2010 Pearson Education 1.
Chapter 7 Data for Decisions. Population vs Sample A Population in a statistical study is the entire group of individuals about which we want information.
Introduction/ Section 5.1 Designing Samples.  We know how to describe data in various ways ◦ Visually, Numerically, etc  Now, we’ll focus on producing.
Chapter 5: Producing Data 5.1 – Designing Samples "An approximate answer to the right question is worth a good deal more than the exact answer to an approximate.
Ing. Martina Majorová, FEM SUA Statistics Lecture 4 – Data sampling & Data sorting.
Chapter 1 Introduction to Statistics 1-1 Overview 1-2 Types of Data 1-3 Critical Thinking 1-4 Design of Experiments.
Chapter 5 Producing Data 5.1 Designing Samples
statistics Specific number
Understandable Statistics
Producing Data Chapter 5.
statistics Specific number
The Nature of Probability and Statistics
Warm Up Imagine you want to conduct a survey of the students at Leland High School to find the most beloved and despised math teacher on campus. Among.
Use your Chapter 1 notes to complete the following warm-up.
Daniela Stan Raicu School of CTI, DePaul University
Chapter 5: Producing Data
Chapter 5: Producing Data
Presentation transcript:

Quantitative vs. Categorical Data Quantitative data consist of number that represent counts or measurements. All quantitative data is numerical, but not all numerical data is quantitative. Data with a unit of measurement (seconds, feet, pounds, dollars, etc.) is quantitative. Numerical data used as a label or range of values (Student ID Number, 20-25 years) is not quantitative.

Examples: Quantitative Data The University keeps the following quantitative data about each student. Grade Point Average Number of Credit Hours Completed Age Amount of money owed for tuition Other examples?

Categorical Data Non-quantitative data is called categorical. Non-numerical data must be categorical. Numerical data that serves to label or identify individuals are categorical (Example: Social Security Number). A useful guide: Would it make sense to consider an average value? If not, treat the data as categorical.

Examples: Categorical Data The University keeps the following categorical data about each student: Name Laker ID Number Date of Birth Gender Residency (“in-state” or “out-of-state”) Other?

Basic Practice of Statistics - 3rd Edition Chapter One, Part 2 1.4: Critical Thinking in Statistics 1.5: Collecting Sample Data Chapter 1 Chapter 1 Chapter 1 5 5

Basic Practice of Statistics - 3rd Edition Quick Review We typically have a VERY LARGE set of individuals (called the Population), but we cannot obtain data from every individual. A parameter is a numerical value that describes the population. The actual value is not known. We choose a subset of the Population (this subset is called a Sample), and gather data from those individuals. A statistic is a numerical value, computed from the Sample data. We often use this to estimate the unknown value of some parameter. Chapter 1

Population vs. Sample (not to scale!!) Basic Practice of Statistics - 3rd Edition Population vs. Sample (not to scale!!) Chapter 1

Basic Practice of Statistics - 3rd Edition Sampling Bias Ideally, we want our sample to be representative of the overall population. If the way we choose the sample and/or gather data from the chosen individuals… …is more/less likely to include a certain type of individual or produce a certain type of response... …then the conclusions we draw from the sample might be inaccurate for the intended population. This is called bias. Examples to follow. 08/15/11 Chapter 1

Basic Practice of Statistics - 3rd Edition Examples of Bias Estimate average class height using a sample of students from the front row. Estimate average class height using a sample of male students. Study the effectiveness of a weight-loss diet using a sample of professional athletes. Estimate what percent of Americans approve of the president using a sample of voters from only one political party. 08/15/11 Chapter 1

Common Types of Sampling Bias Basic Practice of Statistics - 3rd Edition Common Types of Sampling Bias A voluntary response sample occurs when the individuals to be studied have control over whether or not they are included in the sample. This is also called “self-selection bias.” A convenience sample occurs when the researcher is more likely to choose individuals for which it is easier to obtain data. The researcher might be unaware of this! Small sample: Using too few individuals increases of chance of getting a sample that consists only of “unusual” individuals. 08/15/11 Chapter 1

Basic Practice of Statistics - 3rd Edition Example: Voluntary Response Ratemyprofessors.com is a website that collects information about college professors from their students. The ratings come from students volunteering to create an account and submit information. Question: What kind of students are likely to volunteer? Chapter 1 11

Voluntary Response Bias Basic Practice of Statistics - 3rd Edition Voluntary Response Bias Answer: Students with stronger opinions are more likely to volunteer a response. In many “customer satisfaction” surveys, those with a strong negative opinion are most likely to volunteer. Those with a neutral opinion are least likely to volunteer. There is potential bias: those in the sample are more likely to have a negative opinion than the entire population. 08/15/11 Chapter 1

Example: Convenience Sample Basic Practice of Statistics - 3rd Edition Example: Convenience Sample I want to determine the average age of all current Clayton State students. For my sample data, I choose five students from the class and compute their average age. Why might this lead to inaccurate results? Although my intended population is all CSU students, I picked only from a small part of the population (that was most convenient for me). 08/15/11 Chapter 1

Basic Practice of Statistics - 3rd Edition Other Common Problems Some types of bias occur not in choosing the sample, but in gathering data from the chosen individual. Misreported data: Individuals may give inaccurate results (perhaps unintentionally) when asked a certain question. Example: How much do you weigh today? Example: How many hours per week do you study? Question wording: Variations in the wording of a question can greatly influence people’s responses. Compare: Should the government spend more money on public education? Should the government spend more of your tax dollars on public education? 08/15/11 Chapter 1

Basic Practice of Statistics - 3rd Edition Good Ways to Sample We usually want to have some degree of randomness when choosing our sample. Randomly-chosen samples reduce the potential for biased results, but complete randomness is not always possible. We’ll talk more about what “random” actually means in Chapters 4 and 5. 08/15/11 Chapter 1

*** Simple Random Sample *** Basic Practice of Statistics - 3rd Edition *** Simple Random Sample *** All of the statistical inference in this course will assume that data comes from a Simple Random Sample (SRS). This means… Before choosing individuals, we decide how many we want to use. This is called the sample size, usually denoted by the letter n. We choose the sample so that each group of n individuals (from the overall population) is equally likely to be picked. 08/15/11 Chapter 1

Basic Practice of Statistics - 3rd Edition Other “Good” Samples Random Sample: Each individual from the population has an equal chance of being chosen for the sample (every SRS is a Random Sample, but not every Random Sample is an SRS). Probability Sample: For each individual, we know the chance that he/she will be chosen for the sample, but different individuals may have different chances (Random Samples and SRS’s are special cases of this). 08/15/11 Chapter 1

Basic Practice of Statistics - 3rd Edition Other “Good” Samples Stratified Sample: Divide the population into mutually exclusive groups (strata), and choose a sample (often an SRS) from within each group. This is often done if we want to account for some kind of population demographics. Example: If our population is 60% women and 40% men, we might choose a sample of 30 women and 20 men. Sample demographics match those of the population. 08/15/11 Chapter 1

Basic Practice of Statistics - 3rd Edition Other “Good” Samples Cluster Sampling: Divide the population into mutually exclusive groups (clusters), and randomly choose a set of clusters. For each selected cluster, gather data from all individuals within that cluster. Example: There are 13 sections of Math 1231 currently offered at Clayton State. Randomly choose 3 sections, and survey all students from each of those 3 sections. 08/15/11 Chapter 1

Basic Practice of Statistics - 3rd Edition Other Types of Samples? Real-world statistics often uses very complex sampling methods. See the text for a survey of some of these. The trade-off: A simple method (like an SRS) is easier to analyze mathematically, but often more difficult to achieve in practice. 08/15/11 Chapter 1

How are the data obtained? In addition to how the individuals are chosen for a sample, we distinguish between the following two scenarios: Observational Study: Simply observe and/or measure individuals, without attempting to modify their characteristics or behavior. Experiment: Deliberately impose a specific set of conditions (a treatment) on each individual. A valid experiment has more than one possible treatment, and we can compare results. 08/15/11

Example: Experiment vs. Observational Study Question: Is there any sort of relationship between caffeine use and exam scores? Here is an Observational Study: Just before the exam, record how much caffeine each student has consumed today. Record each student’s exam score. It will probably be the case that different students consume different amounts of caffeine, but we do not deliberately try to create this difference. 08/15/11

Example: Experiment vs. Observational Study Question: Is there any sort of relationship between caffeine use and exam scores? Here is an Experiment: Require students to consume no caffeine on exam day. 15 minutes before the exam, give each student a cup of coffee. Some students will get regular coffee (with caffeine), others will get decaffeinated coffee. Record each student’s exam score. It will certainly be the case that different students consume different amounts of caffeine, because we deliberately created such a difference. 08/15/11

Questions for Discussion In the Experiment, why not give all students a cup of (caffeinated) coffee? In the Experiment, why not use “regular coffee” versus “no coffee”? What are some advantages/disadvantages of the Study versus the Experiment? Can you think of a scenario where it would not be possible to do an Experiment?