Topic 1: Samples and Populations

Slides:



Advertisements
Similar presentations
Chapter 3 Producing Data 1. During most of this semester we go about statistics as if we already have data to work with. This is okay, but a little misleading.
Advertisements

Descriptive and Inferential Statistics.  You’re already familiar with statistics through radio, television, newspapers, and magazines:  Eating 10 g.
Quantitative vs. Categorical Data
Introduction In medicine, business, sports, science, and other fields, important decisions are based on statistical information drawn from samples. A sample.
An Overview of Statistics
Agresti/Franklin Statistics, 1 of 33 Chapter 1 Statistics: The Art and Science of Learning from Data Learn …. What Statistics Is Why Statistics Is Important.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 1 Statistics: The Art and Science of Learning from Data Section 1.2 Sample Versus Population.
What is Statistics? Chapter 0. What is Statistics? Statistics is the science (and art) of learning from data. Statistics is the study of variability.
Organization of statistical investigation. Medical Statistics Commonly the word statistics means the arranging of data into charts, tables, and graphs.
Intro to Probability and Statistics 1-1: How Can You Investigate Using Data? 1-2: We Learn about Populations Using Samples 1-3: What Role Do Computers.
Yandell - Econ 216 Chap 1-1 Chapter 1 Introduction and Data Collection.
Section 4.1 Why Take Samples and.
Topic 2: Types of Statistical Studies
Chapter 1: Statistics, Data and Statistical Thinking
GS/PPAL Research Methods and Information Systems
Learning Objectives : After completing this lesson, you should be able to: Describe key data collection methods Know key definitions: Population vs. Sample.
Chapter 1 Introduction and Data Collection
Statistics: Experimental Design
An Overview of Statistical Inference – Learning from Data
Sample Surveys.
Statistics in Management
Overview of probability and statistics
Lecture 3 Biostatistics in practice of health protection
Introduction to Statistics
Elementary Statistics
Part III – Gathering Data
Chapter 1: Statistics, Data, and Statistical Thinking
Section 5.1 Designing Samples
Josie Burridge & Alyssa Pennacchi
Experimental Design, Data collection, and sampling Techniques
Why Study Statistics? Statistics is the study of collecting, displaying, analyzing, and interpreting information. Information that was collected in a.
The Language of Sampling
Probability and Statistics
Get survey from Mr. Ebersole and read directions and complete it.
Statistical Data Analysis
Data, conclusions and generalizations
Sampling And Sampling Methods.
Bias On-Level Statistics.
CHAPTER 12 Sample Surveys.
Inference for Sampling
An Overview of Statistical Inference – Learning from Data
Designing experiments
The Language of Sampling
Chapter 7 Special Topics
Introduction to Statistics
Gathering and Organizing Data
It is actually way more exciting than it sounds!!!!
WARM – UP Use LINE 5 of the random digit table. 30. The World Series.
Use your Chapter 1 notes to complete the following warm-up.
Designing Samples Statistical techniques for producing data open the door to formal statistical inference, which answers specific questions with a known.
Chapter 1 Statistics: The Art and Science of Learning from Data
Chapter 1: Statistics.
Chapter 1 Why Study Statistics?
Chapter 1 The Where, Why, and How of Data Collection
Statistics · the study of information Data · information
6A Types of Data, 6E Measuring the Centre of Data
Introduction to Statistics
Statistical Data Analysis
The Language of Sampling
Sample Surveys Idea 1: Examine a part of the whole.
Business Statistics: A First Course (3rd Edition)
Designing Samples Statistical techniques for producing data open the door to formal statistical inference, which answers specific questions with a known.
Two Halves to Statistics
Chapter 1 Why Study Statistics?
The Where, Why, and How of Data Collection
Randomization and Bias
Chapter 1: Statistics, Data and Statistical Thinking
Probability and Statistics
Introduction to Statistics
Chapter 1 The Where, Why, and How of Data Collection
Presentation transcript:

Topic 1: Samples and Populations

Population and sample Research Question Are dogs sensitive to the Earth’s magnetic field? Population All dogs Sample 70 dogs (37 different breeds) Article: http://www.pbs.org/newshour/rundown/dogs-poop-in-alignment-with-earths-magnetic-field-study-finds/ Original study: http://www.frontiersinzoology.com/content/pdf/1742-9994-10-80.pdf.

Why choose a sample? If we collect data from the entire population, we call this a census. Reasons we may not collect data from the entire population include: too expensive too time consuming some subjects are unwilling to participate the entire population cannot be identified data collection might be a destructive process

Sampling is natural Here are a few “everyday” examples of decision making based on data collected from a sample: You buy a product because a friend recommends it. You buy a product because it has great reviews on Amazon.com You stop using a product because the news featured a story about someone who had a terrible experience using the product. Common sense tells us that data collected from a few individuals might lead to results that are quite different from data collected from the entire population. What we need is for the sample to be essentially a small version of the population. We call this a representative sample.

Representative sampling is less natural Great care must be taken to choose a representative sample from a population. One of the main contributors to inaccurate statistical results is working with a sample that is not representative of the population. Common problems with sampling: Anecdotal evidence (haphazardly choosing a sample) Non-response bias Voluntary-response bias Convenience sampling

Anecdotal evidence Anecdotal evidence is information collected haphazardly. It typically has the following characteristics: Only a few cases are considered (very small sample size) There was no attempt to make the sample representative of the population Focus is on evidence that is unusual, shocking, interesting, etc. Anecdotal evidence is often like this: A man on the news got mercury poisoning from eating swordfish, so the average mercury concentration in swordfish must be dangerously high. My friend’s dad had a heart attack and died after they gave him a new heart disease drug, so the drug must not work.

Anecdotal evidence and early smoking research Smoking research started in the 1930s and 1940s when cigarette smoking became increasingly popular. While some smokers seemed to be sensitive to cigarette smoke, others were completely unaffected. Anti-smoking study results were faced with resistance based on anecdotal evidence such as, “My uncle smokes three packs a day and he’s in perfectly good health.” In time, researchers were able to examine larger samples of cases (smokers), and trends showing that smoking has negative health impacts became much clearer. Brandt, The Cigarette Century (2009), Basic Books.

Non-response bias If only a small fraction of people in a representative sample choose to respond to a survey, the sample may no longer be representative of the population. There has been a well-documented decline in household survey response rates in recent decades. The American Time Use Survey (ATUS) is designed to produce comprehensive information on how Americans use their time, and should deepen our understanding of family life. In 2004, the ATUS response rate was below 60%. Whether the responses obtained could be generalized to the target population was call into question. http://poq.oxfordjournals.org/content/70/5/676.full.pdf+html

Voluntary-response bias This occurs when the sample consists of people who volunteer to respond because they have strong opinions on the issue. Such a sample is unlikely to be representative of the population.

Convenience sampling Individuals who are easily accessible are more likely to be included in the sample. “One of the most contentious issues in consumer behavior research, and social science research generally, is the use of convenience samples of undergraduate college students as subjects in behavioral investigations.” http://www.mccombs.utexas.edu/~/media/Files/MSB/Research/Publications/2014Q1/Convenience%20samples%20of.pdf

How do we get a representative sample? The universally accepted method for choosing a representative sample is random selection. Ideal situation: You can enumerate every individual in the population and randomly select (using a computer algorithm) the number of individuals you want in your sample. Sometimes this ideal situation is not practical, but all acceptable sampling techniques for obtaining a representative sample are based on random selection.

Descriptive Statistics and Inferential Statistics Suppose we have chosen a random sample from a population and collected some data. Descriptive statistics refers to methods used to summarize the data collected from the sample. This usually entails numerical summaries (for example, averages) and graphical summaries (for example, a bar graph). Inferential statistics refers to methods used to estimate or predict something about a population, based on the data collected from the sample.

Sample Statistics and Population Parameters Suppose we have chosen a random sample from a population and collected some data. A statistic is a numerical summary (for example, an average) of the data collected from the sample. A parameter is a numerical summary (for example, an average) of the population data. The main idea: Starting with a statistic obtained from sample data, we use methods from inferential statistics to estimate the corresponding parameter of the population.