Elementary Statistics Joan Sholars, Professor. Statistics, the Beginning Statistics is the study of procedures for collecting, describing, and drawing.

Slides:



Advertisements
Similar presentations
1.3 Data Collection and Experimental Design
Advertisements

Introduction to Statistics
Chapter 7: Data for Decisions Lesson Plan
AP Stat Trivia Review.
Section 1.3 Experimental Design © 2012 Pearson Education, Inc. All rights reserved. 1 of 61.
Section 1.3 Experimental Design.
The Data Analysis Process & Collecting Data Sensibly
1. Identify the variable(s) of interest (the focus) and the population of the study. 2. Develop a detailed plan for collecting data. Make sure sample.
East Los Angeles College Math 227 – Statistics Fall 2008
Chapter 3 Producing Data 1. During most of this semester we go about statistics as if we already have data to work with. This is okay, but a little misleading.
Research Ethics Levels of Measurement. Ethical Issues Include: Anonymity – researcher does not know who participated or is not able to match the response.
Chapter 1 Introduction to Statistics
Quantitative vs. Categorical Data
MATH1342 S08 – 7:00A-8:15A T/R BB218 SPRING 2014 Daryl Rupp.
Created by Tom Wegleitner, Centreville, Virginia Section 1-4 Design of Experiments.
Homework Check Homework check now... Please take out your homework so we can check it.
Chapter 1: Introduction to Statistics
STATISTICS is about how to COLLECT, ORGANIZE,
Created by Tom Wegleitner, Centreville, Virginia Section 1-4 Design of Experiments.
Experimental Design 1 Section 1.3. Section 1.3 Objectives 2 Discuss how to design a statistical study Discuss data collection techniques Discuss how to.
Chapter 4 Statistics. 4.1 – What is Statistics? Definition Data are observed values of random variables. The field of statistics is a collection.
Introduction to Probability and Statistics Consultation time: Ms. Chong.
Chapter 1: The Nature of Statistics
Section 1.2 ~ Sampling Introduction to Probability and Statistics Ms. Young.
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
Chapter 7: Data for Decisions Lesson Plan Sampling Bad Sampling Methods Simple Random Samples Cautions About Sample Surveys Experiments Thinking About.
Copyright © 2015, 2012, and 2009 Pearson Education, Inc. 1 Chapter Introduction to Statistics 1.
LECTURE 2 TUESDAY, 1 September STA 291 Fall
© 2010 Pearson Prentice Hall. All rights reserved 1-1 Objectives 1.Define statistics and statistical thinking 2.Explain the process of statistics 3.Distinguish.
Chapter 1 Getting Started 1.1 What is Statistics?.
1. Identify the variable(s) of interest (the focus) and the population of the study. 2. Develop a detailed plan for collecting data. Make sure sample.
Chapter 1 Getting Started Understanding Basic Statistics Fifth Edition By Brase and Brase Prepared by Jon Booze.
S AMPLING T ECHNIQUES. How is it possible that when survey’s come out, they can say that 23% of Canadians like to take long walks on the beach when no.
An Overview of Statistics Section 1.1. Ch1 Larson/Farber 2 Statistics is the science of collecting, organizing, analyzing, and interpreting data in order.
© 2008 Pearson Addison-Wesley. All rights reserved Chapter 5 Statistical Reasoning.
Notes 1.3 (Part 1) An Overview of Statistics. What you will learn 1. How to design a statistical study 2. How to collect data by taking a census, using.
EXTRA PRACTICE WITH ANSWERS
Introduction to the practice of Statistics
SAMPLING TECHNIQUES LECTURE - 2 GE 608 Experimental Methods and Analysis Oct 28, 2015 Muharrum 14, 1437.
Introduction to Statistics Chapter 1. § 1.1 An Overview of Statistics.
Intro Stats Warm – Up 1.3 Determine the Level of Measurement of each (nominal, ordinal, interval, ratio) 1. The daily high temperatures in a city. 2. The.
1 Introduction to Statistics. 2 What is Statistics? The gathering, organization, analysis, and presentation of numerical information.
Data Collection and Sampling Techniques.   Data can be collected in a variety of ways. One of the most common methods is through the use of surveys.
Section 1.3 Experimental Design.
CHAPTER 1: INTRODUCTION TO STATISTICS SECTION 1.1: AN OVERVIEW OF STATISTICS.
Ch1 Larson/Farber 1 1 Elementary Statistics Larson Farber Introduction to Statistics As you view these slides be sure to have paper, pencil, a calculator.
SECTION 1.1 SAMPLING. Objectives 1. Construct a simple random sample 2. Determine when samples of convenience are acceptable 3. Describe stratified sampling,
Producing Data: Experiments BPS - 5th Ed. Chapter 9 1.
SECTION 1.2 TYPES OF DATA. Objectives 1. Understand the structure of a typical data set 2. Distinguish between qualitative and quantitative variables.
An Overview of Statistics Section 1.1 After you see the slides for each section, do the Try It Yourself problems in your text for that section to see if.
Copyright © 2016 The McGraw-Hill Companies, Inc. Permission required for reproduction or display. T YPES OF D ATA Section 1.2.
Copyright © 2016 The McGraw-Hill Companies, Inc. Permission required for reproduction or display. D ESIGN OF E XPERIMENTS Section 1.3.
Ten things about Experimental Design AP Statistics, Second Semester Review.
MATH Section 6.1. Sampling: Terms: Population – each element (or person) from the set of observations that can be made Sample – a subset of the.
Section 1.3 Objectives Discuss how to design a statistical study Discuss data collection techniques Discuss how to design an experiment Discuss sampling.
Biostatistics Dr. Amjad El-Shanti MD, PMH,Dr PH University of Palestine 2016.
DATA COLLECTION AND EXPERIMENTAL DESIGN SECTION 1.3 NOTES.
Copyright © 2016 The McGraw-Hill Companies, Inc. Permission required for reproduction or display. S AMPLING Section 1.1.
Applied Statistics Using SAS and SPSS Topic: Inferences for Different Types of Studies By Prof Kelly Fan, Cal State Univ, East Bay.
Elementary Statistics Ron Larson and Betsy Farber
Unit 1 Section 1.3.
Introduction to Statistics
Chapter 1: Introduction to Statistics
Applied Statistics Using SAS and SPSS
Introduction to Statistics
MATH 2311 Section 6.1.
Applied Statistics Using SPSS
Data Collection and Experimental Design
Introduction to Statistics
Presentation transcript:

Elementary Statistics Joan Sholars, Professor

Statistics, the Beginning Statistics is the study of procedures for collecting, describing, and drawing conclusions from information. Statistics is a collection of methods for planning experiments, obtaining data, and then organizing, summarizing, presenting, analyzing, interpreting and drawing conclusions based on data.

A population is the entire collection of individuals about which information is sought. A sample is a subset of a population, containing the individuals that will actually be observed. Ideally, we would like our sample to represent the population as closely as possible.

A simple random sample of size n is a sample chosen by a method in which each collection of n population items is equally likely to comprise the sample.

Questions A pollster wants to estimate the proportion of voters in a certain town who are Democrats. He goes to a large shopping mall and approaches people to ask them if they are Democrats. Is this a simple random sample? Explain. A telephone company wants to estimate the proportion of customers who are satisfied with their service. They use a computer to generate a list of random phone numbers and call those people to ask them whether they are satisfied. Is this a simple random sample? Explain.

Other sampling techniques A sample of convenience is a sample that is not drawn by a well-defined random method. The big problem with samples of convenience is that they may differ systematically in some way from the population. A sample of convenience may be acceptable where it is reasonable to believe that there is no significant difference between the sample and the population.

Stratified Sampling In stratified sample, the population is divided into groups, called strata, where the members of each strata are similar in some way. Then a simple random sample is drawn from each stratum.

Cluster Sample In cluster sampling, items are drown from the population in groups, or clusters. Cluster sampling is useful when the population is too large or too spread out for simple random sampling to be feasible. Cluster sampling is used extensively by U. S. government agencies in sampling the U. S. population to measure sociological factors such as income or employment.

Systematic Sample In a systematic sample, the population items are ordered. It is decided how frequently to sample items. Let k represent the sampling frequency. To begin the sampling, choose a starting point at random. Select the item in the starting place along with every k th item after that. Systematic sampling is sometimes used to sample products as they come off an assembly line, in order to check that they meet quality standards.

Voluntary Response Sampling Voluntary response samples are often used by the media to try to engage the audience. How reliable are voluntary response samples? To put it simply, voluntary response samples are never reliable. People who go to the trouble to volunteer an opinion tend to have stronger opinions than is typical of the population. In addition, people with negative opinions are often more likely to volunteer their responses than those with positive opinions.

Check your understanding A radio talk show host invites listeners to send an to express their opinions on an upcoming election. More than 10,000 s are received. What kind of sample is this? Every ten year, the U. S. Census Bureau attempts to count every person living in the United States. To check the accuracy of their count in a certain city, they draw a sample of census districts (roughly equivalent to a city block) and recount everyone in the sampled districts. What kind of sample is formed by the people who are recounted?

A college basketball team held a promotion at one of its games in which every 20 th person who entered the arena won a free basketball. What kind of sample do the winner represent? A public health researcher is designing a study of the effect of diet on heart disease. The researcher knows that the diets of men and women tend to differ and that men are more susceptible to heart disease. To be sure that both men and women are represented, the study comprises a simple random sample of 100 men and another simple random sample of 100 women. What kind of sample do these 200 people represent?

USA Today, December 10, The biggest study ever of the health effects of alcohol concludes that a drink a day can cut your risk of death by 20%…The researchers gave questionnaires to 490,000 men and women and then followed up nine years later, after 46,000 of them had died…[However], the benefits decreased as people drank more. Among those who averaged four or five drinks a day, the risk of death among men was 10% lower, while among women it was 7% lower.

Statistics and Parameters A statistic is a number that describes a sample. A parameter is a number that describes a population. Which is a statistic and which is a parameter? – 57% of the teachers at Central High School are female. – In a sample of 100 surgery patients who were given a new pain reliever, 78% of them reported significant pain relief.

Potential advertisers value television ’ s well-known Nielsen ratings as a barometer of a TV show ’ s popularity among viewers. The Nielsen rating of a certain TV program is an estimate of the proportion of viewers, expressed as a percentage, who tune their sets to the program on a given night at a given time. A typical Nielsen survey consists of 165 families selected nationwide who regularly watch television. Suppose we are interested in the Nielsen ratings for the latest episode of Criminal Minds. Identify the population of interest. Describe the sample.

We just talked about various methods of collecting information by sampling. Once the information has been collected, the collection is called the data set. The characteristics of the individuals about which we collect information are called variables.

Variables Qualitative variables classify individuals into categories. Quantitative variables tell how much or how many of something there is. Another way to distinguish qualitative from quantitative variables: – Quantitative variables are counts or measurements. – Qualitative variables are descriptions.

Check your Understanding Which of the following variables are qualitative and which are quantitative? – A person’s age – A person’s gender – The mileage in miles per gallon of a car – The color of a car

Qualitative Qualitative variables come in two types: ordinal variables and nominal variables. An ordinal variable is one whose categories have a natural ordering. The letter grade received in a class, such as A, B, C, D or F is an ordinal variable. A nominal variable is one whose categories have no natural ordering. Gender is an example of a nominal variable.

Check your Understanding Which of the following variables are ordinal and which are nominal? – State of residence – Gender – Ranking of service (Poor, Fair, Good, Excellent) – Size of soft drink ordered at a fast-food restaurant (small, medium large) – The majors of students in this class

Quantitative Quantitative variables can either be discreet or continuous. Discreet variables are those whose possible values can be listed. Often discrete variables results from counting something. Continuous variables, in principle, can take on any value in an interval. Continuous variables usually involve measurements.

Check for Understanding Which of the following variables are discrete and which are continuous? – The age of a person at his or her last birthday – The height of a person – The number of siblings a person has – The distance a person commutes to work

Check for Understanding Classify each variable as nominal, ordinal, discrete or continuous. – Rating of newscasts in Houston (poor, fair, good or excellent) – Number of pages in an LA phonebook – Weights of cattle – Temperature of automatic popcorn popping machines – Zip codes – Marital status – Shoe Size

Check for Understanding Classify each sampling technique as random, systematic, cluster or stratified – Every 7 th customer entering a shopping center is asked to state his or her favorite store – In a large school district (such as Mt. SAC) all teachers from two buildings were asked whether they believe students has less homework to do than in previous years – Mail carriers of a large city are divided into four groups according to gender (male and female) and according to whether they walk or ride their route. Then 10 are selected from each group and interviewed to determine whether they have been bitten by a dog in the last year

Designing an Experiment The experimental units are the individuals that are studied. These can be people, animals, plants, or things. When the experimental units are people, they are called subjects. The outcome or response is what is measured on each experimental unit. The treatments are the procedures applied to each experimental unit. There are always two or more treatments. The purpose is to determine whether the choice of treatment affects the outcome.

In general, studies fall into two categories: randomized experiments and observational studies. A randomized experiment is a study in which the investigator assigns the treatments to the experimental units at random. An observational study is one in which the assignment to treatment groups is not made by the investigator.

In July 2008, an article in The New England Journal of Medicine (359: ) reported the results of a study to determine whether a new drug called raltegravir is effective in reducing levels of virus in patients with HIV. A total of 699 patients participated in the experiment. These patients were divided into two groups. One group was given raltegravir. The other group was given a placebo. Raltegravir was given to about two-thirds of the subjects and the placebo was given to the rest. To determine which patients would be assigned to which group, a simple random sample consisting of 442 of the 699 patients were drawn: this sample constituted the raltegravir group. The remaining 237 patients were assigned to the placebo group. It was decided to examine subjects after 16 weeks and measure the levels of virus in their blood. The outcome for this experiment was the number of copies of virus per milliliter of blood. In the raltegravir group, 62% of the subjects had a successful outcome, but only 35% of the placebo group did. The conclusion was that raltegravir was effective in lowering the concentration of virus in HIV patients.

Double Blind Experiments We have discussed the advantages of assigning treatments at random. It is a further advantage if the assignment can be done in such a way that neither the experimenters nor the subjects know which treatment has been assigned to which subject. Experiments like this are called double-blind experiments. The raltegravir experiment was a double-blind experiment because neither the patients nor the doctors knew which patients were receiving the drug and which were receiving the placebo. An experiment is double-blind if neither the investigators nor the subjects know who has been assigned to which treatment.

Observational Studies Observational studies are less reliable than randomized experiments. The major problem with observational studies is that it is difficult to tell whether a difference in the outcome is due to the treatment or to some other difference between the treatment and control groups. This is known as confounding. A confounder is a variable that is related to both the treatment and the outcome. When a confounder is present, it is difficult to determine whether differences in the outcome are due to the treatment or to the confounder.

Observational Studies In a cohort study, a group of subjects (the cohort) is studied to determine whether various factors of interest are associated with an outcome. In a prospective cohort study, the subjects are followed over time. In a cross-sectional study, measurements are taken at one point in time. Cross-sectional studies are relatively inexpensive, and results can be obtained quickly. The main disadvantage is that exposure is measured at only one point in time, so there is little information about how past experiences may have contributed to the outcome. In a retrospective cohort study, subjects are sampled after the outcome has occurred. Investigators then look back over time to determine whether certain factors are related to the outcome.

Case-Controlled Studies In a case-control study, two samples are drawn. One sample consists of people who have the disease of interest (the cases) and the other consists of people who do not have the disease (the controls). The investigators look back in time to determine whether a particular factor of interest differs between the two groups.

Check your understanding A recent study compared the heart rates of 19 infants born to nonsmoking mothers with those of 17 infants born to mothers who smoked an average of 15 cigarettes a day while pregnant and after giving birth. The heart rates of the infants at one year of age were 20% slower on the average for the smoking mothers. – What is the outcome variable? – What is the treatment variable? – Was this a cohort study or a case-control study? – Was the study prospective, cross-sectional, or retrospective? – Could the results be due to confounding? Explain.

Bias in Studies A study conducted by a procedure that produces the correct result on the average is said to be unbiased. A study conducted by a procedure that tends to overestimate or underestimate the true value is said to be biased.

Types of Bias Voluntary response bias Self-interest bias Social acceptability bias Leading question bias Non-response bias Sampling Bias