Chapter 2: Statistics of One Variable

Slides:



Advertisements
Similar presentations
Chapter 13 Section 1 - Slide 1 Copyright © 2009 Pearson Education, Inc. AND.
Advertisements

4 Uses and Abuses of Statistics 4.1 Introduction to Statistics
© 2008 McGraw-Hill Higher Education The Statistical Imagination Chapter 4. Measuring Averages.
MBF3C Lesson #1: Sampling Types and Techniques
Table of Contents Exit Appendix Behavioral Statistics.
Appendix A. Descriptive Statistics Statistics used to organize and summarize data in a meaningful way.
Sociology 690 – Data Analysis Simple Quantitative Data Analysis.
4. FREQUENCY DISTRIBUTION
Irwin/McGraw-Hill © The McGraw-Hill Companies, Inc., 2000 LIND MASON MARCHAL 1-1 Chapter Seven Sampling Methods and Sampling Distributions GOALS When you.
Introduction to Educational Statistics
Quantitative Data Analysis Definitions Examples of a data set Creating a data set Displaying and presenting data – frequency distributions Grouping and.
Introduction to Statistics
Data observation and Descriptive Statistics
Chapter 5: Descriptive Research Describe patterns of behavior, thoughts, and emotions among a group of individuals. Provide information about characteristics.
Chapter 3: Central Tendency
Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately describes the center of the.
Introduction to Statistics February 21, Statistics and Research Design Statistics: Theory and method of analyzing quantitative data from samples.
Understanding Research Results
MATH1342 S08 – 7:00A-8:15A T/R BB218 SPRING 2014 Daryl Rupp.
Data Handling Collecting Data Learning Outcomes  Understand terms: sample, population, discrete, continuous and variable  Understand the need for different.
Descriptive Statistics
Data Analysis with Graphs. Statistics is the gathering, organization, analysis and presentation of numerical information. Raw Data – unprocessed info.
Data Presentation.
Intro Stats Lesson 1.3 B Objectives: SSBAT classify different ways to collect data. SSBAT distinguish between different sampling techniques. Standards:
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Chapter 9 Statistics Section 9.1 Frequency Distributions; Measures of Central Tendency.
CHAPTER 1 Basic Statistics Statistics in Engineering
STATISTICS is about how to COLLECT, ORGANIZE,
Chapter 4 Statistics. 4.1 – What is Statistics? Definition Data are observed values of random variables. The field of statistics is a collection.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
PPA 501 – Analytical Methods in Administration Lecture 5a - Counting and Charting Responses.
Basic Statistics for Engineers. Collection, presentation, interpretation and decision making. Prof. Dudley S. Finch.
 Frequency Distribution is a statistical technique to explore the underlying patterns of raw data.  Preparing frequency distribution tables, we can.
Data Gathering Techniques. Essential Question: What are the different methods for gathering data about a population?
MM150 Unit 8 Seminar Statistics. 8.1 Sampling Techniques 2.
S AMPLING T ECHNIQUES. How is it possible that when survey’s come out, they can say that 23% of Canadians like to take long walks on the beach when no.
An Overview of Statistics Section 1.1. Ch1 Larson/Farber 2 Statistics is the science of collecting, organizing, analyzing, and interpreting data in order.
MM150 Unit 8 Seminar. Probability (Unit 7) Statistics (Unit 8) : Gathering data; organizing data Statistics (Unit 9) : Analyzing data; making conclusions.
Statistics- a branch of mathematics that involves the study of data The purpose of statistical study is to reach a conclusion or make a decision about.
Subbulakshmi Murugappan H/P:
Basic Statistical Terms: Statistics: refers to the sample A means by which a set of data may be described and interpreted in a meaningful way. A method.
Working with one variable data. Measures of Central Tendency In statistics, the three most commonly used measures of central tendency are: Mean Median.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
5.3.2 Collecting Samples Sampling Methods
1 Review Sections 2.1, 2.2, 1.3, 1.4, 1.5, 1.6 in text.
PROBABILITY AND STATISTICS WEEK 1 Onur Doğan. What is Statistics? Onur Doğan.
Central Tendency A statistical measure that serves as a descriptive statistic Determines a single value –summarize or condense a large set of data –accurately.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Copyright © 2009 Pearson Education, Inc. 8.1 Sampling Distributions LEARNING GOAL Understand the fundamental ideas of sampling distributions and how the.
Introduction to statistics I Sophia King Rm. P24 HWB
Sampling Measures Of Central Tendency SymbolsProbability Random Stuff.
Chapter 3: Central Tendency 1. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Chapter II Methods for Describing Sets of Data Exercises.
Slide Copyright © 2009 Pearson Education, Inc. Slide Copyright © 2009 Pearson Education, Inc. Chapter 8 Statistics.
Section 9.1 Samples and Central Tendency Section 9.1 Samples and Central Tendency.
Chapter 14 Statistics and Data Analysis. Data Analysis Chart Types Frequency Distribution.
Chapter 4: Measures of Central Tendency. Measures of central tendency are important descriptive measures that summarize a distribution of different categories.
Ing. Martina Majorová, FEM SUA Statistics Lecture 4 – Data sampling & Data sorting.
AP PSYCHOLOGY: UNIT I Introductory Psychology: Statistical Analysis The use of mathematics to organize, summarize and interpret numerical data.
Doc.RNDr.Iveta Bedáňová, Ph.D.
Frequency Distributions
Chapter 5 STATISTICS (PART 1).
MEASURES OF CENTRAL TENDENCY
An Introduction to Statistics
Week Three Review.
Chapter 3: Central Tendency
6A Types of Data, 6E Measuring the Centre of Data
Prepared by: C.Cichanowicz, March 2011
Experimental Design Experiments Observational Studies
Chapter 3: Central Tendency
Presentation transcript:

Chapter 2: Statistics of One Variable Grade 12 Data Management

Data Analysis With Graphs Statistics is the gathering, organization, analysis, and presentation of numerical information Unprocessed information collected for a study is called raw data The quantity being measured is a variable A continuous variable can have any value within a given range A discrete variable can have only certain separate values (often integers)

Frequency tables and frequency diagrams can give a convenient overview of the distribution of values of the table and reveal trends in the data A histogram is a special form of bar graph in which the area of the bars are proportional to the frequencies of the values of the variable Connected represents continuous range of values

A frequency polygon can illustrate the same information as a histogram or bar graphs

Example 1: Frequency Tables and Diagrams Here are the sum of two numbers from 50 rolls of a pair of standard dice.

Use a graph to illustrate the information in the frequency table. Bar Graph Frequency Polygon

Create a cumulative – frequency table and graph for the data

Example of Histogram Frequency Polygon may be superimposed onto the same grid as the histogram.

Setting up Frequency – distribution table Incorrect Correct The values in the intervals should not overlap, otherwise, a value belonging to two intervals would create a consistency error For example, suppose an individual is 38 years old, would that individual be placed in the “33 – 38” or “38 – 42” interval? It would be impossible to determine and inconsistent

Indices An index relates the value of a variable (or a group of variables) to a base label, which is often the value on a particular date CPI, TSE 300 Time-series graphs are often used to show how indices change over time Determine rate of change by determining slope using two points on the graph Rate of change = rise/run

Sampling Techniques Population refers to all individuals who belong to a group being studied Sample refers to the segment of the population used in a study Sampling frame refers to the group of individuals who actually have a chance of being selected

Simple Random Sample Every member of the population has an equal chance of being selected and the selection of any particular individual does not affect the chances of any other individual being chosen

Systematic Sample In a systematic sample, go through the population sequentially and select members at regular intervals The sample size and the population size determine the sampling interval Interval = population size/sample size Suppose a study determines the interval to be 3040, an individual may be selected from any of the first 3040 individuals, and select every 3040th individual from that point on

Stratified Sample If a population includes groups of members who share common characteristics, such as gender, age, or education level Such a group are called strata A stratified sample has the same proportion of members from each stratum as the population does

Example 2: Designing a Stratified Sample Before booking bands for the high school dances, the students’ council at Statsville High School wants to survey the music preferences of the student body. The following table shows the enrolment at the school. Design a stratified sample for a survey of 25% of the student body.

Example 2: Solution To obtain a stratified sample with the correct proportions, simple select 25% of the students in each grade level

Other Sampling Techniques Cluster Sample – If certain groups are likely to be representative of the entire population, you can use a random selection of such groups as cluster sample Multi – Stage Sample – Uses several levels of random sampling Voluntary – Response Sample – Researcher simply invites any member of the population to participate Convenience Sample – Often, a sample is selected simply because it is easily accessible

Measures of Central Tendency Often convenient to use a central value to summarize a set of data Various methods exists to find values around which a set of data tends to cluster These are known as measures of central tendency

Mean Commonly referred to as “average” Population mean (N = Entire population) µ = x1 + x2 + … + xN N µ = ∑ x Sample mean (n = Sample size) x = x1 + x2 + … + xn n x = ∑ x

The mode is the value that occurs most frequently in the distribution The median is the middle value of the data when they are ranked from highest to lowest When there is an even number of values, the median is the midpoint between the two middle values The mode is the value that occurs most frequently in the distribution Some distributions do not have a mode, while others have several

Weighted Mean A weighted mean gives a measure of central tendency that reflects the relative importance of the data: xw = w1x1 + w2x2 + … + wnxn w1 + w2 + … wn x = ∑ wixi ∑ wi Differs from standard mean calculation because it gives a stronger weight (importance) to certain categories

Example 3: Weighted Mean The HR manager for Statsville Marketing Limited considers five criteria when interviewing a job applicant. The manager gives each applicant a score between 1 and 5 in each category, with 5 being the highest score. Each category has a weighting between 1 and 3. The following table lists a recent applicant’s score and the company’s weighting factors.

Determine the weighted mean score for this job applicant.

Example 3: Solution xw = 2(4) + 2(2) + 3(5) + 3(5) + 1(4) 2 + 2 + 3 + 3 + 1 xw = 8 + 4 + 15 + 15 + 4 11 xw = 46 xw = 4.2 Therefore, applicant has a weighted mean of approx. 4.2.

Grouped Data When a set of data has been grouped into intervals, it is possible to approximate the mean using the formula: Population mean µ = ∑ fimi ∑ fi Sample mean x = ∑ fimi Where mi is the midpoint value of an interval and fi the frequency for that interval Estimate the median for grouped data by taking the midpoint of the interval within which the median is found

Example 4: Calculate the Mean and Median for Grouped Data A group of children were asked how many hours a day they spend watching television. The table at the right summarizes their response. Determine the mean and median number of hours for this distribution.

Example 4: Solution x = ∑ fimi ∑ fi x = 49 18 x = 2.7 Therefore, the mean time the children spent watching television is approximately 2.7 h a day.

It should be noted the values for the mean and median are approximate because where the data lie within each interval cannot be accurately determined

Example 5: Determine the error of the Frequency – Distribution Table Explain the problem with the intervals in the following table. Missing values between intervals

Homework Page 101 #1a, 2, 3ab, 8, 12, 15 Page 117 #1, 2, 3, 7, 9 Reminders: Mid – Term Exam (Thursday) Chapter 2 Quiz (Entire chapter, next Monday)