Data Description Chapter 3.

Slides:



Advertisements
Similar presentations
Measures of Central Tendency
Advertisements

Frequency Distributions & Graphs Chapter 2. Outline 2-1 Introduction 2-2 Organizing Data 2-3 Histograms, Frequency Polygons, and Ogives 2-4 Other Types.
The mean for quantitative data is obtained by dividing the sum of all values by the number of values in the data set.
Calculating & Reporting Healthcare Statistics
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Created by Tom Wegleitner, Centreville, Virginia Section 3-1.
Intro to Descriptive Statistics
Descriptive Statistics Healey Chapters 3 and 4 (1e) or Ch. 3 (2/3e)
Chapter 3 Data Description 1 McGraw-Hill, Bluman, 7 th ed, Chapter 3.
Unit 3 Sections 3-2 – Day : Properties and Uses of Central Tendency The Mean  One computes the mean by using all the values of the data.  The.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
Describing Data: Numerical
Chapter 3 Descriptive Measures
Chapter 13 Section 5 - Slide 1 Copyright © 2009 Pearson Education, Inc. AND.
AP Statistics Chapters 0 & 1 Review. Variables fall into two main categories: A categorical, or qualitative, variable places an individual into one of.
Summarizing Scores With Measures of Central Tendency
Measurements of Central Tendency. Statistics vs Parameters Statistic: A characteristic or measure obtained by using the data values from a sample. Parameter:
Numerical Descriptive Techniques
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Statistics Workshop Tutorial 3
Chapter 3 Statistics for Describing, Exploring, and Comparing Data
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Created by Tom Wegleitner, Centreville, Virginia Section 3-1 Review and.
1 Measure of Center  Measure of Center the value at the center or middle of a data set 1.Mean 2.Median 3.Mode 4.Midrange (rarely used)
Measures of Central Tendency or Measures of Location or Measures of Averages.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Bluman, Chapter 31 Class Limits Frequency
Unit 3 Sections 3-1 & 3-2. What we will be able to do throughout this chapter…  Use statistical methods to summarize data  The most familiar method.
© The McGraw-Hill Companies, Inc., Chapter 3 Data Description.
Created by Tom Wegleitner, Centreville, Virginia Section 2-4 Measures of Center.
Probabilistic & Statistical Techniques Eng. Tamer Eshtawi First Semester Eng. Tamer Eshtawi First Semester
Data Description Measures of Central Tendency. Chapter 2 showed how to organize raw data into frequency distributions and then present the data by using.
1 PUAF 610 TA Session 2. 2 Today Class Review- summary statistics STATA Introduction Reminder: HW this week.
Describing Data Lesson 3. Psychology & Statistics n Goals of Psychology l Describe, predict, influence behavior & cognitive processes n Role of statistics.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Measures of Center.
An Introduction to Statistics. Two Branches of Statistical Methods Descriptive statistics Techniques for describing data in abbreviated, symbolic fashion.
1 Measure of Center  Measure of Center the value at the center or middle of a data set 1.Mean 2.Median 3.Mode 4.Midrange (rarely used)
INVESTIGATION 1.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
 IWBAT summarize data, using measures of central tendency, such as the mean, median, mode, and midrange.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
1 Measures of Center. 2 Measure of Center  Measure of Center the value at the center or middle of a data set 1.Mean 2.Median 3.Mode 4.Midrange (rarely.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
Lecture 4 Dustin Lueker.  The population distribution for a continuous variable is usually represented by a smooth curve ◦ Like a histogram that gets.
1 Descriptive Statistics Descriptive Statistics Ernesto Diaz Faculty – Mathematics Redwood High School.
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Chapter 3 Data Description Section 3-2 Measures of Central Tendency.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
LIS 570 Summarising and presenting data - Univariate analysis.
Chapter 2 Descriptive Statistics
Unit 2 Section 2.3. What we will be able to do throughout this part of the chapter…  Use statistical methods to summarize data  The most familiar method.
1 STAT 500 – Statistics for Managers STAT 500 Statistics for Managers.
Honors Statistics Chapter 3 Measures of Variation.
Measures of Central Tendency (MCT) 1. Describe how MCT describe data 2. Explain mean, median & mode 3. Explain sample means 4. Explain “deviations around.
Data Description Chapter 3. The Focus of Chapter 3  Chapter 2 showed you how to organize and present data.  Chapter 3 will show you how to summarize.
Chapter 3 – Data Description Section 3.1 – Measures of Central Tendency.
Data Description Note: This PowerPoint is only a summary and your main source should be the book. Lecture (8) Lecturer : FATEN AL-HUSSAIN.
Data Description Note: This PowerPoint is only a summary and your main source should be the book. Instructor: Alaa saud.
Slide 1 Copyright © 2004 Pearson Education, Inc.  Descriptive Statistics summarize or describe the important characteristics of a known set of population.
Chapter 3 Numerical Descriptive Measures. 3.1 Measures of central tendency for ungrouped data A measure of central tendency gives the center of a histogram.
Describing, Exploring and Comparing Data
Summarizing Scores With Measures of Central Tendency
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
Midrange (rarely used)
An Introduction to Statistics
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Numerical Descriptive Measures
Descriptive Statistics Healey Chapters 3 and 4 (1e) or Ch. 3 (2/3e)
Chapter 3 Data Description
Numerical Descriptive Measures
Presentation transcript:

Data Description Chapter 3

Outline 3-1 Introduction 3-2 Measures of Central Tendency 3-3 Measures of Variation 3-4 Measures of Position 3-5 Exploratory Data Analysis 3-6 Summary Outline

Important Characteristics of Data Center: a representative or average value that indicates where the middle of the data set is located Variation: a measure of the amount that the values vary among themselves Distribution: the nature or shape of the distribution of data (such as bell-shaped, uniform, or skewed) Outliers: Sample values that lie very far away from the majority of other sample values Time: Changing characteristics of data over time Important Characteristics of Data

Section 3-1 Introduction The most common characteristic to measure is the center the dataset. Often people talk about the AVERAGE. “‘Average’ when you stop to think about it is a funny concept. Although it describes all of us it describes none of us…. While none of us wants to be the average American, we all want to know about him or her.” Mike Feinsilber &William Meed, American Averages Examples The average American man is five feet, nine inches tall; the average woman is five feet, 3.6 inches On the average day, 24 million people receive animal bites Section 3-1 Introduction

Measures of Average are also called the Measures of Central Tendency “Average” Is ambiguous, since several different methods can be used to obtain an average Loosely stated, the average means the center of the distribution or the most typical case Measures of Average are also called the Measures of Central Tendency Mean Median Mode Midrange

Is an average enough to describe a data set? Consider: A shoe store owner knows that the average size of a man’s shoe is size 10, but she would not be in business very long if she ordered only size 10 shoes So, what else do we need to know? We need to know how the data are dispersed—do they cluster around the center or are they spread more evenly throughout the distribution . How spread out are the data points? Measures of Variation or Measures of Dispersion Range Variance Standard Deviation Measures of Variation

Measures of Position We also need to know the Measures of Position Percentiles, Deciles, and Quartiles Used extensively in Psychology and Education, referred to as “Norms” These tell use where a specific data value falls within the data set or its relative position in comparison with other data values Measures of Position

Section 3-2 Measures of Central Tendency Objective(s) Summarize data using measures of central tendency, such as the mean, median, mode, and midrange Section 3-2 Measures of Central Tendency

RECALL from Chapter 1 Population Sample Consists of all subjects (human or otherwise) that are being studied A group (subgroup) of subjects randomly selected from a population RECALL from Chapter 1

Represented by GREEK letters Parameter Statistic A characteristic or numerical measurement obtained by using all the data values from a specified population Population Parameter Represented by GREEK letters A characteristic or numerical measurement obtained by using the data values from a sample Sample Statistic Represented by ROMAN (English) letters

General Rounding Guidelines When calculating the measures of central tendency, variation, or position, do NOT round intermediately. Round only the final answer Rounding intermediately tends to increase the difference between the calculated value and the actual “exact” value Round measures of central tendency and variation to one more decimal place than occurs in the raw data For example, if the raw data are given in whole numbers, then measures should be rounded to nearest tenth. If raw data are given in tenths, then measures should be rounded to nearest hundredth. General Rounding Guidelines

Measures of Central Tendency) Measures of Center is the data value(s) at the center or middle of a data set Mean Median Mode Midrange We will consider the definition, calculation (formula), advantages, disadvantages, properties, and uses for each measure of central tendency Measures of Central Tendency)

Mean AKA Arithmetic Average Is found by adding the data values and dividing by the total number of values In general, mean is the most important of all numerical measurements used to describe data Is what most people call an “average” Is unique and in most cases, but is not necessarily an actual data value Varies less than the median or mode when samples are taken from the same population and all three measures are computed for those samples Is used in computing other statistics, such as variance Is affected by extremely high or low values (outliers) and may not be the appropriate average to use in those situations Mean

Mean ----Formula Notation ∑ (sigma) denotes the sum of a set of values x is the variable usually used to represent the individual data values n represents the number of values in a sample N represents the number of values in a population Mean of a set of sample values (read as x-bar) Mean of all values in a population (read as “mu”)

The number of highway miles per gallon of the 10 worst vehicles is given: 12 15 13 14 15 16 17 16 17 18 Find the mean. Mean ---Example

Is the middle value when the raw data values are arranged in order from smallest to largest or vice versa Is used when one must find the center or midpoint of a data set Is used when one must determine whether the data values fall into the upper half or lower half of the distribution Is affected less than the mean by extremely high or love values Does not have to be an original data value Various notations: MD, Med, Median

Finding the Median Arrange data in order from smallest to largest Odd Number of Data Values (n is odd) Even Number of Data Values (n is even) Arrange data in order from smallest to largest Find the data value in the “exact” middle Arrange data in order from smallest to largest Find the mean of the TWO middle numbers (there is no “exact” middle) Finding the Median

The number of highway miles per gallon of the 10 worst vehicles is given: 12 15 13 14 15 16 17 16 17 18 Find the median. Median ---Example

Measured amounts of lead (in mg/m3) in the air are given: 5.40 1.10 0.42 0.73 0.48 1.10 0.66 Find the median Median – Example #2

Mode Is the data value(s) that occurs most often in a data set Sometimes said to be the most typical case Is the easiest average to compute Cane be used when the data are nominal, such as religious preference, gender, or political affiliation Is not always unique. A data set can have more than one mode, or the mode may not exist for a data set Has no “special” symbol Look for the number(s) that occur the most often in the data set Mode

The number of highway miles per gallon of the 10 worst vehicles is given: 12 15 13 14 15 16 17 16 17 18 Find the mode. Mode ---Example

Measured amounts of lead (in mg/m3) in the air are given: 5.40 1.10 0.42 0.73 0.48 1.10 0.66 Find the mode. Mode – Example #2

Midrange Is a rough estimate of the midpoint for the data set Is found by adding the lowest and highest data values and dividing by 2 Is easy to compute Gives the midpoint Is affected by extremely high or low data values Is rarely used Is denoted by MR Midrange

The number of highway miles per gallon of the 10 worst vehicles is given: 12 15 13 14 15 16 17 16 17 18 Find the midrange. Midrange ---Example

Which Measure of Central Tendency is best? There is no single best answer to that question because there are no objective criteria for determining the most representative measure for all data sets Avoid the term “average” , instead use the actual measure of central tendency that is calculated (mean, median, mode, or midrange) Use the advantages and disadvantages stated above to decide which measure of central tendency is best. Which Measure of Central Tendency is best?

A comparison of the mean, median, and mode can reveal information about the distribution shape RECALL: (p. 56) A bell-shaped (normal) distribution is symmetric Data values are evenly distributed on both sides of the mean Unimodal (one peak) Mean ≈ Median ≈ Mode Making Connections

Right-skewed (or positively) distribution has the majority of data values fall to the left of the mean and cluster at the lower end of the distribution; the “tail” is to the right Mode < Median < Mean Median is the “center” point Making Connections

Left-skewed (or negatively) distribution has the majority of data values to the right of the mean and cluster at the upper end of the distribution, with the tail to the left Mean < Median < Mode Making Connections

Example- Find the mean, median, mode, and midrange for the ages of NASCAR Nextel Cup Drivers Is the distribution symmetric, left-skewed, or right-skewed? Using Technology

Ages of NASCAR Nextel Cup Drivers in Years (NASCAR Ages of NASCAR Nextel Cup Drivers in Years (NASCAR.com) (Data is ranked---Collected Spring 2008) 21 23 24 25 26 27 28 29 30 31 32 34 35 36 37 38 39 41 42 43 44 45 46 47 48 49 50 51 65 72