Section 4.1: Describing the Center of a Data Set.

Slides:



Advertisements
Similar presentations
Descriptive Measures MARE 250 Dr. Jason Turner.
Advertisements

1 1 Slide © 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
SECTION 3.3 MEASURES OF POSITION Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Calculating & Reporting Healthcare Statistics
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Measures of Central Tendency MARE 250 Dr. Jason Turner.
Statistics Lecture 2. Last class began Chapter 1 (Section 1.1) Introduced main types of data: Quantitative and Qualitative (or Categorical) Discussed.
Slides by JOHN LOUCKS St. Edward’s University.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Introduction to Statistics Chapter 3 Using Statistics to summarize.
Chapter 3, Part 1 Descriptive Statistics II: Numerical Methods
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
QBM117 Business Statistics
Chapter 3 Descriptive Measures
Means & Medians Chapter 5. Parameter - ► Fixed value about a population ► Typical unknown.
Describing distributions with numbers
Describing Data from One Variable
 Roughly describes where the center of the data is in the set.  2 Important measures of center: a) Mean b) Median.
Numerical Methods for Describing Data Distributions
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 1 Chapter 4 Numerical Methods for Describing Data.
© 2008 Brooks/Cole, a division of Thomson Learning, Inc. 1 Chapter 4 Numerical Methods for Describing Data.
STAT 211 – 019 Dan Piett West Virginia University Lecture 1.
1 1 Slide Descriptive Statistics: Numerical Measures Location and Variability Chapter 3 BA 201.
STAT 280: Elementary Applied Statistics Describing Data Using Numerical Measures.
IT Colleges Introduction to Statistical Computer Packages Lecture 3 Eng. Heba Hamad week
Describing distributions with numbers
Lecture 15 Sections 5.1 – 5.2 Wed, Sep 27, 2006 Measuring Center.
According to researchers, the average American guy is 31 years old, 5 feet 10 inches, 172 pounds, works 6.1 hours daily, and sleeps 7.7 hours. These numbers.
INVESTIGATION 1.
Unit 3 Lesson 2 (4.2) Numerical Methods for Describing Data
Econ 3790: Business and Economics Statistics Instructor: Yogesh Uppal
LECTURE CENTRAL TENDENCIES & DISPERSION POSTGRADUATE METHODOLOGY COURSE.
Chapter 2 Descriptive Statistics Section 2.3 Measures of Variation Figure 2.31 Repair Times for Personal Computers at Two Service Centers  Figure 2.31.
Means & Medians Chapter 4. Parameter - Fixed value about a population Typical unknown.
Essential Statistics Chapter 21 Describing Distributions with Numbers.
Chapter 3, Part A Descriptive Statistics: Numerical Measures n Measures of Location n Measures of Variability.
Chapter 2 Describing Distributions with Numbers. Numerical Summaries u Center of the data –mean –median u Variation –range –quartiles (interquartile range)
1 Review Sections 2.1, 2.2, 1.3, 1.4, 1.5, 1.6 in text.
Numerical Measures. Measures of Central Tendency (Location) Measures of Non Central Location Measure of Variability (Dispersion, Spread) Measures of Shape.
1 Chapter 4 Numerical Methods for Describing Data.
BPS - 5th Ed. Chapter 21 Describing Distributions with Numbers.
Data Summary Using Descriptive Measures Sections 3.1 – 3.6, 3.8
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
Introductory Statistics Lesson 2.5 A Objective: SSBAT find the first, second and third quartiles of a data set. SSBAT find the interquartile range of a.
Lecture 15 Sections 5.1 – 5.2 Mon, Feb 11, 2008 Measuring Center.
Honors Statistics Chapter 3 Measures of Variation.
Measures of Central Tendency, Dispersion, IQR and Standard Deviation How do we describe data using statistical measures? M2 Unit 4: Day 1.
CHAPTER 3 – Numerical Techniques for Describing Data 3.1 Measures of Central Tendency 3.2 Measures of Variability.
Math 201: Chapter 2 Sections 3,4,5,6,7,9.
Numerical Methods for Describing Data
1.4 Measures of Variability
Measures of Center Section 3.1.
Means & Medians Chapter 4.
Means & Medians Chapter 4.
Means & Medians Chapter 5.
Means & Medians Chapter 4.
Numerical Methods for Describing Data
Means & Medians Chapter 4.
Basic Practice of Statistics - 3rd Edition
Chapter 1: Exploring Data
Lecture 15 Sections 5.1 – 5.2 Mon, Feb 11, 2008
Lecture 15 Sections 5.1 – 5.2 Fri, Oct 1, 2004
Means & Medians Chapter 5.
Essential Statistics Describing Distributions with Numbers
Basic Practice of Statistics - 3rd Edition
Basic Practice of Statistics - 3rd Edition
Means & Medians.
Mean.
Means & Medians Chapter 4.
Lecture 15 Sections 5.1 – 5.2 Tue, Feb 14, 2006
Lecture 17 Sections 5.1 – 5.2 Wed, Feb 14, 2007
Presentation transcript:

Section 4.1: Describing the Center of a Data Set

Two most popular measures of center are the mean and the median We will look at them separately, then compare the two.

Mean – the average (sometimes called the sample mean) Sample mean – denoted by:

Example: Range of Motion After Knee Surgery Traumatic knee dislocation often requires surgery to repair ruptured ligaments. One measure of recovery is range of motion. The article “Reconstruction of the Anterior and Posterior Cruciate Ligaments After Knee Dislocation” reported the following postsurgical range of motion for a sample of 13 patients:

Range of Motion (degrees) X 1 = 154 x 2 = 142 x 3 = 137 x 4 = 133 x 5 = 122 X 6 = 126 x 7 = 135 x 8 = 135 x 9 = 108 x 10 = 120 X 11 = 127 x 12 = 134 x 13 = 122

Population mean – denoted by μ is the average of all x values in the entire populaton.

Example: County Population Sizes The 50 states plus the District of Columbia contain 3137 counties. Let x denote the number of residents of a country. Then there are 3137 values of the variable x in the population. The sum of these 3137 values is 248,709,873 (1990 census), so the population average value of x is:

One potential drawback to the mean as a measure of center is an outlier. Outlier – an unusually large or small observation in the data set

Example: Number of Visits to a class website Forty students were enrolled in a section of STAT 130, a general education course in statistical reasoning. One month after the course began, the instructor requested a report that indicated how many times each student had accessed a web page on the class site. The 40 observations were:

The sample mean for the data set is 23.10

Median – the middle value in the list Sample median – obtained by first ordering the n observations from smallest to largest (with any repeated values included, so that every sample observation appears in the ordered list). –The single middle value if n is odd –The average of the middle two values if n is even

Example: Website data revisited The sample size for the website access data was n = 40, an even number. The median is the average of the 20 th and 21 st values (arrange the data in order from least to greatest)

The median can now be determined: Median = = 13 2 This value appears to be more typical than 23.1

Population median – the middle value of the ordered list consisting of all population observations.

Comparing Mean and Median Symmetric – mean = median Longer upper tailed (positive skew) – mean is greater than the median Negatively skewed – mean is smaller than the median

Sample Proportion of Success Where s is the label used for the response designated as success

Example: Tampering with Automobile Antipollution Equipment The use of antipollution equipment on automobiles has substantially improved air quality in certain areas. Unfortunately, many car owners have tampered with smog control devices to improve performance. Suppose that a sample of n=15 cars is selected and that each car is classified as S or F, according to whether or not tampering has taken place. The resulting data are:

S F S S S F F S S F S S S F F This sample contains nine S’s so: p = 9 = That is 60% of the sample responses are S’s.

Population proportion of S’s = π (not 3.14) Trimmed mean – computed by first ordering the data values from smallest to largest, then deleting a selected number of values from each end of the ordered list, and finally averaging the remaining values. Trimming Percentage – Is the percentage of values deleted from each end of the ordered list

Example: Alcohol Exposure Alcohol Exposure in seconds Let’s trim 10% off the mean. You will take away the three smallest and three largest numbers

New data values are: We deleted three zeros, 76, 123, and 414 The 10% trimmed mean is 18