Chapter 3 Section 4 Measures of Position.

Slides:



Advertisements
Similar presentations
Statistical Reasoning for everyday life
Advertisements

DESCRIBING DISTRIBUTION NUMERICALLY
Chapter 2 Exploring Data with Graphs and Numerical Summaries
Probabilistic & Statistical Techniques
Unit 16: Statistics Sections 16AB Central Tendency/Measures of Spread.
Chapter 3 Numerically Summarizing Data Insert photo of cover.
Descriptive Statistics
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
MEASURES OF SPREAD – VARIABILITY- DIVERSITY- VARIATION-DISPERSION
BCOR 1020 Business Statistics
Numerically Summarizing Data
1 Distribution Summaries Measures of central tendency Mean Median Mode Measures of spread Range Standard Deviation Interquartile Range (IQR)
Chapter In Chapter 3… … we used stemplots to look at shape, central location, and spread of a distribution. In this chapter we use numerical summaries.
Unit 3 Section 3-4.
Rules of Data Dispersion By using the mean and standard deviation, we can find the percentage of total observations that fall within the given interval.
Methods for Describing Sets of Data
Exploration of Mean & Median Go to the website of “Introduction to the Practice of Statistics”website Click on the link to “Statistical Applets” Select.
Descriptive Statistics Measures of Variation. Essentials: Measures of Variation (Variation – a must for statistical analysis.) Know the types of measures.
Part II  igma Freud & Descriptive Statistics Chapter 3 Viva La Difference: Understanding Variability.
Chapter 3 Numerically Summarizing Data 3.4 Measures of Location.
Chapter 3: Averages and Variation Section 4: Percentiles and Box- and-Whisker Plots.
What is variability in data? Measuring how much the group as a whole deviates from the center. Gives you an indication of what is the spread of the data.
Section 1 Topic 31 Summarising metric data: Median, IQR, and boxplots.
Measures of Position. ● The standard deviation is a measure of dispersion that uses the same dimensions as the data (remember the empirical rule) ● The.
Measures of Dispersion How far the data is spread out.
Box-And-Whisker Plots By Virginia Vimpeny Lewis. What information do we need? Minimum data value Lower Quartile Median Upper Quartile Maximum data value.
Numerical Measures of Variability
 The mean is typically what is meant by the word “average.” The mean is perhaps the most common measure of central tendency.  The sample mean is written.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 3 Section 4 – Slide 1 of 23 Chapter 3 Section 4 Measures of Position.
Math 15 – Elementary Statistics Chapter 5 – Part 2 Summarizing Data Numerically.
Chapter 5: Measures of Dispersion. Dispersion or variation in statistics is the degree to which the responses or values obtained from the respondents.
Descriptive Statistics Chapter 2. § 2.5 Measures of Position.
Statistics topics from both Math 1 and Math 2, both featured on the GHSGT.
Variability Introduction to Statistics Chapter 4 Jan 22, 2009 Class #4.
MODULE 3: DESCRIPTIVE STATISTICS 2/6/2016BUS216: Probability & Statistics for Economics & Business 1.
What is a box-and-whisker plot? 5-number summary Quartile 1 st, 2 nd, and 3 rd quartiles Interquartile Range Outliers.
Chapter 5 Describing Distributions Numerically Describing a Quantitative Variable using Percentiles Percentile –A given percent of the observations are.
Chapter 1 Lesson 4 Quartiles, Percentiles, and Box Plots.
Probability & Statistics Box Plots. Describing Distributions Numerically Five Number Summary and Box Plots (Box & Whisker Plots )
Chapter 3.3 – 3.4 Applications of the Standard Deviation and Measures of Relative Standing.
Lesson Measures of Position. Objectives Determine and interpret z-scores Determine and interpret percentiles Determine and interpret quartiles Check.
CHAPTER 4 NUMERICAL METHODS FOR DESCRIBING DATA What trends can be determined from individual data sets?
7 th Grade Math Vocabulary Word, Definition, Model Emery Unit 4.
Chapter 4 – Statistics II
Exploratory Data Analysis
Methods for Describing Sets of Data
Chapter 16: Exploratory data analysis: numerical summaries
Notes 13.2 Measures of Center & Spread
2.5: Numerical Measures of Variability (Spread)
Chapter 16: Exploratory data analysis: Numerical summaries
Descriptive Statistics (Part 2)
3 3 Chapter Numerically Summarizing Data
Unit 7: Statistics Key Terms
Unit 4 Statistics Review
Numerical Measures: Skewness and Location
Measures of Variation.
Lecture 2 Chapter 3. Displaying and Summarizing Quantitative Data
The absolute value of each deviation.
Section 3:3 Answers page #4-14 even
Statistics and Data (Algebraic)
Summary (Week 1) Categorical vs. Quantitative Variables
Measures of Center and Spread
Numerical Descriptive Measures
Ticket in the Door GA Milestone Practice Test
5 Number Summaries.
Describing Distributions with Numbers
Measures of Relative Standing
Box Plot Lesson 11-4.
Chapter 12 Statistics.
Presentation transcript:

Chapter 3 Section 4 Measures of Position

Chapter 3 – Section 4 Mean / median describe the “center” of the data Variance / standard deviation describe the “spread” of the data This section discusses more precise ways to describe the relative position of a data value within the entire set of data

Chapter 3 – Section 4 The standard deviation is a measure of dispersion that uses the same dimensions as the data (remember the empirical rule) The distance of a data value from the mean, calculated as the number of standard deviations, would be a useful measurement This distance is called the z-score The standard deviation is a measure of dispersion that uses the same dimensions as the data (remember the empirical rule) The standard deviation is a measure of dispersion that uses the same dimensions as the data (remember the empirical rule) The distance of a data value from the mean, calculated as the number of standard deviations, would be a useful measurement

Chapter 3 – Section 4 If the mean was 20 and the standard deviation was 6 The value 26 would have a z-score of 1.0 (1.0 standard deviation higher than the mean) The value 14 would have a z-score of –1.0 (1.0 standard deviation lower than the mean) The value 17 would have a z-score of –0.5 (0.5 standard deviations lower than the mean) The value 20 would have a z-score of 0.0 If the mean was 20 and the standard deviation was 6 The value 26 would have a z-score of 1.0 (1.0 standard deviation higher than the mean) The value 14 would have a z-score of –1.0 (1.0 standard deviation lower than the mean) The value 17 would have a z-score of –0.5 (0.5 standard deviations lower than the mean) If the mean was 20 and the standard deviation was 6 The value 26 would have a z-score of 1.0 (1.0 standard deviation higher than the mean) If the mean was 20 and the standard deviation was 6 If the mean was 20 and the standard deviation was 6 The value 26 would have a z-score of 1.0 (1.0 standard deviation higher than the mean) The value 14 would have a z-score of –1.0 (1.0 standard deviation lower than the mean)

Chapter 3 – Section 4 The population z-score is calculated using the population mean and population standard deviation The sample z-score is calculated using the sample mean and sample standard deviation The population z-score is calculated using the population mean and population standard deviation

Chapter 3 – Section 4 z-scores can be used to compare the relative positions of data values in different samples Pat received a grade of 82 on her statistics exam where the mean grade was 74 and the standard deviation was 12 Pat received a grade of 72 on her biology exam where the mean grade was 65 and the standard deviation was 10 Pat received a grade of 91 on her kayaking exam where the mean grade was 88 and the standard deviation was 6 z-scores can be used to compare the relative positions of data values in different samples Pat received a grade of 82 on her statistics exam where the mean grade was 74 and the standard deviation was 12 Pat received a grade of 72 on her biology exam where the mean grade was 65 and the standard deviation was 10 z-scores can be used to compare the relative positions of data values in different samples Pat received a grade of 82 on her statistics exam where the mean grade was 74 and the standard deviation was 12 z-scores can be used to compare the relative positions of data values in different samples

Chapter 3 – Section 4 Statistics Biology Kayaking Grade of 82 z-score of (82 – 74) / 12 = .67 Biology Grade of 72 z-score of (72 – 65) / 10 = .70 Kayaking Grade of 81 z-score of (91 – 88) / 6 = .50 Biology was the highest relative grade Statistics Grade of 82 z-score of (82 – 74) / 12 = .67 Biology Grade of 72 z-score of (72 – 65) / 10 = .70 Kayaking Grade of 81 z-score of (91 – 88) / 6 = .50 Statistics Grade of 82 z-score of (82 – 74) / 12 = .67 Biology Grade of 72 z-score of (72 – 65) / 10 = .70 Statistics Grade of 82 z-score of (82 – 74) / 12 = .67

Chapter 3 – Section 4 The quartiles are the 25th, 50th, and 75th percentiles Q1 = 25th percentile Q2 = 50th percentile = median Q3 = 75th percentile Quartiles are the most commonly used percentiles The 50th percentile and the second quartile Q2 are both other ways of defining the median

Chapter 3 – Section 4 Quartiles divide the data set into four equal parts The top quarter are the values between Q3 and the maximum The bottom quarter are the values between the minimum and Q1 Quartiles divide the data set into four equal parts The top quarter are the values between Q3 and the maximum Quartiles divide the data set into four equal parts Quartiles divide the data set into four equal parts Quartiles divide the data set into four equal parts

Chapter 3 – Section 4 Quartiles divide the data set into four equal parts The interquartile range (IQR) is the difference between the third and first quartiles IQR = Q3 – Q1 The IQR is a resistant measurement of dispersion

Chapter 3 – Section 4 Extreme observations in the data are referred to as outliers Outliers should be investigated Outliers could be Chance occurrences Measurement errors Data entry errors Sampling errors Outliers are not necessarily invalid data Extreme observations in the data are referred to as outliers Outliers should be investigated Extreme observations in the data are referred to as outliers Outliers should be investigated Outliers could be Chance occurrences Measurement errors Data entry errors Sampling errors

Chapter 3 – Section 4 One way to check for outliers uses the quartiles Outliers can be detected as values that are significantly too high or too low, based on the known spread The fences used to identify outliers are Lower fence = LF = Q1 – 1.5  IQR Upper fence = UF = Q3 + 1.5  IQR Values less than the lower fence or more than the upper fence could be considered outliers One way to check for outliers uses the quartiles Outliers can be detected as values that are significantly too high or too low, based on the known spread One way to check for outliers uses the quartiles Outliers can be detected as values that are significantly too high or too low, based on the known spread The fences used to identify outliers are Lower fence = LF = Q1 – 1.5  IQR Upper fence = UF = Q3 + 1.5  IQR

Chapter 3 – Section 4 Is the value 54 an outlier? Calculations 1, 3, 4, 7, 8, 15, 16, 19, 23, 24, 27, 31, 33, 54 Calculations Q1 = (4 + 7) / 2 = 5.5 Q3 = (27 + 31) / 2 = 29 IQR = 29 – 5.5 = 23.5 UF = Q3 + 1.5  IQR = 29 + 1.5  23.5 = 64 Using the fence rule, the value 54 is not an outlier Is the value 54 an outlier? 1, 3, 4, 7, 8, 15, 16, 19, 23, 24, 27, 31, 33, 54 Calculations Q1 = (4 + 7) / 2 = 5.5 Q3 = (27 + 31) / 2 = 29 IQR = 29 – 5.5 = 23.5 UF = Q3 + 1.5  IQR = 29 + 1.5  23.5 = 64 Is the value 54 an outlier? 1, 3, 4, 7, 8, 15, 16, 19, 23, 24, 27, 31, 33, 54 Calculations Q1 = (4 + 7) / 2 = 5.5 Q3 = (27 + 31) / 2 = 29 Is the value 54 an outlier? 1, 3, 4, 7, 8, 15, 16, 19, 23, 24, 27, 31, 33, 54

Summary: Chapter 3 – Section 4 z-scores Measures the distance from the mean in units of standard deviations Can compare relative positions in different samples Percentiles and quartiles Divides the data so that a certain percent is lower and a certain percent is higher Outliers Extreme values of the variable Can be identified using the upper and lower fences