Agresti/Franklin Statistics, 1 of 63  Section 2.4 How Can We Describe the Spread of Quantitative Data?

Slides:



Advertisements
Similar presentations
Chapter 3, Numerical Descriptive Measures
Advertisements

Describing Quantitative Variables
Chapter 2 Exploring Data with Graphs and Numerical Summaries
Descriptive Measures MARE 250 Dr. Jason Turner.
Agresti/Franklin Statistics, 1 of 52 Chapter 3 Association: Contingency, Correlation, and Regression Learn …. How to examine links between two variables.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 3 Association: Contingency, Correlation, and Regression Section 3.1 The Association.
Measures of Dispersion
EXPLORING DATA WITH GRAPHS AND NUMERICAL SUMMARIES
1 1 Slide © 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Chapter 3 Association: Contingency, Correlation, and Regression
Descriptive Statistics: Numerical Measures
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Ch. 2-1 Statistics for Business and Economics 7 th Edition Chapter 2 Describing Data:
1 1 Slide © 2003 South-Western/Thomson Learning TM Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
MEASURES OF SPREAD – VARIABILITY- DIVERSITY- VARIATION-DISPERSION
Basic Business Statistics 10th Edition
B a c kn e x t h o m e Classification of Variables Discrete Numerical Variable A variable that produces a response that comes from a counting process.
Chap 3-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 3 Describing Data: Numerical Statistics for Business and Economics.
1 1 Slide © 2003 South-Western/Thomson Learning TM Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Vocabulary for Box and Whisker Plots. Box and Whisker Plot: A diagram that summarizes data using the median, the upper and lowers quartiles, and the extreme.
Chapter 2 Describing Data with Numerical Measurements
Agresti/Franklin Statistics, 1 of 63 Chapter 2 Exploring Data with Graphs and Numerical Summaries Learn …. The Different Types of Data The Use of Graphs.
Numerical Descriptive Measures
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
LECTURE 12 Tuesday, 6 October STA291 Fall Five-Number Summary (Review) 2 Maximum, Upper Quartile, Median, Lower Quartile, Minimum Statistical Software.
Chapter 12: Describing Distributions with Numbers We create graphs to give us a picture of the data. We also need numbers to summarize the center and spread.
Chapter 3 - Part B Descriptive Statistics: Numerical Methods
1 1 Slide © 2001 South-Western /Thomson Learning  Anderson  Sweeney  Williams Anderson  Sweeney  Williams  Slides Prepared by JOHN LOUCKS  CONTEMPORARYBUSINESSSTATISTICS.
STAT 250 Dr. Kari Lock Morgan
1 1 Slide © 2015 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 1 Slide © 2014 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Numerical Descriptive Techniques
ASSOCIATION: CONTINGENCY, CORRELATION, AND REGRESSION Chapter 3.
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
LECTURE 8 Thursday, 19 February STA291 Fall 2008.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 3 Association: Contingency, Correlation, and Regression Section 3.2 The Association.
Applied Quantitative Analysis and Practices LECTURE#08 By Dr. Osman Sadiq Paracha.
1 Laugh, and the world laughs with you. Weep and you weep alone.~Shakespeare~
STAT 280: Elementary Applied Statistics Describing Data Using Numerical Measures.
What is variability in data? Measuring how much the group as a whole deviates from the center. Gives you an indication of what is the spread of the data.
Chapter 2 Describing Data.
1 1 Slide Slides Prepared by JOHN S. LOUCKS St. Edward’s University © 2002 South-Western/Thomson Learning.
Lecture 3 Describing Data Using Numerical Measures.
Applied Quantitative Analysis and Practices LECTURE#09 By Dr. Osman Sadiq Paracha.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 3-1 Chapter 3 Numerical Descriptive Measures Business Statistics, A First Course.
Chapter 2 Section 5 Notes Coach Bridges
Agresti/Franklin Statistics, 1 of 63 Chapter 2 Exploring Data with Graphs and Numerical Summaries Learn …. The Different Types of Data The Use of Graphs.
1 Chapter 2: Exploring Data with Graphs and Numerical Summaries Section 2.1: What Are the Types of Data?
Chapter 3, Part B Descriptive Statistics: Numerical Measures n Measures of Distribution Shape, Relative Location, and Detecting Outliers n Exploratory.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
BPS - 5th Ed. Chapter 21 Describing Distributions with Numbers.
Chapter 6: Interpreting the Measures of Variability.
Using Measures of Position (rather than value) to Describe Spread? 1.
1 Never let time idle away aimlessly.. 2 Chapters 1, 2: Turning Data into Information Types of data Displaying distributions Describing distributions.
1 Take a challenge with time; never let time idles away aimlessly.
Chapter 5 Describing Distributions Numerically Describing a Quantitative Variable using Percentiles Percentile –A given percent of the observations are.
(Unit 6) Formulas and Definitions:. Association. A connection between data values.
1 By maintaining a good heart at every moment, every day is a good day. If we always have good thoughts, then any time, any thing or any location is auspicious.
Midterm Review IN CLASS. Chapter 1: The Art and Science of Data 1.Recognize individuals and variables in a statistical study. 2.Distinguish between categorical.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc.. Chap 7-1 Day 2 Lecture Review of Descriptive Statistics.
PROBABILITY AND STATISTICS
Laugh, and the world laughs with you. Weep and you weep alone
Unit 7: Statistics Key Terms
Topic 5: Exploring Quantitative data
Quartile Measures DCOVA
Basic Practice of Statistics - 3rd Edition
Basic Practice of Statistics - 3rd Edition
Basic Practice of Statistics - 3rd Edition
Presentation transcript:

Agresti/Franklin Statistics, 1 of 63  Section 2.4 How Can We Describe the Spread of Quantitative Data?

Agresti/Franklin Statistics, 2 of 63 Measuring Spread: Range Range: difference between the largest and smallest observations

Agresti/Franklin Statistics, 3 of 63 Measuring Spread: Standard Deviation Creates a measure of variation by summarizing the deviations of each observation from the mean and calculating an adjusted average of these deviations

Agresti/Franklin Statistics, 4 of 63 Empirical Rule For bell-shaped data sets: Approximately 68% of the observations fall within 1 standard deviation of the mean Approximately 95% of the observations fall within 2 standard deviations of the mean Approximately 100% of the observations fall within 3 standard deviations of the mean

Agresti/Franklin Statistics, 5 of 63 Parameter and Statistic A parameter is a numerical summary of the population A statistic is a numerical summary of a sample taken from a population

Agresti/Franklin Statistics, 6 of 63  Section 2.5 How Can Measures of Position Describe Spread?

Agresti/Franklin Statistics, 7 of 63 Quartiles Splits the data into four parts The median is the second quartile, Q 2 The first quartile, Q 1, is the median of the lower half of the observations The third quartile, Q 3, is the median of the upper half of the observations

Agresti/Franklin Statistics, 8 of 63 Example: Find the first and third quartiles Prices per share of 10 most actively traded stocks on NYSE (rounded to nearest $) a. Q 1 = 2 Q 3 = 47 b. Q 1 = 12 Q 3 = 31 c. Q 1 = 11 Q 3 = 31 d. Q 1 =11.5 Q 3 = 32

Agresti/Franklin Statistics, 9 of 63 Measuring Spread: Interquartile Range The interquartile range is the distance between the third quartile and first quartile: IQR = Q3 – Q1

Agresti/Franklin Statistics, 10 of 63 Detecting Potential Outliers An observation is a potential outlier if it falls more than 1.5 x IQR below the first quartile or more than 1.5 x IQR above the third quartile

Agresti/Franklin Statistics, 11 of 63 The Five-Number Summary The five number summary of a dataset: Minimum value First Quartile Median Third Quartile Maximum value

Agresti/Franklin Statistics, 12 of 63 Boxplot A box is constructed from Q 1 to Q 3 A line is drawn inside the box at the median A line extends outward from the lower end of the box to the smallest observation that is not a potential outlier A line extends outward from the upper end of the box to the largest observation that is not a potential outlier

Agresti/Franklin Statistics, 13 of 63 Boxplot for Sodium Data Sodium Data: Five Number Summary: Min: Q1: Med: Q3: Max:

Agresti/Franklin Statistics, 14 of 63 Boxplot for Sodium in Cereals Sodium Data:

Agresti/Franklin Statistics, 15 of 63 Z-Score The z-score for an observation measures how far an observation is from the mean in standard deviation units An observation in a bell-shaped distribution is a potential outlier if its z-score +3

Agresti/Franklin Statistics, 16 of 63 Chapter 3 Association: Contingency, Correlation, and Regression Learn …. How to examine links between two variables

Agresti/Franklin Statistics, 17 of 63 Variables Response variable: the outcome variable Explanatory variable: the variable that explains the outcome variable

Agresti/Franklin Statistics, 18 of 63 Association An association exists between the two variables if a particular value for one variable is more likely to occur with certain values of the other variable

Agresti/Franklin Statistics, 19 of 63  Section 3.1 How Can We Explore the Association Between Two Categorical Variables?

Agresti/Franklin Statistics, 20 of 63 Example: Food Type and Pesticide Status

Agresti/Franklin Statistics, 21 of 63 Example: Food Type and Pesticide Status What is the response variable? What is the explanatory variable? Pesticides: Food Type: Yes No Organic Conventional

Agresti/Franklin Statistics, 22 of 63 Example: Food Type and Pesticide Status What proportion of organic foods contain pesticides? What proportion of conventionally grown foods contain pesticides? Pesticides: Food Type: Yes No Organic Conventional

Agresti/Franklin Statistics, 23 of 63 Example: Food Type and Pesticide Status What proportion of all sampled items contain pesticide residuals? Pesticides: Food Type: Yes No Organic Conventional

Agresti/Franklin Statistics, 24 of 63 Contingency Table The Food Type and Pesticide Status Table is called a contingency table A contingency table: Displays 2 categorical variables The rows list the categories of 1 variable The columns list the categories of the other variable Entries in the table are frequencies

Agresti/Franklin Statistics, 25 of 63 Example: Food Type and Pesticide Status Contingency Table Showing Conditional Proportions

Agresti/Franklin Statistics, 26 of 63 Example: Food Type and Pesticide Status What is the sum over each row? What proportion of organic foods contained pesticide residuals? What proportion of conventional foods contained pesticide residuals? Pesticides: Food Type: Yes No Organic Conventional

Agresti/Franklin Statistics, 27 of 63 Example: Food Type and Pesticide Status

Agresti/Franklin Statistics, 28 of 63 Example: For the following pair of variables, which is the response variable and which is the explanatory variable? College grade point average (GPA) and high school GPA a.College GPA: response variable and High School GPA : explanatory variable b.College GPA: explanatory variable and High School GPA : response variable

Agresti/Franklin Statistics, 29 of 63  Section 3.2 How Can We Explore the Association Between Two Quantitative Variables?

Agresti/Franklin Statistics, 30 of 63 Scatterplot Graphical display of two quantitative variables: Horizontal Axis: Explanatory variable, x Vertical Axis: Response variable, y

Agresti/Franklin Statistics, 31 of 63 Example: Internet Usage and Gross National Product (GDP)

Agresti/Franklin Statistics, 32 of 63 Positive Association Two quantitative variables, x and y, are said to have a positive association when high values of x tend to occur with high values of y, and when low values of x tend to occur with low values of y

Agresti/Franklin Statistics, 33 of 63 Negative Association Two quantitative variables, x and y, are said to have a negative association when high values of x tend to occur with low values of y, and when low values of x tend to occur with high values of y

Agresti/Franklin Statistics, 34 of 63 Example: Did the Butterfly Ballot Cost Al Gore the 2000 Presidential Election?

Agresti/Franklin Statistics, 35 of 63 Linear Correlation: r Measures the strength of the linear association between x and y A positive r-value indicates a positive association A negative r-value indicates a negative association An r-value close to +1 or -1 indicates a strong linear association An r-value close to 0 indicates a weak association

Agresti/Franklin Statistics, 36 of 63 Calculating the correlation, r

Agresti/Franklin Statistics, 37 of 63 Example: 100 cars on the lot of a used-car dealership Would you expect a positive association, a negative association or no association between the age of the car and the mileage on the odometer? Positive association Negative association No association