Statistics 270 - Lecture 3. Last class: types of quantitative variable, histograms, measures of center, percentiles and measures of spread…well, we shall.

Slides:



Advertisements
Similar presentations
Describing Quantitative Variables
Advertisements

DESCRIBING DISTRIBUTION NUMERICALLY
Descriptive Measures MARE 250 Dr. Jason Turner.
Statistics 100 Lecture Set 6. Re-cap Last day, looked at a variety of plots For categorical variables, most useful plots were bar charts and pie charts.
Statistics 100 Lecture Set 7. Chapters 13 and 14 in this lecture set Please read these, you are responsible for all material Will be doing chapters
Measures of Variation Sample range Sample variance Sample standard deviation Sample interquartile range.
1 Chapter 1: Sampling and Descriptive Statistics.
Describing Quantitative Data with Numbers Part 2
MEASURES OF SPREAD – VARIABILITY- DIVERSITY- VARIATION-DISPERSION
Statistics Lecture 3. zLast class: histograms, measures of center, percentiles and measures of spread zHave completed Sections zToday -
1 Distribution Summaries Measures of central tendency Mean Median Mode Measures of spread Range Standard Deviation Interquartile Range (IQR)
Statistics Lecture 2. Last class began Chapter 1 (Section 1.1) Introduced main types of data: Quantitative and Qualitative (or Categorical) Discussed.
Understanding and Comparing Distributions
Chapter 2 Describing Data with Numerical Measurements
AP Statistics Chapters 0 & 1 Review. Variables fall into two main categories: A categorical, or qualitative, variable places an individual into one of.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 12 Describing Data.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Section 2.4 Measures of Variation.
Why use boxplots? ease of construction convenient handling of outliers construction is not subjective (like histograms) Used with medium or large size.
LECTURE 12 Tuesday, 6 October STA291 Fall Five-Number Summary (Review) 2 Maximum, Upper Quartile, Median, Lower Quartile, Minimum Statistical Software.
Chapter 12: Describing Distributions with Numbers We create graphs to give us a picture of the data. We also need numbers to summarize the center and spread.
Numerical Descriptive Techniques
Methods for Describing Sets of Data
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
LECTURE 8 Thursday, 19 February STA291 Fall 2008.
1.3: Describing Quantitative Data with Numbers
Copyright ©2006 Brooks/Cole, a division of Thomson Learning, Inc. Turning Data Into Information Chapter 2.
CHAPTER 7: Exploring Data: Part I Review
Chapter 3 Descriptive Statistics: Numerical Methods Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
1 MATB344 Applied Statistics Chapter 2 Describing Data with Numerical Measures.
What is variability in data? Measuring how much the group as a whole deviates from the center. Gives you an indication of what is the spread of the data.
McGraw-Hill/Irwin Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 3 Descriptive Statistics: Numerical Methods.
Applied Quantitative Analysis and Practices LECTURE#09 By Dr. Osman Sadiq Paracha.
Lecture 5 Dustin Lueker. 2 Mode - Most frequent value. Notation: Subscripted variables n = # of units in the sample N = # of units in the population x.
Categorical vs. Quantitative…
1 Chapter 4: Describing Distributions 4.1Graphs: good and bad 4.2Displaying distributions with graphs 4.3Describing distributions with numbers.
 The mean is typically what is meant by the word “average.” The mean is perhaps the most common measure of central tendency.  The sample mean is written.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 5 Describing Distributions Numerically.
Chapter 5: Boxplots  Objective: To find the five-number summaries of data and create and analyze boxplots CHS Statistics.
Statistics: Unlocking the Power of Data Lock 5 STAT 250 Dr. Kari Lock Morgan Describing Data: One Quantitative Variable SECTIONS 2.2, 2.3 One quantitative.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
Statistics topics from both Math 1 and Math 2, both featured on the GHSGT.
Chapter 1: Exploring Data, cont. 1.2 Describing Distributions with Numbers Measuring Center: The Mean Most common measure of center Arithmetic average,
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
Describe Quantitative Data with Numbers. Mean The most common measure of center is the ordinary arithmetic average, or mean.
MAT 135 Introductory Statistics and Data Analysis Adjunct Instructor
Methods for Describing Sets of Data
Statistics 200 Lecture #4 Thursday, September 1, 2016
Mean, Median, Mode and Standard Deviation (Section 11-1)
Probability and Statistics for Computer Scientists Second Edition, By: Michael Baron Chapter 8: Introduction to Statistics CIS Computational Probability.
Describing Distributions Numerically
Summary Statistics 9/23/2018 Summary Statistics
Chapter 2b.
Lecture 2 Chapter 3. Displaying and Summarizing Quantitative Data
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Describing Distributions Numerically
Data Analysis and Statistical Software I Quarter: Spring 2003
Chapter 1: Exploring Data
Describing Distributions Numerically
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
The Five-Number Summary
Advanced Algebra Unit 1 Vocabulary
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Lesson Plan Day 1 Lesson Plan Day 2 Lesson Plan Day 3
Presentation transcript:

Statistics Lecture 3

Last class: types of quantitative variable, histograms, measures of center, percentiles and measures of spread…well, we shall finish these today Will have completed Chapter 1 Assignment #1: Chapter 1, questions: 6, 20b, 26, 36b-d, 48, 60 Some suggested problems: Chapter 1: 1, 5, 13 or 14 (DO histogram), 19, 26, 29, 33

Measures of Spread (cont.) 5 number summary often reported: Min, Q 1, Q 2 (Median), Q 3, and Max Summarizes both center and spread What proportion of data lie between Q 1 and Q 3 ?

Box-Plot Displays 5-number summary graphically Box drawn spanning quartiles Line drawn in box for median Lines extend from box to max. and min values. Some programs draw whiskers only to 1.5*IQR above and below the quartiles

Can compare distributions using side-by-side box-plots What can you see from the plot?

Other Common Measure of Spread: Sample Variance Sample variance of n observations: Can be viewed as roughly the average squared deviation of observations from the sample mean Units are in squared units of data

Sample Standard Deviation Sample standard deviation of n observations: Can be viewed as roughly the average deviation of observations from the sample mean Has same units as data

Exercise Compute the sample standard deviation and variance for the Muzzle Velocity Example

Variance and standard deviation are most useful when measure of center is As observations become more spread out, s : increases or decreases? Both measures sensitive to outliers 5 number summary is better than the mean and standard deviation for describing (I) skewed distributions; (ii) distributions with outliers

Population and Samples Important to distinguish between the population and a sample from the population A sample consisting of the entire population is called a What is the difference between the population mean and the sample mean? The population variance ( or std. deviation) and that of the population Population median and sample median?

Empirical Rule for Bell-Shaped Distributions Approximately 68% of the data lie in the interval 95% of the data lie in the interval Can use these to help determine range of typical values or to identify potential outliers

Example…Putting this all together A geyser is a hot spring that becomes unstable and erupts hot gases into the air. Perhaps the most famous of these is Wyoming's Old Faithful Geyser. Visitors to Yellowstone park most often visit Old Faithful to see it erupt. Consequently, it is of great interest to be able to predict the interval time of the next eruption.

Example…Putting this all together Consider a sample of 222 interval times between eruptions (Weisberg, 1985). The first few lines of the available data are: Goal: Help predict the interval between eruptionsConsider a variety of plots that may shed some light upon the nature of the intervals between eruptions

Example…Putting this all together Goal: Help predict the interval between eruptions Consider a histogram to shed some light upon the nature of the intervals between eruptions

Example…Putting this all together

What does the box-plot show? Is a box-plot useful at showing the main features of these data? What does the empirical rule tell us about 95% of the data? Is this useful? We will come back to this in a minute…

Scatter-Plots Help assess whether there is a relationship between 2 continuous variables, Data are paired (x 1, y 1 ), (x 2, y 2 ),... (x n, y n ) Plot X versus Y If there is no natural pairing…probably not a good idea! What sort of relationships might we see?

Example…Putting this all together What does this plot reveal?

Example…Putting this all together

Suppose an eruption of 2.5 minutes had just taken place. What would you estimate the length of the next interval to be? Suppose an eruption of 3.5 minutes had just taken place. What would you estimate the length of the next interval to be?