More Univariate Data Quantitative Graphs & Describing Distributions with Numbers.

Slides:



Advertisements
Similar presentations
Histograms Bins are the bars Counts are the heights Relative Frequency Histograms have percents on vertical axis.
Advertisements

Describing Quantitative Variables
DESCRIBING DISTRIBUTION NUMERICALLY
Descriptive Measures MARE 250 Dr. Jason Turner.
Displaying & Summarizing Quantitative Data
It’s an outliar!.  Similar to a bar graph but uses data that is measured.
Homework Questions. Quiz! Shhh…. Once you are finished you can work on the warm- up (grab a handout)!
CHAPTER 2: Describing Distributions with Numbers
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Chapter 2 Describing distributions with numbers. Chapter Outline 1. Measuring center: the mean 2. Measuring center: the median 3. Comparing the mean and.
AP Statistics Chapters 0 & 1 Review. Variables fall into two main categories: A categorical, or qualitative, variable places an individual into one of.
Describing distributions with numbers
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
Chapter 1 Exploring Data
CHAPTER 2: Describing Distributions with Numbers ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Let’s Review for… AP Statistics!!! Chapter 1 Review Frank Cerros Xinlei Du Claire Dubois Ryan Hoshi.
Chapter 1 – Exploring Data YMS Displaying Distributions with Graphs xii-7.
Chapter 1: Exploring Data AP Stats, Questionnaire “Please take a few minutes to answer the following questions. I am collecting data for my.
Have out your calculator and your notes! The four C’s: Clear, Concise, Complete, Context.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
What is Statistics? Statistics is the science of collecting, analyzing, and drawing conclusions from data –Descriptive Statistics Organizing and summarizing.
Describing distributions with numbers
Categorical vs. Quantitative…
Displaying Quantitative Data Graphically and Describing It Numerically AP Statistics Chapters 4 & 5.
Unit 4 Statistical Analysis Data Representations.
Statistics Chapter 1: Exploring Data. 1.1 Displaying Distributions with Graphs Individuals Objects that are described by a set of data Variables Any characteristic.
To be given to you next time: Short Project, What do students drive? AP Problems.
Chapter 3 Looking at Data: Distributions Chapter Three
MMSI – SATURDAY SESSION with Mr. Flynn. Describing patterns and departures from patterns (20%–30% of exam) Exploratory analysis of data makes use of graphical.
Organizing Data AP Stats Chapter 1. Organizing Data Categorical Categorical Dotplot (also used for quantitative) Dotplot (also used for quantitative)
BPS - 5th Ed. Chapter 21 Describing Distributions with Numbers.
Notes Unit 1 Chapters 2-5 Univariate Data. Statistics is the science of data. A set of data includes information about individuals. This information is.
LIS 570 Summarising and presenting data - Univariate analysis.
Describe Quantitative Data with Numbers. Mean The most common measure of center is the ordinary arithmetic average, or mean.
+ Chapter 1: Exploring Data Section 1.1 Displaying Quantitative Data with Graphs Dotplots, Stemplots and Shapes.
UNIT ONE REVIEW Exploring Data.
CHAPTER 1 Exploring Data
1.3 Measuring Center & Spread, The Five Number Summary & Boxplots
CHAPTER 1 Exploring Data
1st Semester Final Review Day 1: Exploratory Data Analysis
CHAPTER 1 Exploring Data
DAY 3 Sections 1.2 and 1.3.
Chapter 5: Describing Distributions Numerically
Describing Distributions of Data
Describing Distributions with Numbers
Drill {A, B, B, C, C, E, C, C, C, B, A, A, E, E, D, D, A, B, B, C}
Warmup Draw a stemplot Describe the distribution (SOCS)
Displaying Distributions with Graphs
Displaying and Summarizing Quantitative Data
POPULATION VS. SAMPLE Population: a collection of ALL outcomes, responses, measurements or counts that are of interest. Sample: a subset of a population.
Displaying and Summarizing Quantitative Data
Organizing Data AP Stats Chapter 1.
Means & Medians.
Chapter 1: Exploring Data
Exploratory Data Analysis
Chapter 1: Exploring Data
Honors Statistics Review Chapters 4 - 5
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Describing Distributions with Numbers
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Types of variables. Types of variables Categorical variables or qualitative identifies basic differentiating characteristics of the population.
Chapter 1: Exploring Data
Presentation transcript:

More Univariate Data Quantitative Graphs & Describing Distributions with Numbers

Quantitative Data Quantitative variables take numerical values for which it makes sense to do arithmetic operations like adding or averaging. Possible Graphs: dotplots, stemplots, histograms, Cumulative frequency plots, boxplots

Graphs Be sure to always: *Title your graphs *Label your axis including units of measure *number your axes in a consistent and reasonable manner

Quantitative Graphs Histograms A histogram’s vertical axis is counts while a relative frequency histogram’s vertical axis is percents.

Stem & Leaf This type of graph uses place values as the stems & units as the leaves. (It’s very hard to describe, we are going to make one for an example.) We can also create what’s called a back-to-back stem plot with two data sets. It is helpful for comparing to sets of univariate data. Quantitative Graphs

A histogram is preferred sometimes for larger data sets. It’s strongest asset is that it shows shape well. It’s weakness is that the individual data values are lost. A stem & leaf is preferred sometimes because it retains all data values but it’s very difficult to create for large data sets. Quantitative Graphs

Quantitative Data The distribution of a variable tells us what values the variable typically takes and how often it takes them. It is a generalization about the variable values.

When describing any Quantitative distribution: C – Center U – Unusual Features S – Shape S – Spread & B – Be S - Specific

Common Shapes of distributions/graphs Symmetric Skewed to the right Skewed to the left Bimodal Uniform

Once you have chosen a shape, you choose a measure of center and spread based on that shape.

Center when the distribution is symmetric Mean: the average formula:

Measure Spread or Variability when the distribution is Symmetric Standard deviation:

Measure of Center when the distribution is not symmetric: Median – the middle value in an ordered list. If there are two values in the middle, then average them.

Measure Spread or Variability when the distribution is not Symmetric We can also examine spread by looking at the range of middle 50% of the data. This is called the: Interquartile Range (IQR). IQR = Q3 – Q1

We also need to talk about the 5-number summary. The 5-number summary is made up of the minimum, the first quartile, Q1 (where 25% of the data lies below this value), the median, the third quartile, Q3 (where 75% of the data lies below this value), and the maximum.

Another Measure of Spread or Variability Range – the difference between the maximum and the minimum observations. This is the simplest measure of spread. We typically use this as preliminary information or if it is the only measure of spread we can calculate.

Another measure of spread or variability Variance is the average of the squares of the deviations of the observations from their mean. It is the standard deviation squared.

An outlier is an individual observation in data that falls outside the overall pattern of the data.

Using the IQR, we can perform a test for outliers. Outlier Test: Any value below Q1 – 1.5(IQR) or above Q (IQR) is considered an outlier.

Another Graph… When we graph the five-number summary along with outliers if present, it leads to a modified boxplot.

Measures that are not strongly affected by extreme values are said to be resistant. The median and IQR are more resistant than the mean and standard deviation. The standard deviation, is even less resistant than the mean.

Measures of Spread or Variability – Why? We measure spread because it’s an important description of what is happening with the data. We need to know about the amount of variation we can expect in a data set.