Skewness and choice of data analysis

Slides:



Advertisements
Similar presentations
AP Stat Day Days until AP Exam
Advertisements

EQ: How can we summarize and compare data? MM1D3a Monday – 8/29/11 Math 1: Unit 1 – Day 5.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 5- 1.
Measures of Variation. Median, Quartiles, Inter-Quartile Range and Box Plots. Measures of Spread Remember: The range is the measure of spread that goes.
Box plot Edexcel S1 Mathematics 2003 (or box and whisker plot)
Box and Whiskers with Outliers. Outlier…… An extremely high or an extremely low value in the data set when compared with the rest of the values. The IQR.
Chapter 3, Numerical Descriptive Measures
Quantitative Analysis (Statistics Week 8)
Ex1E median, the interquartile range, the range and the mode.
Quantitative Methods in Social Research 2010/11 Week 5 (morning) session 11th February 2011 Descriptive Statistics.
Five Number Summary and Box Plots
DESCRIBING DISTRIBUTION NUMERICALLY
CHAPTER 1 Exploring Data
A.P. Psychology Statistics Notes. Correlation  The way 2 factors vary together and how well one predicts the other  Positive Correlation- direct relationship.
Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 3 Introduction – Slide 1 of 3 Topic 16 Numerically Summarizing Data- Averages.
Percentiles Def: The kth percentile is the value such that at least k% of the measurements are less than or equal to the value. I.E. k% of the measurements.
CHAPTER 2: Describing Distributions with Numbers
5 Number Summary Box Plots. The five-number summary is the collection of The smallest value The first quartile (Q 1 or P 25 ) The median (M or Q 2 or.
Chapter 2 Describing distributions with numbers. Chapter Outline 1. Measuring center: the mean 2. Measuring center: the median 3. Comparing the mean and.
GCSE Session 28 - Cumulative Frequency, Vectors and Standard Form.
AP Statistics Chapters 0 & 1 Review. Variables fall into two main categories: A categorical, or qualitative, variable places an individual into one of.
2.2 Measures of Central Tendency Skewing distributions.
Drawing and comparing Box and Whisker diagrams (Box plots)
CHAPTER 2: Describing Distributions with Numbers ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Modified by ARQ, from © 2002 Prentice-Hall.Chap 3-1 Numerical Descriptive Measures Chapter %20ppts/c3.ppt.
Copyright © 2005 Pearson Education, Inc. Slide 6-1.
1.3: Describing Quantitative Data with Numbers
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Section 1 Topic 31 Summarising metric data: Median, IQR, and boxplots.
1)Construct a box and whisker plot for the data below that represents the goals in a soccer game. (USE APPROPRIATE SCALE) 7, 0, 2, 5, 4, 9, 5, 0 2)Calculate.
Measure of Central Tendency Measures of central tendency – used to organize and summarize data so that you can understand a set of data. There are three.
Homework Questions. Measures of Center and Spread Unit 5, Statistics.
Displaying Quantitative Data Graphically and Describing It Numerically AP Statistics Chapters 4 & 5.
Number of Movies Frequency TOTAL: 88 The following data shows the number of movies 88 students watched one week during the summer.
Numerical Measures of Variability
Chapter 3 Looking at Data: Distributions Chapter Three
Chapter 5: Boxplots  Objective: To find the five-number summaries of data and create and analyze boxplots CHS Statistics.
Organizing Data AP Stats Chapter 1. Organizing Data Categorical Categorical Dotplot (also used for quantitative) Dotplot (also used for quantitative)
Notes Unit 1 Chapters 2-5 Univariate Data. Statistics is the science of data. A set of data includes information about individuals. This information is.
1.3 Describing Quantitative Data with Numbers Pages Objectives SWBAT: 1)Calculate measures of center (mean, median). 2)Calculate and interpret measures.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
Chapter 6: Interpreting the Measures of Variability.
Understanding and Comparing Distributions Ch. 5 Day 1 Notes AP Statistics EQ: How do we make boxplots and why? How do we compare distributions?
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
More Univariate Data Quantitative Graphs & Describing Distributions with Numbers.
4.1 Measures of Center We are learning to…analyze how adding another piece of data can affect the measures of center and spread.
Chapter 5 Describing Distributions Numerically Describing a Quantitative Variable using Percentiles Percentile –A given percent of the observations are.
Statistics Unit Test Review Chapters 11 & /11-2 Mean(average): the sum of the data divided by the number of pieces of data Median: the value appearing.
AP Statistics 5 Number Summary and Boxplots. Measures of Center and Distributions For a symmetrical distribution, the mean, median and the mode are the.
Chapter 5 : Describing Distributions Numerically I
Statistics Unit Test Review
CHAPTER 2: Describing Distributions with Numbers
CHAPTER 1 Exploring Data
Measures of central tendency
Warmup What five numbers need to be mentioned in the complete sentence you write when the data distribution is skewed?
Lesson 1: Summarizing and Interpreting Data
SWBAT: Measure center with the mean and median and spread with interquartile range. Do Now:
One Quantitative Variable: Measures of Spread
1.3 Describing Quantitative Data with Numbers
Chapter 1: Exploring Data
Measures of central tendency
AP Statistics Day 4 Objective: The students will be able to describe distributions with numbers and create and interpret boxplots.
Chapter 1: Exploring Data
Describing Distributions Numerically
Using your knowledge to describe the features of graphs
MCC6.SP.5c, MCC9-12.S.ID.1, MCC9-12.S.1D.2 and MCC9-12.S.ID.3
Describing Quantitative Distributions
The Five-Number Summary
Shape, Center, Spread.
Lesson Plan Day 1 Lesson Plan Day 2 Lesson Plan Day 3
Presentation transcript:

Skewness and choice of data analysis S1 Representing data Skewness and choice of data analysis

Skewness The first distribution shown has a positive skew. This means that it has a long tail in the positive direction. The distribution below it has a negative skew since it has a long tail in the negative direction. Finally, the third distribution is symmetric and has no skew. Distributions with positive skew are sometimes called "skewed to the right" whereas distributions with negative skew are called "skewed to the left."                           

Skewness – visuals and calculations Calculate Q1, Q2, Q3, mode, mean and standard deviation Draw all 3 boxplots on one piece of graph paper Data set 1 1, 3, 5, 5, 5, 7, 10 Data set 2 2, 7, 7, 8, 12, 14, 20 Data set 3 3, 6, 7, 9, 10, 10, 11 For each data set find a relationship between the mode, median and mean using =,>,< symbols For each data set find a relationship between Q2-Q1 and Q3-Q2 Work out 3(mean-median) standard deviation

Skewness – Using the Quartiles Q2-Q1 = Q3-Q2 Q2-Q1 < Q3-Q2 Q2-Q1 > Q3-Q2                           

Skewness – Using mode, median, mean Q2-Q1 = Q3-Q2 Q2-Q1 < Q3-Q2 Q2-Q1 > Q3-Q2                            Mode=median=mean Mode<median<mean Mode>median>mean

Skewness calculations You can calculate 3(mean-median) Standard deviation This gives you a value to tell you how skewed the data are. The closer the number to zero the more symmetrical the data Negative value means the data has a negative skew and vice versa

Comparing data sets You should always compare data sets using a measure of location (mean, median, mode) a measure of spread (range, IQR, standard deviation) skewness Range gives a rough idea of spread, but is affected by extreme values. Generally only used with small data groups IQR not affected by extreme values Tells you the spread of middle 50% Often used in conjunction with median Mean and standard deviation generally used when data are fairly symmetrical data size is reasonably large