Simple Graphical Methods for Presenting and Analyzing Data

Presentation transcript:

1 Simple Graphical Methods for Presenting and Analyzing Data

2 Star/Radar/Spider Plot Figure 1: A typical radar graph with two plots

3 Spider Diagram

4 Purpose The star plot is a method of displaying multivariate data. Each star represents a single observation. Typically, star plots are generated in a multi-plot format, with many stars on each page. Star plots are used to examine the relative values for a single data point (e.g., point 3 is large for variables 2 and 4, small for variables 1, 3, 5, and 6) and to locate similar or dissimilar points.
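
The geometry behind a star can be sketched in a few lines of Python. This is only an illustration under stated assumptions: each variable is rescaled to the range 0 to 1 over the whole data set, spokes are spaced at equal angles, and the data table and the helper name `star_vertices` are hypothetical, not taken from the slides.

```python
import math


def star_vertices(values, mins, maxs):
    """Return the (x, y) vertices of one star, one spoke per variable."""
    n = len(values)
    vertices = []
    for k, (v, lo, hi) in enumerate(zip(values, mins, maxs)):
        # Rescale to [0, 1] so that spokes of different variables are comparable.
        r = 0.0 if hi == lo else (v - lo) / (hi - lo)
        # Spokes are equally spaced around the circle.
        angle = 2 * math.pi * k / n
        vertices.append((r * math.cos(angle), r * math.sin(angle)))
    return vertices


# Hypothetical observations (rows) measured on four variables (columns).
data = [
    [10, 200, 3, 40],
    [20, 100, 5, 40],
    [15, 150, 1, 80],
]
mins = [min(col) for col in zip(*data)]
maxs = [max(col) for col in zip(*data)]

# One polygon per observation; joining its vertices in order draws the star.
stars = [star_vertices(row, mins, maxs) for row in data]
```

A long spoke means the observation is large on that variable relative to the other observations, which is what makes similar and dissimilar stars easy to spot side by side.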

5 Sample Plot The plot below contains the star plots of 16 cars. The variable list for the sample star plot is: 1. Price 2. Mileage (MPG) 3. Repair Record (1 = Worst, 5 = Best) 4. Repair Record (1 = Worst, 5 = Best) 5. Headroom 6. Rear Seat Room 7. Trunk Space 8. Weight 9. Length

6

7 We can look at these plots individually, or we can use them to identify clusters of cars with similar features. Consider the star plot of the Cadillac Seville: it is one of the most expensive cars, gets below-average (but not among the worst) gas mileage, has an average repair record, and has average-to-above-average roominess and size. We can then compare the Cadillac models (the last three plots) with the AMC models (the first three plots). The AMC models tend to be inexpensive, have below-average gas mileage, and are small in height, weight, and roominess. The Cadillac models are expensive, have poor gas mileage, and are large in both size and roominess.

8 Questions The star plot can be used to answer the following questions: What variables are dominant for a given observation? Which observations are most similar, i.e., are there clusters of observations? Are there outliers?

9 Weakness in Technique Star plots are helpful for small-to-moderate-sized multivariate data sets. Their primary weakness is that their effectiveness is limited to data sets with fewer than a few hundred points. Beyond that, they tend to be overwhelming.

10 Pie Chart

11 Pie Chart

12 Pivot Chart

13 Histogram

14 Cumulative Histogram

15 Bihistogram

16 Pareto Chart

17 Pareto Chart

18 Pareto Chart

19 Box-and-Whisker Plot (1) [value missing] is the median; 52 is the lower quartile; 87 is the upper quartile; 35 is the interquartile range (IQR).
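
The summary values behind a box plot can be computed as in the minimal sketch below. The data set is hypothetical, chosen only so that its quartiles match the slide's 52 and 87. The quartile convention (median of the lower and upper halves, excluding the overall median when the count is odd) is also an assumption; other conventions can give slightly different quartiles.

```python
def median(sorted_vals):
    """Median of an already sorted list."""
    n = len(sorted_vals)
    mid = n // 2
    if n % 2:
        return sorted_vals[mid]
    return (sorted_vals[mid - 1] + sorted_vals[mid]) / 2


def five_number_summary(values):
    """Minimum, lower quartile, median, upper quartile, maximum."""
    s = sorted(values)
    n = len(s)
    lower = s[: n // 2]        # lower half, without the median when n is odd
    upper = s[(n + 1) // 2:]   # upper half, without the median when n is odd
    return {
        "min": s[0],
        "q1": median(lower),
        "median": median(s),
        "q3": median(upper),
        "max": s[-1],
    }


# Hypothetical data set (not taken from the slides).
data = [18, 27, 34, 52, 54, 59, 61, 68, 78, 82, 85, 87, 91, 93, 100]
summary = five_number_summary(data)
iqr = summary["q3"] - summary["q1"]   # interquartile range: 87 - 52 = 35
```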

20 Box-and-Whisker Plot (2)

21 Box-and-Whisker Plot (3) A useful variation of the box plot identifies outliers more specifically. To create this variation:
1. Calculate the median and the lower and upper quartiles.
2. Plot a symbol at the median and draw a box between the lower and upper quartiles.
3. Calculate the interquartile range (the difference between the upper and lower quartiles) and call it IQ.
4. Calculate the following points: L1 = lower quartile - 1.5*IQ, L2 = lower quartile - 3.0*IQ, U1 = upper quartile + 1.5*IQ, U2 = upper quartile + 3.0*IQ.
5. The line from the lower quartile to the minimum is now drawn from the lower quartile to the smallest point that is greater than L1. Likewise, the line from the upper quartile to the maximum is now drawn to the largest point smaller than U1.
6. Points between L1 and L2 or between U1 and U2 are drawn as small circles. Points less than L2 or greater than U2 are drawn as large circles.
Questions: The box plot can provide answers to the following questions: Is a factor significant? Does the location differ between subgroups? Does the variation differ between subgroups? Are there any outliers?
Importance: Check the significance of a factor. The box plot is an important EDA tool for determining whether a factor has a significant effect on the response with respect to either location or variation. The box plot is also an effective tool for summarizing large quantities of information.
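
The fence calculation and the point classification described above can be sketched as follows. The function names and the sample points are hypothetical. The slide does not say how to treat a point lying exactly on a fence; the sketch counts it with the small circles, which is an assumption.

```python
def tukey_fences(q1, q3):
    """Inner (L1, U1) and outer (L2, U2) fences computed from the quartiles."""
    iq = q3 - q1
    return {
        "L1": q1 - 1.5 * iq,
        "L2": q1 - 3.0 * iq,
        "U1": q3 + 1.5 * iq,
        "U2": q3 + 3.0 * iq,
    }


def classify(points, q1, q3):
    """Whisker ends plus the points drawn as small and large circles."""
    f = tukey_fences(q1, q3)
    # Points strictly inside the inner fences determine the whisker ends.
    # Assumes at least one such point exists.
    inside = [p for p in points if f["L1"] < p < f["U1"]]
    # Between inner and outer fences: small circles (fence values included by assumption).
    mild = [p for p in points
            if f["L2"] <= p <= f["L1"] or f["U1"] <= p <= f["U2"]]
    # Beyond the outer fences: large circles.
    extreme = [p for p in points if p < f["L2"] or p > f["U2"]]
    return {
        "whisker_low": min(inside),
        "whisker_high": max(inside),
        "mild": mild,
        "extreme": extreme,
    }


# Hypothetical points, classified against quartiles 52 and 87 (IQ = 35).
points = [18, 52, 68, 87, 100, 150, 200, -10, -60]
fences = tukey_fences(52, 87)
result = classify(points, 52, 87)
```

With these quartiles the inner fences are -0.5 and 139.5 and the outer fences are -53 and 192, so the whiskers stop at 18 and 100, while 150 and -10 fall between the fences and 200 and -60 fall beyond them.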

22 Box-and-Whisker Plot (4)

23 Box-and-Whisker Plot (5)

24 Scatter Plot

25 Scatter Plot: No Relationship

26 Scatter Plot: Strong Linear (positive correlation) Relationship

27 Scatter Plot: Strong Linear (negative correlation) Relationship

28 Scatter Plot: Exact Linear (positive correlation) Relationship

29 Scatter Plot: Quadratic Relationship

30 Scatter Plot: Sinusoidal Relationship (damped)

31 Scatter Plot: Variation of Y Does Not Depend on X

32 Scatter Plot: Variation of Y Does Depend on X

33 Scatter Plot: Outlier

34

35 Run Chart (1)

36 Run Chart (2)

37 Run Chart + Control Limits = Control Chart
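
The step from a run chart to a control chart can be illustrated with a minimal sketch. The slides do not say how the control limits are computed; the sketch assumes the common "center line plus or minus three standard deviations" rule and estimates the standard deviation directly from a baseline sample, which is a simplification of how control charts are usually built in practice. All names and data are hypothetical.

```python
import statistics


def control_limits(baseline, k=3.0):
    """Lower limit, center line, and upper limit from an in-control baseline."""
    center = statistics.mean(baseline)
    sigma = statistics.stdev(baseline)  # sample standard deviation (assumed estimator)
    return center - k * sigma, center, center + k * sigma


def out_of_control(run, baseline, k=3.0):
    """Indices of run-chart points that fall outside the control limits."""
    lcl, _, ucl = control_limits(baseline, k)
    return [i for i, v in enumerate(run) if v < lcl or v > ucl]


# Hypothetical baseline measurements and a new run to monitor.
baseline = [10, 12, 11, 9, 10, 11, 10, 9, 12, 10]
run = [10, 11, 15, 9, 6]

lcl, center, ucl = control_limits(baseline)
flagged = out_of_control(run, baseline)
```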

38 Lag Plot (1)

39 Lag Plot (2) [Figure: x_t plotted against x_{t-1}. Labels: "New point"; "Interpolate these… to get the final prediction".]
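
A lag plot pairs each value x_t with its predecessor x_{t-1}. The sketch below builds those pairs and, as one possible numerical companion to the plot, computes their Pearson correlation. The function names and the two series are hypothetical, and the correlation summary is an addition for illustration rather than something stated on the slides.

```python
import math


def lag_pairs(series, lag=1):
    """Pairs (x_{t-lag}, x_t) that a lag plot would display; lag must be >= 1."""
    if lag < 1 or lag >= len(series):
        raise ValueError("lag must satisfy 1 <= lag < len(series)")
    return list(zip(series[:-lag], series[lag:]))


def lag_correlation(series, lag=1):
    """Pearson correlation between x_{t-lag} and x_t."""
    pairs = lag_pairs(series, lag)
    xs = [a for a, _ in pairs]
    ys = [b for _, b in pairs]
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    sxy = sum((a - mx) * (b - my) for a, b in pairs)
    sxx = sum((a - mx) ** 2 for a in xs)
    syy = sum((b - my) ** 2 for b in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant series")
    return sxy / math.sqrt(sxx * syy)


# Hypothetical series: a steady trend and a strictly alternating signal.
trend = [1, 2, 3, 4, 5]
alternating = [1, -1, 1, -1, 1, -1]

trend_pairs = lag_pairs(trend)
r_trend = lag_correlation(trend)
r_alt = lag_correlation(alternating)
```

Points that hug a rising line in the lag plot correspond to a correlation near +1, a falling line to a value near -1, and a structureless cloud to a value near 0.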

40 Lag Plot: Random Data

41 Lag Plot: Moderate Autocorrelation

42 Lag Plot: Strong Autocorrelation and Autoregressive Model

43 Lag Plot: Sinusoidal Models and Outliers