Graphical Examination of Data 1.12.1999 Jaakko Leppänen

Slides:



Advertisements
Similar presentations
Multi-Dimensional Data Visualization
Advertisements

Chapter 2 Summarizing and Graphing Data
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 2-4 Statistical Graphics.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
Slide 1 Spring, 2005 by Dr. Lianfen Qian Lecture 2 Describing and Visualizing Data 2-1 Overview 2-2 Frequency Distributions 2-3 Visualizing Data.
8.1 Types of Data Displays Remember to Silence Your Cell Phone and Put It In Your Bag!
Beginning the Visualization of Data
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
Graphic representations in statistics (part II). Statistics graph Data recorded in surveys are displayed by a statistical graph. There are some specific.
1 Chapter 1: Sampling and Descriptive Statistics.
1 Probabilistic and Statistical Techniques Lecture 3 Dr. Nader Okasha.
Types of Data Displays Based on the 2008 AZ State Mathematics Standard.
Visualization and Data Mining. 2 Outline  Graphical excellence and lie factor  Representing data in 1,2, and 3-D  Representing data in 4+ dimensions.
Chapter Sixteen EXPLORING, DISPLAYING, AND EXAMINING DATA
ISE 261 PROBABILISTIC SYSTEMS. Chapter One Descriptive Statistics.
FOUNDATIONS OF NURSING RESEARCH Sixth Edition CHAPTER Copyright ©2012 by Pearson Education, Inc. All rights reserved. Foundations of Nursing Research,
Understanding and Comparing Distributions
1 How to interpret scientific & statistical graphs Theresa A Scott, MS Department of Biostatistics
Chapter 2 Frequency Distributions and Graphs 1 © McGraw-Hill, Bluman, 5 th ed, Chapter 2.
Microarray Gene Expression Data Analysis A.Venkatesh CBBL Functional Genomics Chapter: 07.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 2 Descriptive Statistics: Tabular and Graphical Methods.
Frequency Distributions and Graphs
How to Analyze Data? Aravinda Guntupalli. SPSS windows process Data window Variable view window Output window Chart editor window.
TAKS Mathematics Review.
Unit 1c: Identifying Relationships Quality Core: G.1.b G.1.cG.1.h.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 12 Describing Data.
Exploratory Data Analysis. Computing Science, University of Aberdeen2 Introduction Applying data mining (InfoVis as well) techniques requires gaining.
REPRESENTATION OF DATA.
3. Data Presentation Graphs & Charts.
Chapter 2 Summarizing and Graphing Data
Descriptive Statistics: Tabular and Graphical Methods
Variable  An item of data  Examples: –gender –test scores –weight  Value varies from one observation to another.
Quantitative Skills 1: Graphing
Chapter 1 Review MDM 4U Mr. Lieff. 1.1 Displaying Data Visually Types of data Quantitative Discrete – only whole numbers are possible Continuous – decimals/fractions.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Describing Data.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 2 Descriptive Statistics: Tabular and Graphical Methods.
Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc.
Chapter 11 Graphical Methods. Introduction “A picture is often better than several numerical analyses” Stand-alone procedure, or used in conjunction with.
2.2 Organizing Quantitative Data. Data O Consider the following data O We would like to compute the frequencies and the relative frequencies.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Elementary Statistics Eleventh Edition Chapter 2.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Section 2-2 Frequency Distributions.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 2 Descriptive Statistics: Tabular and Graphical Methods.
Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 2 Descriptive Statistics: Tabular and Graphical Methods.
Unit 4 Statistical Analysis Data Representations.
Descriptive statistics Petter Mostad Goal: Reduce data amount, keep ”information” Two uses: Data exploration: What you do for yourself when.
Data and Analysis Graphs. Georgia Performance Standards M5D1. Students will analyze graphs. a. Analyze data presented in a graph. b. Compare and contrast.
CHAPTER 2 SECTION 2-1 PATTERNS AND ITERATIONS SEQUENCE An arrangement of numbers in a particular order. The numbers are called terms and the pattern.
Chapter 2: Frequency Distributions. Frequency Distributions After collecting data, the first task for a researcher is to organize and simplify the data.
Histograms, Frequency Polygons, and Ogives. What is a histogram?  A graphic representation of the frequency distribution of a continuous variable. Rectangles.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 2 Descriptive Statistics: Tabular and Graphical Methods.
Histograms, Frequency Polygons, and Ogives
MATH 2311 Section 1.5. Graphs and Describing Distributions Lets start with an example: Height measurements for a group of people were taken. The results.
Data Visualization.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 2 Section 2 – Slide 1 of 37 Chapter 2 Section 2 Organizing Quantitative Data.
Chapter 5: Organizing and Displaying Data. Learning Objectives Demonstrate techniques for showing data in graphical presentation formats Choose the best.
Graphs with SPSS Aravinda Guntupalli. Bar charts  Bar Charts are used for graphical representation of Nominal and Ordinal data  Height of the bar is.
Graphical Presentation Dr. Amjad El-Shanti MD, PMH,Dr PH University of Palestine 2016.
Graphs that Enlighten and Graphs that Deceive Chapter 2 Section 4.
DISPLAYING DATA DIAGRAMMATICALLY. The Aim By the end of this lecture, the students will be aware of graphical representation of data and by using SPSS.
Descriptive Statistics: Tabular and Graphical Methods
Organizing Quantitative Data: The Popular Displays
Statistics -S1.
ISE 261 PROBABILISTIC SYSTEMS
Bar Graph A bar graph uses vertical or horizontal bars to display numerical information.
Frequency Distributions and Graphs
Topic 5: Exploring Quantitative data
2-1 Data Summary and Display 2-1 Data Summary and Display.
Constructing and Interpreting Visual Displays of Data
Organizing, Displaying and Interpreting Data
Presentation transcript:

Graphical Examination of Data Jaakko Leppänen

Sources 4 H. Anderson, T. Black: Multivariate Data Analysis, (5th ed., p.40-46). 4 Yi-tzuu Chien: Interactive Pattern Recognition, (Chapter 3.4). 4 S. Mustonen: Tilastolliset monimuuttujamenetelmät, (Chapter 1, Helsinki 1995).

Agenda 4 Examining one variable 4 Examining the relationship between two variables 4 3D visualization 4 Visualizing multidimensional data

Examining one variable 4 Histogram –Represents the frequency of occurences within data categories one value (for discrete variable) an interval (for continuous variable)

Examining one variable 4 Stem and leaf diagram (A&B) –Presents the same graphical information as histogram –provides also an enumeration of the actual data values

Examining the relationship between two variables 4 Scatterplot –Relationship of two variables Linear Non-linear No correlation

Examining the relationship between two variables 4 Boxplot (according A&B) –Representation of data distribution –Shows: Middle 50% distribution Median (skewness) Whiskers Outliers Extreme values

3D visualization 4 Good if there are just 3 variables 4 Mustonen: “Problems will arise when we should show lots of dimensions at the same time. Spinning 3D-images or stereo image pairs give us no help with them.”

Visualizing multidimensional data 4 Scatterplot with varying dots 4 Scatterplot matrix 4 Multivariate profiles 4 Star picture 4 Andrews’ Fourier transformations 4 Metroglyphs (Anderson) 4 Chernoff’s faces

Scatterplot 4 Two variables for x- and y-axis 4 Other variables can be represented by –dot size, square size –height of rectangle –width of rectangle –color

Scatterplot matrix 4 Also named as Draftsman’s display 4 Histograms on diagonal 4 Scatterplot on lower portion 4 Correlations on upper portion

Scatterplot matrix (cont…) correlations histograms scatterplots

Scatterplot matrix (cont…) 4 Shows relations between each variable pair 4 Does not determine common distribution exactly 4 A good mean to learn new material 4 Helps when finding variable transformations

Scatterplot matrix as rasterplot 4 Color level represents the value –e.g. values are mapped to gray levels

Multivariate profiles 4 A&B: ”The objective of the multivariate profiles is to portray the data in a manner that enables each identification of differences and similarities.” 4 Line diagram –Variables on x-axis –Scaled (or mapped) values on y-axis

Multivariate profiles (cont…) 4 An own diagram for each measurement (or measurement group)

Star picture 4 Like multivariate profile, but drawn from a point instead of x-axis 4 Vectors have constant angle

Andrews’ Fourier transformations 4 D.F. Andrews, Each measurement X = (X 1, X 2,..., X p ) is represented by the function below, where -  < t < .

Andrews’ Fourier transformations (cont…) 4 If severeal measurements are put into the same diagram similar measurements are close to each other. 4 The distance of curves is the Euklidean distance in p-dim space 4 Variables should be ordered by importance

Andrews’ Fourier transformations (cont…)

4 Can be drawn also using polar coordinates

Metroglyphs (Andersson) 4 Each data vector (X) is symbolically represented by a metroglyph 4 Consists of a circle and set of h rays to the h variables of X. 4 The lenght of the ray represents the value of variable

Metroglyphs (cont...) 4 Normally rays should be placed at easily visualized and remembered positions 4 Can be slant in the same direction –the better way if there is a large number of metrogyphs

Metroglyphs (cont...) 4 Theoretically no limit to the number of vectors 4 In practice, human eye works most efficiently with no more than 3-7 rays 4 Metroglyphs can be put into scatter diagram => removes 2 vectors

Chernoff’s faces 4 H. Chernoff, Based on the idea that people can detect and remember faces very well 4 Variables determine the face features with linear transformation 4 Mustonen: "Funny idea, but not used in practice."

Chernoff’s faces (cont…) 4 Originally 18 features –Radius to corner of face OP –Angle of OP to horizontal –Vertical size of face OU –Eccentricity of upper face –Eccentricity of lower face –Length of nose –Vertical position of mouth –Curvature of mouth 1/R –Width of mouth

Chernoff’s faces (cont…) 4 Face features (cont…) –Vertical position of eyes –Separation of eyes –Slant of eyes –Eccentricity of eyes –Size of eyes –Position of pupils –Vertical position of eyebrows –Slant of eyebrows –Size of eyebrows

Chernoff’s faces (cont…)

Conclusion 4 Graphical Examination eases the understanding of variable relationships 4 Mustonen: "Even badly designed image is easier to understand than data matrix.” 4 "A picture is worth of a thousand words”