On Interpreting I Interpreting Histograms, Density Functions, distributions of a single attribute What is the type of the attribute? What is the mean.

Slides:



Advertisements
Similar presentations
Lecture 2 Summarizing the Sample. WARNING: Today’s lecture may bore some of you… It’s (sort of) not my fault…I’m required to teach you about what we’re.
Advertisements

IB Math Studies – Topic 6 Statistics.
DENSITY CURVES and NORMAL DISTRIBUTIONS. The histogram displays the Grade equivalent vocabulary scores for 7 th graders on the Iowa Test of Basic Skills.
 Multiple choice questions…grab handout!. Data Analysis: Displaying Quantitative Data.
Exploratory Data Analysis. Computing Science, University of Aberdeen2 Introduction Applying data mining (InfoVis as well) techniques requires gaining.
MAT 1000 Mathematics in Today's World. Last Time 1.Three keys to summarize a collection of data: shape, center, spread. 2.Can measure spread with the.
Summarizing Quantitative Data. We have discussed how to display data in a histogram. Today learn to describe how data is distributed.
 Two basic types Descriptive  Describes the nature and properties of the data  Helps to organize and summarize information Inferential  Used in testing.
Revision Analysing data. Measures of central tendency such as the mean and the median can be used to determine the location of the distribution of data.
Ch. Eick: Some Ideas for Task4 Project2 Ideas on Creating Summaries and Evaluations of Clusterings Focus: Primary Focus Summarization (what kind of objects.
Normal Distributions (aka Bell Curves, Gaussians) Spring 2010.
Histograms. Histograms have some similar characteristics as other graphical representations... Shape: Left skewed, right skewed, symmetric, unimodal,
(Unit 6) Formulas and Definitions:. Association. A connection between data values.
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
5-Analyzing trends in categorical data Dot plots and frequency tables 1-Ways to represent data Data 4.
Describing Distributions
Thursday, May 12, 2016 Report at 11:30 to Prairieview
Continuous random variables
Elementary Statistics
Elementary Statistics
Data graphing facts.
Exploring Data: Summary Statistics and Visualizations
Chapter 4: The Normal Distribution
CHAPTER 2 Modeling Distributions of Data
EXPLORATORY DATA ANALYSIS and DESCRIPTIVE STATISTICS
On Interpreting I Interpreting Histograms, Density Functions, distributions of a single attribute What is the type of the attribute? What is the mean.
CHAPTER 2 Modeling Distributions of Data
Good Afternoon! Agenda: Knight’s Charge-please wait for direction
Chapter 1 & 3.
Distributions and Graphical Representations
Unit 1 - Graphs and Distributions
Basics of Statistics.
3.4 Histograms.
CHAPTER 2 Modeling Distributions of Data
The Normal Distribution
Honors Statistics Chapter 4 Part 4
Displaying Quantitative Data
DS4 Interpreting Sets of Data
Example Histogram c) Interpret the following histogram that captures the percentage of body-fat in a testgroup [4]:  
CHAPTER 2 Modeling Distributions of Data
Inference for Distributions
Means & Medians.
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
How where first 3 displays generated?
Welcome!.
QUANTITATIVE DATA chapter 4 (NUMERICAL).
On Interpreting I Interpreting Histograms, Density Functions, distributions of a single attribute What is the type of the attribute? What is the mean.
Statistics for Managers Using Microsoft® Excel 5th Edition
The Normal Distribution
CHAPTER 2 Modeling Distributions of Data
DISPLAYING AND DESCRIBING THE DISTRIBUTIONS OF NUMERICAL VARIABLES
The Normal Distribution
Honors Statistics Review Chapters 4 - 5
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
Lesson – Teacher Notes Standard:
Describing Data: Displaying and Exploring Data
Basic Practice of Statistics - 3rd Edition
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
Fall Final Topics by “Notecard”.
Density Curves and the Normal Distributions
Objective: To introduce the characteristics of normal distribution curve. Standard 5.10.
Lesson Plan Day 1 Lesson Plan Day 2 Lesson Plan Day 3
Warm Up 4/30 What are you thankful for?
Essential Statistics Inference about a Population Mean
CHAPTER 2 Modeling Distributions of Data
CHAPTER 2 Modeling Distributions of Data
Presentation transcript:

On Interpreting I Interpreting Histograms, Density Functions, distributions of a single attribute What is the type of the attribute? What is the mean value; what is the mode? Is the a lot of spread or not (compute the standard deviation) Is the distribution unimodal (one hill or no hill)) or multi-modal (multiple hills)? Is the distribution skewed (e.g. compare mean with median)? Are there any outliers? Are there any duplicate values? Are there any gaps in the attribute value distribution? Characterize the shape of the density function!

On Interpreting II Interpreting Scatter Plots and Similar Display Characterize the distribution of each class in the attribute space; is it unimodal or mult-imodal? Characterize the overall distribution (including all examples); do you observe any correlation or other characteristics? Analyze the separation of a single class from all the other classes. Analyze the separation between pairs of classes. If classes overlap characterize the extend to which they overlap. If decision boundaries between classes can be inferred characterize those decision boundaries. Assess the difficulty of the classification based on your findings of looking at a set of scatter plots.

Body fat Histogram

Scatter Plot Array of Iris Attributes

On Interpreting I Interpreting Histograms, Density Functions, distributions of a single attribute What is the type of the attribute? Positive real numbers What is the mean value; what is the mode? Is the a lot of spread or not (compute the standard deviation)? Not much Is the distribution unimodal (one hill or no hill)) or multi-modal (multiple hills)? One hill or two hills, depending on how you interpret the data. The second hill is not very well separated; therefore I would say unimodal. Is the distribution skewed (e.g. compare mean with median)? Are there any outliers? Yes values above 45…? Are there any duplicate values? Are there any gaps in the attribute value distribution? Yes two gaps: 1)… 2)… Characterize the shape of the density function! Bell Curve

On Interpreting II (pedal length/width) Interpreting Scatter Plots and Similar Display Characterize the distribution of each class in the attribute space; is it unimodal or mult-imodal? Unimodal each. Characterize the overall distribution (including all examples); do you observe any correlation or other characteristics? quite strong positive correlation between the two attributes. Analyze the separation of a single class from all the other classes. Analyze the separation between pairs of classes. Blue is clearly separated from the two other; red and green only slightly overlap; If classes overlap characterize the extend to which they overlap. If decision boundaries between classes can be inferred characterize those decision boundaries. Test using just sepal length will mostly do a good job. Assess the difficulty of the classification based on your findings of looking at a set of scatter plots. Easy