Univariate Visualization

Slides:



Advertisements
Similar presentations
ISE 261 PROBABILISTIC SYSTEMS. Chapter One Descriptive Statistics.
Advertisements

Statistics: Use Graphs to Show Data Box Plots.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
Programming in R Describing Univariate and Multivariate data.
Jeopardy Q $100 Q $200 Q $300 Q $400 Q $500 Q $100 Q $200 Q $300 Q $400 Q $500 Final Jeopardy.
Chapter 1 Descriptive Analysis. Statistics – Making sense out of data. Gives verifiable evidence to support the answer to a question. 4 Major Parts 1.Collecting.
Exploratory Data Analysis. Computing Science, University of Aberdeen2 Introduction Applying data mining (InfoVis as well) techniques requires gaining.
Let’s Review for… AP Statistics!!! Chapter 1 Review Frank Cerros Xinlei Du Claire Dubois Ryan Hoshi.
Statistics.
Variable  An item of data  Examples: –gender –test scores –weight  Value varies from one observation to another.
Methods for Describing Sets of Data
Unit 2 Sections 2.1.
1 Laugh, and the world laughs with you. Weep and you weep alone.~Shakespeare~
Warm Up – Find the mean, median & mode of each set. Data Set I Data Set II.
Sampling Design and Analysis MTH 494 Ossam Chohan Assistant Professor CIIT Abbottabad.
Categorical vs. Quantitative…
Bellwork 1. If a distribution is skewed to the right, which of the following is true? a) the mean must be less than the.
Unit 4 Statistical Analysis Data Representations.
Statistics Chapter 1: Exploring Data. 1.1 Displaying Distributions with Graphs Individuals Objects that are described by a set of data Variables Any characteristic.
Chapter 4: Quantitative Data Part 1: Displaying Quant Data (Week 2, Wednesday) Part 2: Summarizing Quant Data (Week 2, Friday)
Statistics topics from both Math 1 and Math 2, both featured on the GHSGT.
Vocabulary to know: *statistics *data *outlier *mean *median *mode * range.
Box Plots Show the Spread of Data BOX PLOT NOTES.
4.2 Displays of Quantitative Data. Stem and Leaf Plot A stem-and-leaf plot shows data arranged by place value. You can use a stem-and-leaf plot when you.
ALL ABOUT THAT DATA UNIT 6 DATA. LAST PAGE OF BOOK: MEAN MEDIAN MODE RANGE FOLDABLE Mean.
Chapter 5: Organizing and Displaying Data. Learning Objectives Demonstrate techniques for showing data in graphical presentation formats Choose the best.
(Unit 6) Formulas and Definitions:. Association. A connection between data values.
1 By maintaining a good heart at every moment, every day is a good day. If we always have good thoughts, then any time, any thing or any location is auspicious.
Unit 1 - Graphs and Distributions. Statistics 4 the science of collecting, analyzing, and drawing conclusions from data.
ALL ABOUT THAT DATA UNIT 6 DATA. LAST PAGE OF BOOK: MEAN MEDIAN MODE RANGE FOLDABLE Mean.
Central Tendency  Key Learnings: Statistics is a branch of mathematics that involves collecting, organizing, interpreting, and making predictions from.
All About that Data Unit 6 Data.
Organizing Quantitative Data: The Popular Displays
Exploratory Data Analysis
Methods for Describing Sets of Data
STAT 4030 – Programming in R STATISTICS MODULE: Basic Data Analysis
ISE 261 PROBABILISTIC SYSTEMS
MAT 135 Introductory Statistics and Data Analysis Adjunct Instructor
Unit 4 Statistical Analysis Data Representations
All About that Data Unit 6 Data.
STATISTICS ELEMENTARY MARIO F. TRIOLA
CHAPTER 5 Basic Statistics
Statistical Reasoning
Description of Data (Summary and Variability measures)
Laugh, and the world laughs with you. Weep and you weep alone
Bar graphs are used to compare things between different groups
def plotDrugData4(data, data1, color, name, name1): plot(data, data1, color) show() xlabel(name) ylabel(name1) title(name + ' versus ' + name1)
Descriptive Statistics
Frequency Distributions and Graphs
Distributions (Chapter 1) Sonja Swanson
Topic 5: Exploring Quantitative data
HMI 7530– Programming in R STATISTICS MODULE: Basic Data Analysis
Unit 3: Statistics Final Exam Review.
Frequency Distributions
Unit 2: Statistics Final Exam Review.
Displaying Distributions with Graphs
Displaying and Summarizing Quantitative Data
The Range Chapter Data Analysis Learning Goal: To be able to describe the general shape of a distribution in terms of its.
Measures of Central Tendency
Welcome!.
Honors Statistics Review Chapters 4 - 5
Vocabulary for Feb. 20-Mar
Lesson – Teacher Notes Standard:
Experimental Design Experiments Observational Studies
Two Way Frequency Table
Find the Mean of the following numbers.
Types of variables. Types of variables Categorical variables or qualitative identifies basic differentiating characteristics of the population.
ALGEBRA STATISTICS.
Math 341 January 24, 2007.
Biostatistics Lecture (2).
Presentation transcript:

Univariate Visualization CMSC 120: Visualizing Information 2/21/08

Types of Data Qualitative: pertaining to fundamental or distinctive characteristics Nominal: unordered (e.g., names, types) Ordinal: ordered (e.g., cold, warm, hot) Quantitative: pertaining to an amount of anything Discrete: isolated intervals Continuous: unbroken, immediate connection

Univariate Data A single attribute

Weather Conditions: 2/17/08

Univariate Data A single attribute Characterize Observations Temperature: quantitative Condition: qualitative Characterize Observations Number Type Similarity

The Raw Data: A Dot Plot n ≤ 20 Distance between individual points Emphasize clusters, gaps, outliers Reveal frequency of each observation

Frequency Table Groups observations by class Quantitative: an interval or part of the range of the sample Qualitative: a potential value Frequency: number of observations that fall into a class Relative Frequency: frequency / sample size

Frequency Table Clear 5 17 % Mostly Cloudy 1 3 % Partly Cloudy 3 10 % Condition Frequency Relative Frequency Clear 5 17 % Mostly Cloudy 1 3 % Partly Cloudy 3 10 % Overcast 16 55 % Light Rain 4 14 %

Frequency Table 25-30 7 24% 30-35 1 3% 35-40 3 10% 40-45 45-50 13 45% Temperature Frequency Relative Frequency 25-30 7 24% 30-35 1 3% 35-40 3 10% 40-45 45-50 13 45% 50-55 0 % 55-60 3 % 60-65

Stem and Leaf Plots Stem Leaf 2 5566789 3 1567 4 02555566677788999 5 8 Temperature 25 26 27 31 36 40 42 Stem Leaf 2 5 Stem Leaf 2 5566789 3 1567 4 02555566677788999 5 8 6 Stem Leaf 2 5567 Separate each number into a stem (class) and a leaf Group numbers with the same stems

Pie Charts Useful for qualitative data Must sum to 100%

Histograms Pictorial representation of a Frequency Table Set of boxes whose area represents relative frequency of observations per class Total Area of all boxes = 100% Shape of histogram determined by box Number = number of classes Width = class interval Height

Histogram

Histogram

Patterns Outliers: observations well away from main body of data Number of peaks (modes): most popular values Abrupt Changes

Shape Central Values: where data appear to be centered Mode Mean Central Values: where data appear to be centered Spread: how spread out the points are Symmetry (Skew)

How to Lie: Aggregation Process of putting data into groups Allows user to compare among groups Hides differences between groups Too little: noise of individual data overwhelms overall pattern Too much: important patterns are hidden within groups

Interval Size = 7 Degrees

Interval Size = 14 Degrees

Shape of Shell Aperture

Shape of Shell Aperture

Shape of Shell Aperture

Shape of Shell Aperture

Shape of Shell Aperture

The 5 Number Summary Continuous, Quantitative Data Order data from lowest value to highest Minimum: lowest value Lower Quartile: cuts off ¼ of the data Median: middle value Upper Quartile: cuts off ¾ of the data Maximum: highest value

Minimum = 25 Lower Quartile = 30.9 Median = 45 Upper Quartile = 46.9 26.1 27 28 28.9 30.9 35.6 37 39.9 42.1 44.6 45 46 46.4 46.9 48 48.2 48.9 57.9 60.1 Minimum = 25 Lower Quartile = 30.9 Median = 45 Upper Quartile = 46.9 Maximum = 60.1

Box and Whisker Plot Maximum = 60.1 Outlier Largest Non-Outlier Upper Quartile = 46.9 Median = 45 50% of Data Lower Quartile = 30.9 Smallest Non-Outlier Minimum = 25

Shell Shape