Displaying your data and using Classify Exploring how to use the legend classify command.

Slides:



Advertisements
Similar presentations
Brought to you by Tutorial Support Services The Math Center.
Advertisements

Agricultural and Biological Statistics
Introduction to Summary Statistics
Histograms. Definition of a Histogram A Histogram displays a range of values of a variable that have been broken into groups or intervals. Histograms.
Dual Tragedies in the B-ham Paper. Module 2 Simple Descriptive Statistics and Univariate Displays of Data A Tale of Three Cities George Howard, DrPH.
Measures of Dispersion
Measures of Variability or Dispersion
Descriptive Statistics
Attribute based and Spatial Operations Section III Part 1: Attribute Based Operations.
Measures of Central Tendency
Descriptive Statistics Healey Chapters 3 and 4 (1e) or Ch. 3 (2/3e)
GrowingKnowing.com © Variability We often want to know the variability of data. Please give me $1000, I will give you… 8% to 9% in a year. Small.
Descriptive Statistics Used to describe the basic features of the data in any quantitative study. Both graphical displays and descriptive summary statistics.
Part II Sigma Freud & Descriptive Statistics
BIOSTAT - 2 The final averages for the last 200 students who took this course are Are you worried?
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
Do Now Consider statistics. How is it relevant to real-life? What are some applications we can use it for? Give examples.
Statistics Recording the results from our studies.
BUS250 Seminar 4. Mean: the arithmetic average of a set of data or sum of the values divided by the number of values. Median: the middle value of a data.
Research Methods Chapter 8 Data Analysis. Two Types of Statistics Descriptive –Allows you to describe relationships between variables Inferential –Allows.
Harry Williams, Cartography1 THEMATIC MAPS A thematic map shows numeric or character data by colors or symbols. Data displayed in this manner is referred.
Objectives The student will be able to: find the variance of a data set. find the standard deviation of a data set.
Worked examples and exercises are in the text STROUD PROGRAMME 27 STATISTICS (contd)
HAWKES LEARNING SYSTEMS Students Matter. Success Counts. Copyright © 2013 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Section 2.1.
Copyright © 2014 by Nelson Education Limited. 3-1 Chapter 3 Measures of Central Tendency and Dispersion.
According to researchers, the average American guy is 31 years old, 5 feet 10 inches, 172 pounds, works 6.1 hours daily, and sleeps 7.7 hours. These numbers.
INVESTIGATION 1.
Descriptive Statistics: Presenting and Describing Data.
Practice Page 65 –2.1 Positive Skew Note Slides online.
STATISTICAL ANALYSIS Created by The North Carolina School of Science and Math.The North Carolina School of Science and Math Copyright North Carolina.
1 Chapter 7 – The Choropleth Map Data Classification.
STATISTICS. What is the difference between descriptive and inferential statistics? Descriptive Statistics: Describe data Help us organize bits of data.
Reclassification Methods From important a research topic to trivial computer functions Is it to easy?
BASIC STATISTICAL CONCEPTS Chapter Three. CHAPTER OBJECTIVES Scales of Measurement Measures of central tendency (mean, median, mode) Frequency distribution.
Week 10 Ways to polish up your final Layout. Overview Changing file names to better names in Legend Choosing categories in a Graduated Legend Adding/Changing.
Measures of Central Tendency Mean, Median, Mode, and Range.
Descriptive Statistics for one Variable. Variables and measurements A variable is a characteristic of an individual or object in which the researcher.
Standard Deviation. Two classes took a recent quiz. There were 10 students in each class, and each class had an average score of 81.5.
Descriptive Statistics Research Writing Aiden Yeh, PhD.
Geographer's WorkBench G.E.M. Geotechnologies 2001 Mapping Classification techniques Groups of Features with Similar Values.
Chapter 3 DATA PROCESS & ANALYSIS OF STATISTICS Dr. BALAMURUGAN MUTHURAMAN
Demonstration How to create meaningful Maps - using graduated symbols - using graduated colours - using Classification methods Analyzing techniques.....
Symbolizing and Classifying How to improve your displayed data. ?
Review of Classification Techniques Lumpers or Splitters?
Chapter 6: Descriptive Statistics. Learning Objectives Describe statistical measures used in descriptive statistics Compute measures of central tendency.
MDFP Mathematics and Statistics 1. Univariate Data – Today’s Class 1.STATISTICS 2.Univariate (One Variable) Data 1.Definition 2.Mean, Median, Mode, Range.
CHAPTER 11 Mean and Standard Deviation. BOX AND WHISKER PLOTS  Worksheet on Interpreting and making a box and whisker plot in the calculator.
Measures of Variation. Variation Variation describes how widely data values are spread out about the center of a distribution.
An Introduction to Statistics
Variability GrowingKnowing.com © 2011 GrowingKnowing.com © 2011.
INTRODUCTION TO STATISTICS
Different Types of Data
Practice Page Practice Page Positive Skew.
Making a Line Plot Collect data and put in chronological order
Displaying Data ENVS 521 Lecture 4.
Introduction to Summary Statistics
Introduction to Summary Statistics
Introduction to Summary Statistics
Making a Line Plot Collect data and put in chronological order
Introduction to Summary Statistics
Introduction to Summary Statistics
Standard Deviation.
Introduction to Summary Statistics
Introduction to Summary Statistics
Map Generalization and Data Classification Gary Christopherson
Introduction to Summary Statistics
Measures of Central Tendency
Standard Deviation!.
Introduction to Summary Statistics
Frequency Distributions
Presentation transcript:

Displaying your data and using Classify Exploring how to use the legend classify command

When displaying data on a map there are several things you should be aware of: 1.Since polygon sizes are different many times large areas simply have larger numbers 2.Thus, Normalizing by population can produce different results 3.How you classify your data can also emphasize different patterns 4.The number of classes you use can add to complexity

Here are the contiguous 48 states for Whites in the US In part large states end up at the highest end of the category This is using the default “natural breaks” which probably isn’t the best classification here Default with total count 5 classes and Natural Breaks natural breaks classification See Also: classification, Jenks' optimizationclassificationJenks' optimization [cartography] A method of manual data classification that seeks to partition data into classes based on natural groups in the data distribution. Natural breaks occur in the histogram at the low points of valleys. Breaks are assigned in the order of the size of the valleys, with the largest valley being assigned the first natural break.

Here I have changed to a simpler 3 classes Notice now the smaller eastern states and large but mostly low density western states fall into the lowest category But the size of the categories is quite a bit different

Real Definition of Natural Breaks Jenk’s Optimization: The method requires an iterative process. That is, calculations must be repeated using different breaks in the dataset to determine which set of breaks has the smallest in-class variance. The process is started by dividing the ordered data into groups. Initial group divisions can be arbitrary. There are four steps that must be repeated:variance Calculate the sum of squared deviations between classes (SDBC). Calculate the sum of squared deviations from the array mean (SDAM). Subtract the SDBC from the SDAM (SDAM-SDBC). This equals the sum of the squared deviations from the class means. After inspecting each of the SDBC, a decision is made to move one unit from the class with the largest SDBC toward the class with the lowest SDBC. New class deviations are then calculated, and the process is repeated until the sum of the within class deviations reaches a minimal value. [1][5] [1][5] 5 Classes 3 Classes

What is it doing? Not always clear. In my opinion works better with remotely sensed data. If data is logarithmic, then use a log or geometric classification. Make sure your classification scheme reflects whatever you’re trying to do.

Equal Interval

Results of Equal Interval Now we see the results of really big state and small one based on population in equal sized classes

Geometric Progression Since our data is highly skewed to the right, we might want to try a geometric progression

Geometric Progression Now the really small states are really small the middle size ones have a larger range, and the largest ones have the largest range

Now do it by percent white

Percent White Consider the future of Republicans.

Switch to 5 Classes to improve detail

Exploring for a Geometric Progression Given the fairly even distribution of the data there doesn’t seem to be anything gained by going to a geometric progression

Further Explorations Now explore Hispanic and Black Populations

Final note and caution How you display your data can give quite different answers