1 Chapter 7 – The Choropleth Map Data Classification.

Slides:



Advertisements
Similar presentations
Chapter 3, Numerical Descriptive Measures
Advertisements

Chapter Three McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved
BHS Methods in Behavioral Sciences I April 18, 2003 Chapter 4 (Ray) – Descriptive Statistics.
Calculating & Reporting Healthcare Statistics
Variability Measures of spread of scores range: highest - lowest standard deviation: average difference from mean variance: average squared difference.
Basic Business Statistics 10th Edition
CHAPTER 3: Statistical Description of Data
Learning Objectives for Section 11.3 Measures of Dispersion
Chap 3-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 3 Describing Data: Numerical Statistics for Business and Economics.
SHOWTIME! STATISTICAL TOOLS IN EVALUATION DESCRIPTIVE VALUES MEASURES OF VARIABILITY.
Measures of Central Tendency
Quiz 2 Measures of central tendency Measures of variability.
CHAPTER 4 Measures of Dispersion. In This Presentation  Measures of dispersion.  You will learn Basic Concepts How to compute and interpret the Range.
ArcGIS Overview Lecture 1: Software Layer characteristics Thematic maps.
Describing Data: Numerical
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 4.1 Chapter Four Numerical Descriptive Techniques.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 12 Describing Data.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 4.1 Chapter Four Numerical Descriptive Techniques.
May 06th, Chapter - 7 INFORMATION PRESENTATION 7.1 Statistical analysis 7.2 Presentation of data 7.3 Averages 7.4 Index numbers 7.5 Dispersion from.
Statistics. Question Tell whether the following statement is true or false: Nominal measurement is the ranking of objects based on their relative standing.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 16 Descriptive Statistics.
Numerical Descriptive Techniques
Chapter 3: Central Tendency. Central Tendency In general terms, central tendency is a statistical measure that determines a single value that accurately.
ITEC6310 Research Methods in Information Technology Instructor: Prof. Z. Yang Course Website: c6310.htm Office:
Smith/Davis (c) 2005 Prentice Hall Chapter Four Basic Statistical Concepts, Frequency Tables, Graphs, Frequency Distributions, and Measures of Central.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Statistics Recording the results from our studies.
Descriptive Statistics
Tuesday August 27, 2013 Distributions: Measures of Central Tendency & Variability.
Our objectives: We will consider four thematic map types choropleth proportional symbol dot density cartograms understand decisions involved in classifying.
8.3 Measures of Dispersion  In this section, you will study measures of variability of data. In addition to being able to find measures of central tendency.
STAT 280: Elementary Applied Statistics Describing Data Using Numerical Measures.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Model Construction: interpolation techniques 1392.
Lecture 3 Describing Data Using Numerical Measures.
INVESTIGATION 1.
Dr. Serhat Eren 1 CHAPTER 6 NUMERICAL DESCRIPTORS OF DATA.
Chap 3-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 3 Describing Data Using Numerical.
Dr. Serhat Eren Other Uses for Bar Charts Bar charts are used to display data for different categories where the data are some kind of quantitative.
© Copyright McGraw-Hill 2000
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Chapter 13 Descriptive Data Analysis. Statistics  Science is empirical in that knowledge is acquired by observation  Data collection requires that we.
1 Descriptive Statistics 2-1 Overview 2-2 Summarizing Data with Frequency Tables 2-3 Pictures of Data 2-4 Measures of Center 2-5 Measures of Variation.
Reclassification Methods From important a research topic to trivial computer functions Is it to easy?
GEOG 370 Christine Erlien, Instructor
Unit 2 (F): Statistics in Psychological Research: Measures of Central Tendency Mr. Debes A.P. Psychology.
Chapter 5: Measures of Dispersion. Dispersion or variation in statistics is the degree to which the responses or values obtained from the respondents.
Edpsy 511 Exploratory Data Analysis Homework 1: Due 9/19.
LIS 570 Summarising and presenting data - Univariate analysis.
Introduction to statistics I Sophia King Rm. P24 HWB
Chapter 11 Data Descriptions and Probability Distributions Section 3 Measures of Dispersion.
Displaying your data and using Classify Exploring how to use the legend classify command.
Geographer's WorkBench G.E.M. Geotechnologies 2001 Mapping Classification techniques Groups of Features with Similar Values.
Chapter 2 Describing and Presenting a Distribution of Scores.
Descriptive Statistics(Summary and Variability measures)
Chapter 3 EXPLORATION DATA ANALYSIS 3.1 GRAPHICAL DISPLAY OF DATA 3.2 MEASURES OF CENTRAL TENDENCY 3.3 MEASURES OF DISPERSION.
©2013, The McGraw-Hill Companies, Inc. All Rights Reserved Chapter 2 Describing and Presenting a Distribution of Scores.
1 Mapping tehcniques Choropleth mapping Data classification Attribution (by) Licensees may copy, distribute, display and perform the work and make derivative.
Central Tendency Quartiles and Percentiles (الربيعيات والمئينات)
Exploratory Data Analysis
Quiz.
Different Types of Data
Chapter 7 Review.
Displaying Data ENVS 521 Lecture 4.
Key Terms Symbology Categorical attributes Style Layer file.
Data Representation and Mapping
CHAPTER 3 Data Description 9/17/2018 Kasturiarachi.
Summary descriptive statistics: means and standard deviations:
Module 5.
Summary descriptive statistics: means and standard deviations:
Presentation transcript:

1 Chapter 7 – The Choropleth Map Data Classification

2 Appropriateness of Data Enumeration mapping – describes areal classified or aggregated data Appropriateness of Data – political boundary (Census Bureau units) Not appropriate – continuous phenomena should not be mapped by choropleth maps Two enumeration data: totals or derived. Sometimes, totals are not suitable for mapping (Fig 7.4) Major Assumption – the value in the enumeration unit is spread uniformly throughout the unit. If data cannot be dealt with as ratios or proportions, then should not be portrayed by the choropleth tecnique.

3 Common Methods of Data Classification No more than 5 to 7 classes Six common methods - equal intervals, standard deviation, Arithmetic progression, Geometric progression, Quantile, Natural breaks, and optimal.

4 Equal Intervals Useful when histogram of data array has a rectangular shape (rare in geographic phenomena) Advantages: 1) easy to compute the intervals 2) easy to interpret the resulting intervals 3) no gap in the legend display 4) only lowest limits can be shown in legend Disadvantage: skewed data is not appropriate.

5 Calculating Steps 1. Calculate the range of the data (R) : R = H – L 2. Obtain the common difference (CD) : CD = R/(# of Classes) 3. Obtain the class limits by calculating  L + 1 x CD = first class limit  L + 2 x CD = second class limit  L + (n-1) x CD = last class limit

6 Quantile Ordered data are placed in classes. Ties can complicate the quantiles method. Advantages - 1) class limits can be computed manually.2) if enumeration units are same, each class will have the same map area. 3) quantile are useful for ordinal-level data, no numeric information would be necessary to create the classification. Disadvantage - 1) gap result may vary. 2) fails to consider data distribution. K = # of enumeration units / number of classes

7 Standard Deviation Used only when the data array approximates a normal distribution. Advantages: 1) Distribution of data is taken into account. 2) if normal distributed data is used, the mean is a good divider. 3) no gap in the legend. Disadvantage: 1) only work with normal- distributed data. 2) negative values may be in the range

8 Geometric Progression Useful technique when frequency of data declines continuously with increasing magnitude a, ar 1, ar 2,...ar n 1) compute common multiplier (a is the lowest value, r is the common multiplier and n is the number of classes use “Xmin x r n = Xmax” to obtain r eg. 118 x r 5 = 790 ( use Area as variable) r = 1.46 So the interval goes from 118, 118x1.46, 118x1.46 2, 118x1.46 3,118x , , , ,

9 Graphic Array Figure 7.11 Class boundaries are identified at places where slopes change remarkably Disadvantage: not suitable for large amount of data

10 Jenks Optimization Forming groups that are internally homogenous while assuring heterogeneity among classes groups are created based on gaps. Minimize differences within class and maximize differences between classes. Based on GVF (Goodness of Variance Fit) - an optimization techniques to minimize the sum of the variance within each of the class.

11 GVF (Fisher-Jenkins Algorithms) 1) compute the squared deviation of each data Compute SDCM (Squared Deviation, Class Means). Compute GVF = (SDAM - SDCM) / SDAM The goal is to maximize the value of GVF (closer to 1.0 is the better value)

12

13 Practice Copy US-states2.xls from g:\4210\data\ to your own folder. (you may need to create a folder under hw and have this file copied to) Compute 5 classes intervals of “Pop90_sqmi” for the following methods –Geometric Progression –Quantile –Equal Interval –GVF (use GIS’s range to compute GVF)

14 Practice - ArcMap Open a new project and add states.shp from c:\esri\esridata\usa to the layer Plot the US map based on Pop90_sqmi using different methods. Compute GVFs for the four methods.