Why Is It There? Getting Started with Geographic Information Systems Chapter 6.

Slides:



Advertisements
Similar presentations
Richard M. Jacobs, OSA, Ph.D.
Advertisements

Kin 304 Regression Linear Regression Least Sum of Squares
Appendix A. Descriptive Statistics Statistics used to organize and summarize data in a meaningful way.
Objectives (BPS chapter 24)
QUANTITATIVE DATA ANALYSIS
Calculating & Reporting Healthcare Statistics
Why Is It There? Lecture 6 Introduction to Geographic Information Systems Geography 176A 2006 Summer, Session B Department of Geography University of California,
Why Is It There? Getting Started with Geographic Information Systems Chapter 6.
Lect 10b1 Histogram – (Frequency distribution) Used for continuous measures Statistical Analysis of Data ______________ statistics – summarize data.
1 Basic statistics Week 10 Lecture 1. Thursday, May 20, 2004 ISYS3015 Analytic methods for IS professionals School of IT, University of Sydney 2 Meanings.
Topics: Regression Simple Linear Regression: one dependent variable and one independent variable Multiple Regression: one dependent variable and two or.
Introduction to GIS: Lecture #7 (GIS Analysis) GIS Analysis Describing Attributes Statistical Analysis Spatial Description Spatial Analysis Searching for.
Business Statistics - QBM117 Statistical inference for regression.
Summary of Quantitative Analysis Neuman and Robson Ch. 11
1 Chapter 4: Variability. 2 Variability The goal for variability is to obtain a measure of how spread out the scores are in a distribution. A measure.
Measures of Variability: Range, Variance, and Standard Deviation
Chapter 4 SUMMARIZING SCORES WITH MEASURES OF VARIABILITY.
Relationships Among Variables
The Data Analysis Plan. The Overall Data Analysis Plan Purpose: To tell a story. To construct a coherent narrative that explains findings, argues against.
Math 116 Chapter 12.
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 12 Describing Data.
Describing distributions with numbers
Fall 2013 Lecture 5: Chapter 5 Statistical Analysis of Data …yes the “S” word.
PSYCHOLOGY: Themes and Variations Weiten and McCann Appendix B : Statistical Methods Copyright © 2007 by Nelson, a division of Thomson Canada Limited.
JDS Special Program: Pre-training1 Basic Statistics 01 Describing Data.
Variability The goal for variability is to obtain a measure of how spread out the scores are in a distribution. A measure of variability usually accompanies.
Describing Behavior Chapter 4. Data Analysis Two basic types  Descriptive Summarizes and describes the nature and properties of the data  Inferential.
METHODS IN BEHAVIORAL RESEARCH NINTH EDITION PAUL C. COZBY Copyright © 2007 The McGraw-Hill Companies, Inc.
Chapter 8 Quantitative Data Analysis. Meaningful Information Quantitative Analysis Quantitative analysis Quantitative analysis is a scientific approach.
Lecture 5: Chapter 5: Part I: pg Statistical Analysis of Data …yes the “S” word.
TYPES OF STATISTICAL METHODS USED IN PSYCHOLOGY Statistics.
Statistical analysis Outline that error bars are a graphical representation of the variability of data. The knowledge that any individual measurement.
PCB 3043L - General Ecology Data Analysis. OUTLINE Organizing an ecological study Basic sampling terminology Statistical analysis of data –Why use statistics?
Statistics - methodology for collecting, analyzing, interpreting and drawing conclusions from collected data Anastasia Kadina GM presentation 6/15/2015.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
 Two basic types Descriptive  Describes the nature and properties of the data  Helps to organize and summarize information Inferential  Used in testing.
Chapter 4: Variability. Variability Provides a quantitative measure of the degree to which scores in a distribution are spread out or clustered together.
Descriptive & Inferential Statistics Adopted from ;Merryellen Towey Schulz, Ph.D. College of Saint Mary EDU 496.
Why is it there? (How can a GIS analyze data?) Getting Started, Chapter 6 Paula Messina.
Three Broad Purposes of Quantitative Research 1. Description 2. Theory Testing 3. Theory Generation.
Statistics in IB Biology Error bars, standard deviation, t-test and more.
Chapter 6: Analyzing and Interpreting Quantitative Data
PCB 3043L - General Ecology Data Analysis.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Simple Linear Regression Analysis Chapter 13.
Chapter 4: Variability. Variability The goal for variability is to obtain a measure of how spread out the scores are in a distribution. A measure of variability.
1 Chapter 10: Describing the Data Science is facts; just as houses are made of stones, so is science made of facts; but a pile of stones is not a house.
Educational Research: Data analysis and interpretation – 1 Descriptive statistics EDU 8603 Educational Research Richard M. Jacobs, OSA, Ph.D.
Jump to first page Inferring Sample Findings to the Population and Testing for Differences.
(Unit 6) Formulas and Definitions:. Association. A connection between data values.
Why Is It There? Chapter 6. Review: Dueker’s (1979) Definition “a geographic information system is a special case of information systems where the database.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
Chapter 11 Summarizing & Reporting Descriptive Data.
© 2006 by The McGraw-Hill Companies, Inc. All rights reserved. 1 Chapter 10 Descriptive Statistics Numbers –One tool for collecting data about communication.
Descriptive Statistics ( )
Statistical analysis.
Statistical analysis.
PCB 3043L - General Ecology Data Analysis.
Analyzing and Interpreting Quantitative Data
CHAPTER 12 More About Regression
Research Statistics Objective: Students will acquire knowledge related to research Statistics in order to identify how they are used to develop research.
Correlation and Regression
Introduction to Statistics
Chapter 10 Correlation and Regression
CHAPTER 12 More About Regression
15.1 The Role of Statistics in the Research Process
MBA 510 Lecture 2 Spring 2013 Dr. Tonya Balan 4/20/2019.
CHAPTER 12 More About Regression
Chapter Nine: Using Statistics to Answer Questions
Descriptive Statistics
Presentation transcript:

Why Is It There? Getting Started with Geographic Information Systems Chapter 6

6 Why Is It There? l 6.1 Describing Attributes l 6.2 Statistical Analysis l 6.3 Spatial Description l 6.4 Spatial Analysis l 6.5 Searching for Spatial Relationships l 6.6 GIS and Spatial Analysis

Duecker (1979) " A geographic information system is a special case of information systems where the database consists of observations on spatially distributed features, activities or events, which are definable in space as points, lines, or areas. A geographic information system manipulates data about these points, lines, and areas to retrieve data for ad hoc queries and analyses ".

GIS is capable of data analysis l Attribute Data –Describe with statistics –Analyze with hypothesis testing l Spatial Data –Describe with maps –Analyze with spatial analysis

Describing one attribute

Attribute Description l The extremes of an attribute are the highest and lowest values, and the range is the difference between them in the units of the attribute. l A histogram is a two-dimensional plot of attribute values grouped by magnitude and the frequency of records in that group, shown as a variable-length bar. l For a large number of records with random errors in their measurement, the histogram resembles a bell curve and is symmetrical about the mean.

If the records are: l Text –Length of text –word frequency –address matching l Example: Display all places called “State Street”

If the records are: l Classes –histogram by class –numbers in class –contiguity description

Describing a classed raster grid P (blue) = 19/48

If the records are: l Numbers –statistical description –min, max, range –variance and standard deviation

Statistical description l Range (min, max, max-min) l Central tendency (mode, median, mean) l Variation (variance, standard deviation)

Elevation (book example)

Mean l Statistical average l Sum of the values for one attribute divided by the number of records X i i1= n  = X

Computing the Mean l Sum of attribute values across all records, divided by the number of records. l A representative value, and for measurements with normally distributed error, converges on the true reading. l A value lacking sufficient data for computation is called a missing value.

Variance l The total variance is the sum of each record with its mean subtracted and then multiplied by itself. l The standard deviation is the square root of the variance divided by the number of records less one.

l Average difference from the mean l Sum of the mean subtracted from the value for each record, squared, divided by the number of records- 1, square rooted. st.dev. = (X - X ) 2 i  n - 1 Standard Deviation

GPS Example Data: Elevation Standard deviation l Same units as the values of the records, in this case meters. l The average amount by which the readings differ from the average l Can be above or below the mean l Elevation is the mean (459.2 meters), plus or minus the expected error of meters l Elevation is most likely to lie between meters and meters. l These limits are called the error band or margin of error.

Hypothesis testing l Establish NULL hypothesis (e.g. Values or Means are the same) l Establish ALTERNATIVE hypothesis, based on some expectation. l Test hypothesis. Try to reject NULL. l If null hypothesis is rejected, there is some support for the alternative (theory-based) hypothesis.

Uses of the standard deviation l Shorthand description : given the mean and s.d., we know where 67% of a random distribution lies. l A standardized measure : –a score of 80% can be good or bad, depending on the mean and s.d.

Testing the Mean l A test of means can establish whether two samples from a population are different from each other, or whether the different measures they have are the result of random variation.

Samples and populations l A sample is a set of measurements taken from a larger group or population. l Sample means and variances can serve as estimates for their populations.

Spatial analysis with GIS l GIS data description answers the question: Where? l GIS data analysis answers the question: Why is it there? l GIS data description is different from statistics because the results can be placed onto a map for visual analysis.

Spatial Statistical Description l For coordinates, the means and standard deviations correspond to the mean center and the standard distance l A centroid is any point chosen to represent a higher dimension geographic feature, of which the mean center is only one choice. l The standard distance for a set of point spatial measurements is the expected spatial error.

Spatial Statistical Description l For coordinates, data extremes define the two corners of a bounding rectangle.

Geographic extremes l Southernmost point in the continental United States. l Range: e.g. elevation difference; map extent

Mean Center mean y mean x

Centroid: mean center of a feature

GIS and Spatial Analysis l Descriptions of geographic properties such as shape, pattern, and distribution are often verbal l Quantitative measure can be devised, although few are computed by GIS. l GIS statistical computations are most often done using retrieval options such as buffer and spread. l Also by manipulating attributes with arithmetic commands (map algebra).

An example l Lower 48 United States l 1994 Data from the U.S. Census on gender l Gender Ratio = # females per 100 males l Range is l What does the spatial distribution look like?

Gender Ratio by State: 1994

Searching for Spatial Pattern l A linear relationship is a predictable straight-line link between the values of a dependent and an independent variable. It is a simple model of the relationship. l A linear relation can be tested for goodness of fit with least squares methods. The coefficient of determination r-squared is a measure of the degree of fit, and the amount of variance explained.

Simple linear relationship dependent variable independent variable observation best fit regression line y = a + bx intercept gradient y=a+bx

Testing the relationship gr = long.

Patterns in Residual Mapping l Differences between observed values of the dependent variable and those predicted by a model are called residuals. l A GIS allows residuals to be mapped and examined for spatial patterns. l A model helps explanation and prediction after the GIS analysis. l A model should be simple, should explain what it represents, and should be examined in the limits before use.

Mapping residuals from a model

Unexplained variance l More variables? l Different extent? l More records? l More spatial dimensions? l More complexity? l Another model? l Another approach?

GIS and Spatial Analysis l Many GIS systems have to be coaxed to generate a full set of spatial statistics.

Analytic Tools and GIS l Tools for searching out spatial relationships and for modeling are only lately being integrated into GIS. l Statistical and spatial analytical tools are also only now being integrated into GIS, and many people use separate software systems outside the GIS: “loosely coupled” analyses.

Analytic Tools and GIS l Real geographic phenomena are dynamic, but GISs have been mostly static. Time-slice and animation methods can help in visualizing and analyzing spatial trends. l GIS organizes real-world data to allow numerical description and allows the analyst to model, analyze, and predict with both the map and the attribute data.

You can lie with... l Maps l Statistics l Correlation is not causation!