Spatial Association Defining the relationship between two variables.

Slides:



Advertisements
Similar presentations
Prepared by Lloyd R. Jaisingh
Advertisements

Contingency Table Analysis Mary Whiteside, Ph.D..
Chapter 18: The Chi-Square Statistic
Lesson 10: Linear Regression and Correlation
CORRELATION. Overview of Correlation u What is a Correlation? u Correlation Coefficients u Coefficient of Determination u Test for Significance u Correlation.
Hypothesis Testing Steps in Hypothesis Testing:
COURSE: JUST 3900 INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE Instructor: Dr. John J. Kerbs, Associate Professor Joint Ph.D. in Social Work and Sociology.
Hypothesis: It is an assumption of population parameter ( mean, proportion, variance) There are two types of hypothesis : 1) Simple hypothesis :A statistical.
Chi Square Tests Chapter 17.
Chapter 13: The Chi-Square Test
Chapter 11 Contingency Table Analysis. Nonparametric Systems Another method of examining the relationship between independent (X) and dependant (Y) variables.
Correlation and Autocorrelation
QUANTITATIVE DATA ANALYSIS
PY 427 Statistics 1Fall 2006 Kin Ching Kong, Ph.D Lecture 12 Chicago School of Professional Psychology.
Statistical Analysis SC504/HS927 Spring Term 2008 Week 17 (25th January 2008): Analysing data.
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Cross Tabulation and Chi-Square Testing. Cross-Tabulation While a frequency distribution describes one variable at a time, a cross-tabulation describes.
AM Recitation 2/10/11.
This Week: Testing relationships between two metric variables: Correlation Testing relationships between two nominal variables: Chi-Squared.
LIS 570 Summarising and presenting data - Univariate analysis continued Bivariate analysis.
Selecting the Correct Statistical Test
Statistics for the Behavioral Sciences
1 Psych 5500/6500 Chi-Square (Part Two) Test for Association Fall, 2008.
EDRS 6208 Analysis and Interpretation of Data Non Parametric Tests
Chapter 26: Comparing Counts AP Statistics. Comparing Counts In this chapter, we will be performing hypothesis tests on categorical data In previous chapters,
Irkutsk State Medical University Department of Faculty Therapy Correlations Khamaeva A. A. Irkutsk, 2009.
Statistics 11 Correlations Definitions: A correlation is measure of association between two quantitative variables with respect to a single individual.
Chi-Square as a Statistical Test Chi-square test: an inferential statistics technique designed to test for significant relationships between two variables.
Chi-square (χ 2 ) Fenster Chi-Square Chi-Square χ 2 Chi-Square χ 2 Tests of Statistical Significance for Nominal Level Data (Note: can also be used for.
Chapter 9: Non-parametric Tests n Parametric vs Non-parametric n Chi-Square –1 way –2 way.
Correlation and Regression Used when we are interested in the relationship between two variables. NOT the differences between means or medians of different.
1 Chi-Square Heibatollah Baghi, and Mastee Badii.
Tests for Random Numbers Dr. Akram Ibrahim Aly Lecture (9)
Correlation Patterns.
Chapter 20 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 These tests can be used when all of the data from a study has been measured on.
Chapter 16 The Chi-Square Statistic
Final review - statistics Spring 03 Also, see final review - research design.
© 2014 by Pearson Higher Education, Inc Upper Saddle River, New Jersey All Rights Reserved HLTH 300 Biostatistics for Public Health Practice, Raul.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
CHI SQUARE TESTS.
Inferential Statistics. The Logic of Inferential Statistics Makes inferences about a population from a sample Makes inferences about a population from.
Chapter 11: Chi-Square  Chi-Square as a Statistical Test  Statistical Independence  Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
N318b Winter 2002 Nursing Statistics Specific statistical tests Chi-square (  2 ) Lecture 7.
Copyright © 2010 Pearson Education, Inc. Warm Up- Good Morning! If all the values of a data set are the same, all of the following must equal zero except.
Statistics in Applied Science and Technology Chapter14. Nonparametric Methods.
BASIC STATISTICAL CONCEPTS Chapter Three. CHAPTER OBJECTIVES Scales of Measurement Measures of central tendency (mean, median, mode) Frequency distribution.
Week 13a Making Inferences, Part III t and chi-square tests.
Chapter 14 Chi-Square Tests.  Hypothesis testing procedures for nominal variables (whose values are categories)  Focus on the number of people in different.
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Chi Square & Correlation
Copyright © 2014 by Nelson Education Limited Chapter 11 Introduction to Bivariate Association and Measures of Association for Variables Measured.
Chapter 13. The Chi Square Test ( ) : is a nonparametric test of significance - used with nominal data -it makes no assumptions about the shape of the.
Correlation. u Definition u Formula Positive Correlation r =
Chapter 14 – 1 Chi-Square Chi-Square as a Statistical Test Statistical Independence Hypothesis Testing with Chi-Square The Assumptions Stating the Research.
Chi Square Tests Chapter 17. Assumptions for Parametrics >Normal distributions >DV is at least scale >Random selection Sometimes other stuff: homogeneity,
Ch 13: Chi-square tests Part 2: Nov 29, Chi-sq Test for Independence Deals with 2 nominal variables Create ‘contingency tables’ –Crosses the 2 variables.
Chapter 10 Section 5 Chi-squared Test for a Variance or Standard Deviation.
Chapter 11: Categorical Data n Chi-square goodness of fit test allows us to examine a single distribution of a categorical variable in a population. n.
Comparing Observed Distributions A test comparing the distribution of counts for two or more groups on the same categorical variable is called a chi-square.
Determining and Interpreting Associations between Variables Cross-Tabs Chi-Square Correlation.
Bivariate Association. Introduction This chapter is about measures of association This chapter is about measures of association These are designed to.
Cross Tabulation with Chi Square
Chapter 9: Non-parametric Tests
INTRODUCTORY STATISTICS FOR CRIMINAL JUSTICE
Slides to accompany Weathington, Cunningham & Pittenger (2010), Chapter 16: Research with Categorical Data.
The Chi-Square Distribution and Test for Independence
Hypothesis testing. Chi-square test
UNIT V CHISQUARE DISTRIBUTION
S.M.JOSHI COLLEGE, HADAPSAR
Chapter 18: The Chi-Square Statistic
Presentation transcript:

Spatial Association Defining the relationship between two variables.

Method Depends On Data Type The statistical/spatial analysis method is the function of measurement level and the spatial data model. NominalOrdinalInterval/ Ratio Nominal Chi-sq Median by Nominal Class Mean by nominal class K-S test Ordinal Rank Correlation Coefficient Mean of the ordinal class Interval/ Ratio Co-variance Cross correlation Correlation

Chi-Square Chi-Square can be used to compare: –Area A to Area B –Area A to Line –Area A to Point The hypothesis is always the same: –HO: The distribution of observations across Area A is equal to the Expected Distribution, where the Expected Distribution is usually CSR. Or does the distribution of Area A explains the distribution of the observations, assume that random observations would be distributed proportionally across Area A. –HA: Not equal to Expected Distribution, indicating a potential first order effect.

Chi-Square

Advantages: –Easy to compute and interpret –Non-parametric, distribution neutral –“Easy” to determine expected values – proportional to area –Can be applied to nominal (count) data. Disadvantages: –Results are influenced by the scale of the observations – use other indices –Ideal for points, more problematic with areas and lines. –Influenced by zone systems – arbitrary areas

Cramer V Statistic Cramer's coefficient is a measure of association that ranges from 0 to 1. A Cramer's coefficient of 0 indicates that the calculated chi-square is 0, i.e., the observed frequencies are all equal to the expected frequencies. This means that the there is perfect independence between the rows and columns and the column variable provides no information about the row variable. A Cramer's coefficient of 1 indicates that the calculated chi-square is the highest possible chi-square value [n(L-1)]; this indicates a perfect relationship between the rows and columns -- the column variable provides perfect information about the row variable. A V greater the 0.7 is strong, between is moderate, between is weak. V equals the square root of chi-square divided by sample size, n, times m, which is the smaller of (rows - 1) or (columns - 1): –V = SQRT(X 2 /nm).

Chi-Square Calculate the chi-square statistic and the Cramer's coefficient for the following data. Test for significance at the 0.05 level. The Table value for a Chi-square statistic with 4 degrees of freedom at the 0.05 level is is between right-tail probability of 0.7 and 0.5 Where V = [3.28 / 16 * 1] 1/2 Veg TypeVeg Area sq.km. Fraction of Area Observed Fire area sq km Expected Fire Area sq km Chi-sq A B C D E Total V = 0.453

Chi-Square Chi-square is sensitive to scale. The bigger the numbers the large chi-square. V will normalize for scale. Here, although the chi-square is very high the results may still be only moderately strong with a V of (note vegetation type a and b). Veg TypeVeg Area haFraction of Area Observed Fire area haExpected Fire Area haChi-sq A B C D E Total V= 0.453

Kolmogorov – Smirnov Test Compare Observed CFD to Expect CDF –HO: Observed EQ Expect –HA: Observed NE Expect, indication of a 1st order effect Expected can be any distribution – usually CSR. Advantages: –Ideal for comparing points to fields, more problematic with areas and lines. –Nonparametric – distribution neutral –Easy to compute and interpret Disadvantages: –How to compute the Expect CDF? –Use random number of point with the same sample size as the observed. If the sample is “small” you random points may not appear random. –Create a CDF for the population using all measurements or a large sample. In this case you are sampling the environment and you are asking are the sample points randomly located across the environment.

K-S Test Archeology sites vs. distance from wadis Random n = 250 Sites n = 84 P=0.01; Dmax = 0.21 P=0.05; Dmax = 0.17 P=0.10; Dmax = 0.15 A Dmax = 0.12 indicates the distance distributions may be the same

Dmax = 0.98 – 0.86 = 0.12

Analysis of Environmental Justice Point in Polygon Analysis

Erie Chi-Squared V = 0.86V = 0.37

Interpreting Chi Square Zero indicates no relationship Large numbers indicate stronger relationship Or, a table of significance can be consulted to determine if the specific value is statistically significant The fact that we have shown that there is a correlation between variables does NOT mean that we have found out anything about WHY this is so. In our analysis we might state our assumptions as to why this is so, but we would need to perform other analyses to show causation.

Spatial Correspondence of Areal Distributions Quadrat and nearest-neighbor analysis deal with a single distribution of points Often, we want to measure the distribution of two or more variables The coefficient of Areal correspondence and chi-square statistics perform these tasks

Coefficient of Areal Correspondence Simple measure of the extent to which two distributions correspond to one another –Compare wheat farming to areas of minimal rainfall Based on the approach of overlay analysis

Overlay Analysis Two distributions of interest are mapped at the same scale and the outline of one is overlaid with the other

Coefficient of Areal Correspondence CAC is the ratio between the area of the region where the two distributions overlap and the total area of the regions covered by the individual distributions of the entire region

Result of CAC Where there is no correspondence, CAC is equal to 0 Where there is total correspondence, CAC is equal to 1 CAC provides a simple measure of the extent of spatial association between two distributions, but it cannot provide any information about the statistical significance of the relationship

Resemblance Matrix Proposed by Court (1970) Advantages over CAC –Limits are –1 to +1 with a perfect negative correspondence given a value of –1 –Sampling distribution is roughly normal, so you can test for statistical significance