AN EXPLORATION OF SCHOOL QUALITY, HOUSE PRICES AND GEOGRAPHIC LOCATION IN WELLINGTON, NEW ZEALAND Sarah Crilly Higher Diploma in Data Science and Analytics.

Presentation transcript:

AN EXPLORATION OF SCHOOL QUALITY, HOUSE PRICES AND GEOGRAPHIC LOCATION IN WELLINGTON, NEW ZEALAND
Sarah Crilly
Higher Diploma in Data Science and Analytics
Supervisor: Aengus Daly
In association with Dr Mairead de Roiste of Victoria University of Wellington and Dr Toby Daglish of the New Zealand Institute for the Study of Competition and Regulation (ISCR)

PROJECT BACKGROUND
- Carried out in conjunction with Victoria University of Wellington, New Zealand, and the New Zealand Institute for the Study of Competition and Regulation (ISCR)
- The wider project explores the interrelated decisions of where you live, how many cars you own and how you commute to work
- A model called Wellington-Spatial Econometric Transport (W-SET) is used to analyse these choices
- House prices affect residential location decisions
- The quality of available schools plays an important role in this pricing

PROJECT QUESTIONS
- Can the school data available measure school quality?
- What are the best school quality measures?
- Is there a relationship between school quality and house prices?

INTERNATIONAL PERSPECTIVE ON SCHOOL QUALITY
Internationally, the key school quality measures were found to be:
- Word of mouth (“good school”)
- Test or Assessment Scores
- Ethnicity of Students
- Value-added Education Outcomes
- Expenditure per Pupil
- Student/Teacher Ratio
All measures bar the first were available for this analysis

METHODOLOGY AND ANALYSIS
Statistical, GIS and Machine Learning techniques were used to explore School Quality and House Prices. Techniques included:
- Multiple Linear Regression
- K-Means Clustering
- GIS Nearest Variable Algorithm
- GIS Residuals Mapping
- Moran’s I Spatial Autocorrelation
- Thiessen Polygons
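The nearest-school lookup and the Thiessen polygon idea listed above are closely related: a location falls inside the Thiessen polygon of whichever site is nearest to it. The slides do not describe how the GIS tools were configured, so the following is only a minimal sketch under assumptions. The function name `nearest_schools`, the school names and the coordinates are all hypothetical, and distances are treated as straight-line distances in a projected (planar) coordinate system.

```python
import math

def nearest_schools(point, schools, k=3):
    """Return the names of the k schools closest to `point` (planar distance)."""
    return sorted(schools, key=lambda name: math.dist(point, schools[name]))[:k]

# Hypothetical projected coordinates (e.g. metres); not real school locations.
schools = {"A": (1.0, 0.0), "B": (0.0, 2.0), "C": (3.0, 3.0), "D": (5.0, 5.0)}
meshblock_centroid = (0.0, 0.0)

# The three closest schools to a meshblock centroid.
three_closest = nearest_schools(meshblock_centroid, schools, k=3)

# With k=1 the result is the school whose Thiessen polygon contains the point.
thiessen_owner = nearest_schools(meshblock_centroid, schools, k=1)[0]
```

A real analysis would more likely use network or travel distance rather than straight-line distance, which is one reason the sketch should not be read as the project's actual method.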

SCHOOL DECILE
- School decile is a measure specific to NZ
- It measures the socioeconomic status of the students at each school
- The socioeconomic variables used are linked to educational achievement
- Schools in Wellington are high decile with a mean of 6.4

K MEANS CLUSTERING OF PRIMARY AND INTERMEDIATE SCHOOLS
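The transcript keeps only the title of this slide, so the variables that were clustered are not known. As a rough illustration of the technique itself, here is a minimal sketch of Lloyd's k-means algorithm in plain Python. The two features (decile and a pass rate), the data values and the starting centroids are hypothetical, chosen only to make the two steps of the algorithm visible.

```python
def sq_dist(p, q):
    """Squared Euclidean distance between two points of equal dimension."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def kmeans(points, centroids, max_iter=100):
    """Lloyd's algorithm: alternate assignment and centroid update until stable."""
    centroids = [tuple(c) for c in centroids]
    labels = []
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest centroid.
        new_labels = [
            min(range(len(centroids)), key=lambda k: sq_dist(p, centroids[k]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing, so the algorithm has converged
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for k in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:
                centroids[k] = tuple(sum(c) / len(members) for c in zip(*members))
    return labels, centroids

# Hypothetical (decile, pass rate) pairs; not the project's data.
points = [(1.0, 1.0), (1.0, 2.0), (9.0, 9.0), (10.0, 9.0)]
labels, centroids = kmeans(points, centroids=[(1.0, 1.0), (10.0, 9.0)])
```

In practice the features would normally be standardised first, since k-means is sensitive to the scale of each variable.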

MULTIPLE LINEAR REGRESSION OF SECONDARY SCHOOL MEASURES
- High, statistically significant correlation between Decile, NCEA Level 3 results and Ethnicity
- R square of [value missing]
But!
- Small sample
- High collinearity of some variables
- Durbin Watson statistic of 1.534
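The Durbin Watson statistic quoted on this slide is computed from the regression residuals: the sum of squared differences between successive residuals, divided by the sum of squared residuals. Values near 2 suggest little first-order autocorrelation, values towards 0 suggest positive autocorrelation, and values towards 4 suggest negative autocorrelation. The sketch below shows only that formula; the residual values are made up for illustration and are not the project's residuals.

```python
def durbin_watson(residuals):
    """Durbin-Watson statistic for residuals taken in observation order."""
    numerator = sum(
        (residuals[t] - residuals[t - 1]) ** 2 for t in range(1, len(residuals))
    )
    denominator = sum(e * e for e in residuals)
    return numerator / denominator

# Hypothetical residuals that flip sign every observation (negative autocorrelation).
dw_alternating = durbin_watson([1.0, -1.0, 1.0, -1.0])

# Hypothetical residuals that never change (extreme positive autocorrelation).
dw_constant = durbin_watson([1.0, 1.0, 1.0])
```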

HOUSE PRICES IN WELLINGTON
- House prices are measured using the Median Current Value (CV) of the meshblock
- Prices range from $130k to $2.8m with a mean of $423k

MULTIPLE LINEAR REGRESSION OF HOUSE PRICES
- House prices as the dependent variable
- School Decile and % of Māori and Pasifika students at the three closest schools as independent variables
Findings
- R square of 0.304, Durbin Watson of [value missing]
- High collinearity seen between decile and the proportion of Māori and Pasifika students at the closest school; excluding these variables is problematic
- Large amount of variation unaccounted for by the model
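Collinearity of the kind reported here is often quantified with the variance inflation factor (VIF). For the special case of exactly two predictors, the VIF of either one reduces to 1 / (1 - r^2), where r is the Pearson correlation between them. The sketch below illustrates only that special case. The function names and the data are hypothetical, and the slides do not state which collinearity diagnostic was actually reported.

```python
import math

def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mean_x, mean_y = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def vif_two_predictors(x, y):
    """VIF shared by both predictors when the model has exactly two of them."""
    r = pearson_r(x, y)
    return 1.0 / (1.0 - r * r)

# Hypothetical predictor values; not the project's decile or ethnicity data.
vif_moderate = vif_two_predictors([1.0, 2.0, 3.0], [1.0, 3.0, 2.0])            # r = 0.5
vif_none = vif_two_predictors([1.0, 2.0, 3.0, 4.0], [1.0, -1.0, -1.0, 1.0])    # r = 0
```

A VIF of 1 means a predictor is uncorrelated with the other; the value grows without bound as the correlation approaches plus or minus 1, which is why dropping one of a highly collinear pair is a common but not cost-free remedy.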

MAPPING OF HOUSE PRICE REGRESSION RESIDUALS
- Residuals from the regression were mapped to check whether spatial autocorrelation exists
- Moran’s I test was run against the data
- A z-score of 235 and a p-value of [value missing] were found
- Strong clustering and spatial autocorrelation
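Global Moran's I compares each value's deviation from the mean with the deviations of its spatial neighbours: I = (n / W) * sum_ij w_ij (x_i - m)(x_j - m) / sum_i (x_i - m)^2, where w_ij are spatial weights and W is their total. The sketch below computes only the index, not the z-score or p-value that a GIS tool would also report. The four-location layout, the binary neighbour weights and the values are assumptions made for illustration.

```python
def morans_i(values, weights):
    """Global Moran's I for `values` given a square spatial weights matrix."""
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]
    total_weight = sum(sum(row) for row in weights)
    cross = sum(
        weights[i][j] * dev[i] * dev[j] for i in range(n) for j in range(n)
    )
    return (n / total_weight) * cross / sum(d * d for d in dev)

# Hypothetical layout: four locations in a row, each adjacent to the next.
weights = [
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
]

# Similar values sit next to each other: positive spatial autocorrelation.
i_clustered = morans_i([1.0, 1.0, -1.0, -1.0], weights)

# Values alternate between neighbours: negative spatial autocorrelation.
i_alternating = morans_i([1.0, -1.0, 1.0, -1.0], weights)
```

Positive values indicate that neighbouring locations tend to have similar residuals, which is the pattern the slide describes as clustering.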

CONCLUSIONS
- School Decile, Assessment Scores and proportion of Māori and Pasifika students are likely school quality measures
- Analysis is not conclusive as it is assumed that decile is a school quality measure
- Weak but statistically significant association between school quality measures and house prices
- Spatial autocorrelation clearly demonstrates that additional factors with a geographic component are present

RECOMMENDATIONS
- Further investigation of school quality measures
- Add time component by calculating Value Added Outcomes
- Refine school availability component, factoring in school types and distance travelled

Thanks!