Some Application of Statistical Methods in Data Analysis Assoc. Prof. Dr. Abdul Hamid b. Hj. Mar Iman, Former Director, Centre for Real Estate Studies,

Slides:



Advertisements
Similar presentations
Prepared by Lloyd R. Jaisingh
Advertisements

Simple Linear Regression and Correlation by Asst. Prof. Dr. Min Aung.
CHAPTER TWELVE ANALYSING DATA I: QUANTITATIVE DATA ANALYSIS.
1 Chapter 20: Statistical Tests for Ordinal Data.
Kruskal Wallis and the Friedman Test.
Parametric & Nonparametric Models for Tests of Association Models we will consider X 2 Tests for qualitative variables Parametric tests Pearson’s correlation.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
Basic Statistics The Chi Square Test of Independence.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Learning Objectives 1 Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Correlation and Regression
Ordinal Data. Ordinal Tests Non-parametric tests Non-parametric tests No assumptions about the shape of the distribution No assumptions about the shape.
Comparing k Populations Means – One way Analysis of Variance (ANOVA)
Principles of Statistics Assoc. Prof. Dr. Abdul Hamid b. Hj. Mar Iman Former Director, Centre for Real Estate Studies Faculty of Geoinformation Science.
Linear Regression and Correlation
The Simple Regression Model
Final Review Session.
Analysis of Differential Expression T-test ANOVA Non-parametric methods Correlation Regression.
Chapter 19 Data Analysis Overview
Lecture 12 One-way Analysis of Variance (Chapter 15.2)
Educational Research by John W. Creswell. Copyright © 2002 by Pearson Education. All rights reserved. Slide 1 Chapter 8 Analyzing and Interpreting Quantitative.
Chapter 14 Introduction to Linear Regression and Correlation Analysis
Summary of Quantitative Analysis Neuman and Robson Ch. 11
Statistical Analysis KSE966/986 Seminar Uichin Lee Oct. 19, 2012.
Inferential Statistics
Leedy and Ormrod Ch. 11 Gray Ch. 14
Hypothesis Testing Charity I. Mulig. Variable A variable is any property or quantity that can take on different values. Variables may take on discrete.
Copyright © Cengage Learning. All rights reserved. 13 Linear Correlation and Regression Analysis.
Correlation.
Anthony Greene1 Correlation The Association Between Variables.
Statistical Significance R.Raveendran. Heart rate (bpm) Mean ± SEM n In men ± In women ± The difference between means.
Analysis of Variance ( ANOVA )
EQT 272 PROBABILITY AND STATISTICS
RESEARCH MANAGEMENT DATA ANALYSIS Assoc. Prof. Dr. Abdul Hamid b. Hj. Mar Iman, Centre for Real Estate Studies, Faculty of Geoinformation Engineering &
Analysis of Variance ST 511 Introduction n Analysis of variance compares two or more populations of quantitative data. n Specifically, we are interested.
Statistics 11 Correlations Definitions: A correlation is measure of association between two quantitative variables with respect to a single individual.
A Repertoire of Hypothesis Tests  z-test – for use with normal distributions and large samples.  t-test – for use with small samples and when the pop.
Analyzing Data: Comparing Means Chapter 8. Are there differences? One of the fundament questions of survey research is if there is a difference among.
Common Nonparametric Statistical Techniques in Behavioral Sciences Chi Zhang, Ph.D. University of Miami June, 2005.
Chap 12-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 12 Introduction to Linear.
SUMMARY FOR EQT271 Semester /2015 Maz Jamilah Masnan, Inst. of Engineering Mathematics, Univ. Malaysia Perlis.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 10-1 Chapter 10 Two-Sample Tests and One-Way ANOVA Business Statistics, A First.
Stats 2022n Non-Parametric Approaches to Data Chp 15.5 & Appendix E.
Hypothesis testing Intermediate Food Security Analysis Training Rome, July 2010.
Recap of data analysis and procedures Food Security Indicators Training Bangkok January 2009.
C M Clarke-Hill1 Analysing Quantitative Data Forming the Hypothesis Inferential Methods - an overview Research Methods.
STATISTICAL ANALYSIS FOR THE MATHEMATICALLY-CHALLENGED Associate Professor Phua Kai Lit School of Medicine & Health Sciences Monash University (Sunway.
© Copyright McGraw-Hill Correlation and Regression CHAPTER 10.
Chapter 13 CHI-SQUARE AND NONPARAMETRIC PROCEDURES.
Chapter Seventeen. Figure 17.1 Relationship of Hypothesis Testing Related to Differences to the Previous Chapter and the Marketing Research Process Focus.
Hypothesis Testing. Why do we need it? – simply, we are looking for something – a statistical measure - that will allow us to conclude there is truly.
Angela Hebel Department of Natural Sciences
Chap 18-1 Copyright ©2012 Pearson Education, Inc. publishing as Prentice Hall Chap 18-1 Chapter 18 A Roadmap for Analyzing Data Basic Business Statistics.
NON-PARAMETRIC STATISTICS
Chapter 15 The Chi-Square Statistic: Tests for Goodness of Fit and Independence PowerPoint Lecture Slides Essentials of Statistics for the Behavioral.
Chapter 13 Understanding research results: statistical inference.
SUMMARY EQT 271 MADAM SITI AISYAH ZAKARIA SEMESTER /2015.
Copyright © 2008 by Nelson, a division of Thomson Canada Limited Chapter 18 Part 5 Analysis and Interpretation of Data DIFFERENCES BETWEEN GROUPS AND RELATIONSHIPS.
Interpretation of Common Statistical Tests Mary Burke, PhD, RN, CNE.
Non-parametric Tests Research II MSW PT Class 8. Key Terms Power of a test refers to the probability of rejecting a false null hypothesis (or detect a.
Determining and Interpreting Associations between Variables Cross-Tabs Chi-Square Correlation.
Bivariate analysis. * Bivariate analysis studies the relation between 2 variables while assuming that other factors (other associated variables) would.
Introduction to Marketing Research
Basic Statistics The Chi Square Test of Independence.
Correlation – Regression
CHOOSING A STATISTICAL TEST
قياس المتغيرات في المنهج الكمي
Non – Parametric Test Dr. Anshul Singh Thapa.
NONPARAMETRIC METHODS
15.1 The Role of Statistics in the Research Process
Presentation transcript:

Some Application of Statistical Methods in Data Analysis Assoc. Prof. Dr. Abdul Hamid b. Hj. Mar Iman, Former Director, Centre for Real Estate Studies, Universiti Teknologi Malaysia.

Forms of “statistical” relationship Correlation Contingency Cause-and-effect * Causal * Feedback * Multi-directional * Recursive The last two categories are normally dealt with through regression

Statistical Data Analysis Methods – A Summary Scale of measurement One-sampleTwo independent Sample K independent Sample Measures of Association Independent Sample Single treatment repeat Measures Multiple treatment repeat Measures NominalBinomial test; one-way contingency Table McNemar test Cochrane Q Test Two-way contingency Table Contingency Table Contingency Coefficients OrdinalRuns testWilcoxon signed rank test Friedman test Mann- Whitney Test Kruskal- Wallis Test Spearman rank Correlation Interval/ratioZ- or t-test of variance Paired t-testRepeat measures ANOVA Unpaired t-test; tests of variance ANOVARegression, Pearson correlation, time series

One-Sample Test McNemar Test: tests for change in a sample upon a “treatment”. Example. Two condominium projects K&L. Respondents decide their preferences for K or L before and after “advertising”. Hypothesis: Advertising does not influence buyers to change their mind on product choice BeforeAfter Project LProject K A = 40B = 60 Project LC = 30D = people switch from K to L and 50 people switch from L to K before and after

One-Sample Test (contd.) Test statistics: r c Q =  (0 ij – E ij ) 2 /E ij i=1 j=1 where E = (A+D)/2 Therefore, r c Q =  (0 ij – E ij ) 2 /E ij i=1 j=1 [A-(A+D)/2] 2 [D-(A+D)/2] 2 (A-D) = (A+D)/2 (A+D)/2 A+D Thus, Q = (40-45) 2 /(40+45) = 25/85 = 0.29  (2-1)(2-1); 0.05 = 3.84 H o not rejected. No influence of advertising on choice of project

One-Sample Test (contd.) Friedman Test: tests for equal preferences for something of various characteristics. Example. Buyers’ rank of preference for three condominium types A, B, C. Hypothesis: Buyers’ preferences for all condo type do not differ Resp.Type A Type B Type C Man231 Min123 Lee132 Ling312 Dass123 Total811

One-Sample Test (contd.) Test statistics: (n-1)k k F r =  R j 2 – 3n(k+1) nk(k+1) j=1 where n = sample size, k = number of categories; R = is column’s total For large n and k, F r follows X 2 (k-1);α

One-Sample Test (contd.) (5-1)3 F = [ ] – 3x5(3+1) 5x3(3+1) = 1.2 X 2 (3-1); 0.05 = 5.99 H o not rejected. Buyers do not show different preference for condo type

One-Sample Test (contd.) Repeated measures ANOVA: tests outcome of a phenomenon under different conditions. Example. Waiting time at junctions in the city area to determine level of congestion at different times of the day. Test statistics: t/(m-1) F = r/[(n-1)(m-1) where t = sum of squares due to treatment, r =sum of squares of residual, m = number of treatment, n = number of observations. Critical region based on: F v1. v2; α where v1 = (m-1), v2 = (n-1)(m-1)

One-Sample Test (contd.) Waiting time at junction (min.)Row meanSum Sq. about row mean(Wi) MorningNoonEvening Junction Junction Junction Junction Junction Column mean M = 6.00W = 23.34

One-sample test (contd.) m n T =   (c ij – M) 2 i=1 j=1 = 30 W i =  (c ij – ) 2 = B = m  ( - M) 2 = 6.65 t = n  ( - M) 2 = 10 W = t + r r = W – t = – 10 = 13.34

One-Sample Test (contd.) 10/(3-1) F c = = /(5-1)(3-2) F t (3-1),(3-1)(5-1); 0.05 = 4.46 H o not rejected. Congestion is quite the same at all times during the day.

Two-Sample Test Two-way Contingency Table: test whether two independent groups differ on a given characteristic. Hypothesis: choice for type of house does not relate to location. Test: GroupTotal (R) Inner suburbs Outer suburbs Terraced Semi- detached Total (C) r c Q =  (0 ij – E ij ) 2 /E ij i=1 j=1

Two-Sample Test (contd.) D.o.f. = (r-1)(c-1), where r=number of rows, c=number of columns E ij = R i C j /N Inner suburbsOuter suburbs Terraced125 x 80/180 = x 100/180 = 69.4 Semi- detached 55 x 80/180 = x 100/180 = 30.6 Q = ( ) 2 / ( ) 2 / ( ) 2 / ( ) 2 /30.6 = 3.33  (2-1)(2-1); 0.05 = 3.84 H o not rejected

K Independent Test - Correlation “Co-exist”.E.g. * left shoe & right shoe, sleep & lying down, food & drink Indicate “some” co-existence relationship. E.g. * Linearly associated (-ve or +ve) * Co-dependent, independent But, nothing to do with C-A-E r/ship! Example: After a field survey, you have the following data on the distance to work and distance to the city of residents in J.B. area. Interpret the results? data Formula:

K Independent Test - Correlation and regression – matrix approach

Correlation and regression – matrix approach

Test yourselves! Q1: Calculate the min and std. variance of the following data: Q2: Calculate the mean price of the following low-cost houses, in various localities across the country: PRICE - RM ‘ SQ. M OF FLOOR PRICE - RM ‘000 (x) NO. OF LOCALITIES (f)

Test yourselves! Q3: From a sample information, a population of housing estate is believed have a “normal” distribution of X ~ (155, 45). What is the general adjustment to obtain a Standard Normal Distribution of this population? Q4: Consider the following ROI for two types of investment: A: 3.6, 4.6, 4.6, 5.2, 4.2, 6.5 B: 3.3, 3.4, 4.2, 5.5, 5.8, 6.8 Decide which investment you would choose.

Test yourselves! Q5: Find:  (AGE > “30-34”)  (AGE ≤ 20-24)  ( “35-39”≤ AGE < “50-54”)

Test yourselves! Q6: You are asked by a property marketing manager to ascertain whether or not distance to work and distance to the city are “equally” important factors influencing people’s choice of house location. You are given the following data for the purpose of testing: Explore the data as follows: Create histograms for both distances. Comment on the shape of the histograms. What is you conclusion? Construct scatter diagram of both distances. Comment on the output. Explore the data and give some analysis. Set a hypothesis that means of both distances are the same. Make your conclusion.

Perception about Influence of New Neighbourhood Degree of perception Locality Total BblautPatau1Patau2Racha2 Not worried at all Not so worried Worried Quite worried So Worried Total Q 7. You have surveyed a group of local people and asked them to express their feeling about a new project that will attract a new population and thus a new neighbourhood. You believe that the local people are concerned about the negative influence the new neighbourhood will have on them as a result of the proposed project. Using the collected data, test your hypohesis.

Test yourselves! (contd.) Q7: From your initial investigation, you belief that tenants of “low-quality” housing choose to rent particular flat units just to find shelters. In this context,these groups of people do not pay much attention to pertinent aspects of “quality life” such as accessibility, good surrounding, security, and physical facilities in the living areas. (a)Set your research design and data analysis procedure to address the research issue (b) Test your hypothesis that low-income tenants do not perceive “quality life” to be important in paying their house rentals.