Analyze Statistic by Using SPSS

Presentation transcript:

Analyze Statistic by Using SPSS (Statistics and Using SPSS), 3rd Day. Fadwa Flemban

Numerical Miracles in the Holy Qur'an
The number 7 carries great significance in the Qur'an, the universe, and life. The Arabic alphabet (the language of the Qur'an) has 28 letters (that is, 7 × 4), and the authentic hadith ("The Qur'an was revealed in seven ahruf") confirms that the number 7 is connected to the Qur'an. God Almighty created seven heavens and seven earths and made the week seven days. The rites of the Hajj rely on the number 7 (seven circuits in the tawaf and the sa'i) and seven pebbles. Whoever does not believe in all this is recompensed with the fire of Hell, for which God Almighty created seven gates.
Note: the word "Jahannam" (Hell) occurs in the Qur'an 77 times, that is, 7 × 11. Nor should we forget that the greatest surah in the Qur'an is Al-Fatihah, which God named "the Seven Oft-Repeated"; it has 7 verses. Likewise, the phrase "the seven heavens" (and "seven heavens") occurs in the Qur'an exactly 7 times.
The word "seven" (sab'ah) occurs in the Qur'an 4 times, in the following verses:
1- "then a fast of three days during the Hajj and of seven when you have returned" [Al-Baqarah: 196]
2- "It has seven gates; to each gate is an assigned portion of them" [Al-Hijr: 44]
3- "and they say: seven, and the eighth of them was their dog" [Al-Kahf: 22]
4- "with seven seas after it, the words of God would not be exhausted" [Luqman: 27]
Surah name:    Al-Baqarah   Al-Hijr   Al-Kahf   Luqman
Verse number:  196          44        22        27
272244196 = 7 × 38892028 = 7 × 7 × 5556004
Therefore, the number representing the four verses (in which the word "seven" occurs) is divisible by 7 twice in succession. Who, then, arranged the positions of this word in such striking proportion to the number 7? Is it not God?

Chi-Squared Tests
(1) Goodness-of-fit tests
(2) Independence tests
(3) Homogeneity tests

(1) Goodness-of-fit tests
Used to compare the distribution of the data with several probability distributions, namely:
1- Normal distribution
2- Poisson distribution
3- Exponential distribution
4- Uniform distribution

(1) Goodness-of-fit tests
Hypotheses of the test:
H0: The data are consistent with distribution S.
H1: The data are not consistent with distribution S.

Goodness-of-fit tests: Example
The following data represent the number of persons who ate dinner in a small restaurant on 50 days:
20 12 16 19 24 6 10 1 15 23 8 30 25 7 22 5 14 27 21 18 4 17 9
Does the number of persons who ate dinner in the restaurant follow the normal distribution at the 0.05 level of significance?

Solution
H0: The data are consistent with the normal distribution.
H1: The data are not consistent with the normal distribution.

Normality test, two ways. Way (1): Analyze > Descriptive Statistics > Explore > Plots > check "Normality plots with tests"

Normality test for (male)

Normality test for (female)

Output: Since all the points fall on or around the straight line, the sample follows the normal distribution.

Normality test, two ways. Way (2): Analyze > Nonparametric Tests > 1-Sample Kolmogorov-Smirnov test


Output: P-value (0.898) > α (0.05). We do not reject H0: the number of persons who ate dinner in the restaurant follows the normal distribution at the 95% confidence level.

Repeat the same steps, but choose Poisson as the test distribution.

Output: P-value (0.047) < α (0.05). We reject H0: the number of persons who ate dinner in the restaurant does not follow the Poisson distribution at the 95% confidence level.

(2) Independence tests
Hypotheses of the test:
H0: The two variables are independent.
H1: The two variables are not independent, i.e., there is a relationship between them.

Independence tests: Example
In a study of the relationship between a university student's grade and gender: is there a relationship between the student's grade and gender?
F B A D Female C F C B A Male D

Solution
H0: The student's grade and gender are independent.
H1: There is a relationship between the student's grade and gender.

Analyze > Descriptive Statistics > Crosstabs

Crosstabs window: press the Statistics button

Chi-square test of independence

Output: P-value = 0.656 > 0.05. We do not reject H0: the two variables are independent.

(3) Homogeneity tests
Used to answer the question: are the observed frequencies distributed homogeneously (identically) across the categories of the population?
Hypotheses of the test:
H0: Pi1 = Pi2 = … = Pis, or σ²1 = σ²2 = … = σ²i
H1: At least one of the null hypothesis statements is false.

Homogeneity tests: Example for clarification
In a study of the television viewing habits of children, a developmental psychologist selects a random sample of 300 first graders: 100 boys and 200 girls. Do the boys' preferences for these TV programs differ significantly from the girls' preferences? Use a 0.05 level of significance.

               Lone Ranger   Sesame Street   The Simpsons   Row total
Boys                50             30              20           100
Girls               50             80              70           200
Column total       100            110              90           300

Mathematical Solution
H0: P(boys who prefer Lone Ranger) = P(girls who prefer Lone Ranger)
H0: P(boys who prefer Sesame Street) = P(girls who prefer Sesame Street)
H0: P(boys who prefer The Simpsons) = P(girls who prefer The Simpsons)
H1: At least one of the null hypothesis statements is false.
DF = (r - 1) * (c - 1) = (2 - 1) * (3 - 1) = 2
E(r,c) = (n_r * n_c) / n
E(1,1) = (100 * 100) / 300 = 10000/300 = 33.3
E(1,2) = (100 * 110) / 300 = 11000/300 = 36.7
E(1,3) = (100 * 90) / 300 = 9000/300 = 30.0
E(2,1) = (200 * 100) / 300 = 20000/300 = 66.7
E(2,2) = (200 * 110) / 300 = 22000/300 = 73.3
E(2,3) = (200 * 90) / 300 = 18000/300 = 60.0
χ² = Σ [ (O(r,c) - E(r,c))² / E(r,c) ]
χ² = (50 - 33.3)²/33.3 + (30 - 36.7)²/36.7 + (20 - 30)²/30 + (50 - 66.7)²/66.7 + (80 - 73.3)²/73.3 + (70 - 60)²/60
χ² = (16.7)²/33.3 + (-6.7)²/36.7 + (-10.0)²/30 + (-16.7)²/66.7 + (6.7)²/73.3 + (10.0)²/60
χ² = 8.38 + 1.22 + 3.33 + 4.18 + 0.61 + 1.67 = 19.39
P(χ² > 19.39) ≈ 0.0001
Since the P-value is less than the significance level (0.05), we reject the null hypothesis.
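The hand calculation above can be cross-checked with a short script. This is a minimal sketch in plain Python, not SPSS output; the variable names are illustrative, and it carries full precision for the expected counts, so its result may differ somewhat from a hand-rounded figure.

```python
import math

# Observed counts from the example table (rows: boys, girls;
# columns: Lone Ranger, Sesame Street, The Simpsons).
observed = [
    [50, 30, 20],
    [50, 80, 70],
]

row_totals = [sum(row) for row in observed]
col_totals = [sum(col) for col in zip(*observed)]
n = sum(row_totals)

# Expected count for each cell: (row total * column total) / n.
expected = [[r * c / n for c in col_totals] for r in row_totals]

# Chi-square statistic: sum of (O - E)^2 / E over all cells.
chi2 = sum(
    (o - e) ** 2 / e
    for obs_row, exp_row in zip(observed, expected)
    for o, e in zip(obs_row, exp_row)
)

df = (len(observed) - 1) * (len(observed[0]) - 1)

# For df = 2 the chi-square upper-tail probability has the closed form
# exp(-x / 2). This shortcut is valid only when df = 2.
p_value = math.exp(-chi2 / 2)

print(f"chi2 = {chi2:.2f}, df = {df}, p = {p_value:.5f}")
```

With these counts the statistic comes out near 19.3 on 2 degrees of freedom, and the P-value is far below 0.05, so the conclusion (reject the null hypothesis) is unchanged.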

Homogeneity tests: Example
We have the following data on calories (in million calories) for two factories:
Factory 1: 8400 8230 8380 7860 7930
Factory 2: 7510 7690 7720 8070 7660
1- Are the two factories homogeneous?
2- Test the hypothesis that the two factories have the same mean calories. Use a 0.05 level of significance.

Solution
NOTE: we have two variables (scale & nominal).
Hypotheses of the homogeneity test:
H0: σ²1 = σ²2
H1: σ²1 ≠ σ²2

Analyze > Compare Means > Independent-Samples T Test

Define Groups

Output: P-value = 0.330 > α (0.05). We do not reject H0: the samples are homogeneous (equal variances).

Also, from the t-test for equality of means:
H0: µ1 = µ2
H1: µ1 ≠ µ2
Sig. = 0.018, α = 0.05. Since Sig. < α, we reject H0: the means of the two factories are not equal.
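The two decisions for the factory data can be reproduced outside SPSS. The sketch below is plain Python and makes two assumptions that the slides do not state: the homogeneity test is taken to be the mean-centered form of Levene's test, and the decision is made by comparing each statistic with a tabulated critical value (F(0.05; 1, 8) = 5.32 and two-sided t(0.025; 8) = 2.306) instead of computing a P-value.

```python
import math
from statistics import mean, variance

factory1 = [8400, 8230, 8380, 7860, 7930]
factory2 = [7510, 7690, 7720, 8070, 7660]
n1, n2 = len(factory1), len(factory2)
df_within = n1 + n2 - 2

# Levene's test (mean-centered, assumed here): a one-way ANOVA F statistic
# computed on the absolute deviations from each group's own mean.
z1 = [abs(v - mean(factory1)) for v in factory1]
z2 = [abs(v - mean(factory2)) for v in factory2]
grand = mean(z1 + z2)
ss_between = n1 * (mean(z1) - grand) ** 2 + n2 * (mean(z2) - grand) ** 2
ss_within = (sum((z - mean(z1)) ** 2 for z in z1)
             + sum((z - mean(z2)) ** 2 for z in z2))
levene_f = ss_between / (ss_within / df_within)  # df = (1, n1 + n2 - 2)

# Pooled-variance two-sample t statistic (equal variances assumed).
pooled_var = ((n1 - 1) * variance(factory1)
              + (n2 - 1) * variance(factory2)) / df_within
t_stat = (mean(factory1) - mean(factory2)) / math.sqrt(
    pooled_var * (1 / n1 + 1 / n2))

# Critical values at alpha = 0.05 from standard tables.
F_CRIT, T_CRIT = 5.32, 2.306

print(f"Levene F = {levene_f:.3f} -> reject equal variances: {levene_f > F_CRIT}")
print(f"t = {t_stat:.3f} -> reject equal means: {abs(t_stat) > T_CRIT}")
```

The F statistic stays below its critical value (variances treated as equal) while |t| exceeds its critical value (means differ), which agrees with the two decisions read from the output.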

Summary (in nominal variables)
Normality test:
- Data from a normal distribution: make the homogeneity test, then the t-test.
- Data not from a normal distribution: nonparametric tests.

Regression & Correlation

Regression
The regression line equation is used for future prediction. It is used to predict values "within" the range of values of the independent variable.

Simple linear regression
Linear regression is used to estimate the coefficient of the independent variable in the linear equation, in order to estimate the dependent variable. When there is a single independent variable, the equation of the line takes the form: Y = a + b*X, where X denotes the independent variable and Y denotes the dependent variable.

Example
Suppose that X denotes the temperature between 3:00 pm and 4:00 pm during the summer season, and Y denotes electricity consumption, represented by levels from 1 to 10 where level 10 is the highest consumption. The data were recorded over a period of 10 days:
X: 38 38 30 32 23 30 34 25 31 21
Y: 9.5 9 6 6 4.5 7 8 5 7 4
1- Draw the scatter diagram for these data.
2- Estimate the linear regression equation between (X, Y).
3- If X = 35, then the level of electricity consumption = ……

Mathematical solution
b = Σ(x - x̄)(y - ȳ) / Σ(x - x̄)² = 93.3 / 303.6 = 0.3073
a = ȳ - b·x̄ = 6.6 - (0.3073)(30.2) = -2.680

SPSS Solution 1- Graphs > Legacy Dialogs > Scatter/Dot

Simple Scatter > Define

Output: To add the regression line to the chart: double-click the chart > Add Fit Line at Total > Linear > Close

Output: the straight line is the best representation of these data. The next step >>

2- Analyze > Regression > Linear

Output: the correlation coefficient and the linear regression equation: Yi = -2.681 + 0.307 Xi, with a = -2.681 and b = 0.307

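Prediction using the regression equation: estimating the electricity consumption when the temperature is 35 degrees Celsius.
The regression line equation is Yi = -2.681 + 0.307 Xi.
Since X = 35, the electricity consumption is estimated, using the unrounded coefficients, as:
Y = -2.6808 + 0.30731 (35) ≈ 8.075
(With the coefficients rounded to three decimals, -2.681 + 0.307 (35) = 8.064.)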
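The regression coefficients and the prediction at X = 35 can be recomputed directly from the ten observations. This is a minimal least-squares sketch in plain Python; the helper name `predict` is illustrative and not part of SPSS.

```python
from statistics import mean

# Temperature (X) and electricity consumption level (Y) over 10 days.
x = [38, 38, 30, 32, 23, 30, 34, 25, 31, 21]
y = [9.5, 9, 6, 6, 4.5, 7, 8, 5, 7, 4]

x_bar, y_bar = mean(x), mean(y)

# Least-squares slope and intercept for Y = a + b*X.
s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
s_xx = sum((xi - x_bar) ** 2 for xi in x)
b = s_xy / s_xx
a = y_bar - b * x_bar


def predict(temperature):
    """Estimated consumption level for a given temperature."""
    return a + b * temperature


print(f"a = {a:.3f}, b = {b:.3f}, prediction at 35 = {predict(35):.3f}")
```

Because 35 lies inside the observed temperature range (21 to 38), the prediction is an interpolation, consistent with the advice to predict only within the range of the independent variable.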

Correlation
Another measure that can be used to determine the strength of the relationship between phenomena is the correlation coefficient.

Correlation
One of the most important goals of any research is to find relationships between variables, which is also a fundamental goal of statistics. Before computing correlation coefficients for quantitative data, the data should be inspected in a scatter diagram, both to observe the nature of the relationship (linear or otherwise) and to spot outliers, whose presence may lead to misleading results. The value of the correlation coefficient lies between -1 and +1. If the correlation coefficient equals +1, the correlation is perfectly positive (direct); if it equals -1, the correlation is perfectly negative (inverse).

Scatter Diagram
- r = 0: there is no relationship between the variables, or there is a relationship but it is not linear.
- r = -1 or r = +1: all points lie on the regression line that represents the relationship between the variables (x, y).
- 0 < r < +1 or -1 < r < 0: the points are concentrated around the regression line.

Values of the correlation coefficient    Meaning
r = +1                                   Perfect positive correlation
r = -1                                   Perfect negative correlation
0.90 ≤ r ≤ 0.99                          Very strong positive correlation
-0.99 ≤ r ≤ -0.90                        Very strong negative correlation
0.70 ≤ r ≤ 0.89                          Strong positive correlation
-0.89 ≤ r ≤ -0.70                        Strong negative correlation
0.50 ≤ r ≤ 0.69                          Moderate positive correlation
-0.69 ≤ r ≤ -0.50                        Moderate negative correlation
0.30 ≤ r ≤ 0.49                          Weak positive correlation
-0.49 ≤ r ≤ -0.30                        Weak negative correlation
0.01 ≤ r ≤ 0.29                          Very weak positive correlation
-0.29 ≤ r ≤ -0.01                        Very weak negative correlation
r = 0                                    Zero correlation
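The classification above can be written as a small lookup. This is a sketch with one labeled assumption: the slide lists bands with two-decimal endpoints, so values that fall between bands (for example 0.695 or 0.995) are assigned here by treating 0.90, 0.70, 0.50 and 0.30 as lower bounds. The function name `describe_correlation` is illustrative.

```python
def describe_correlation(r):
    """Return the verbal label for a correlation coefficient r in [-1, 1]."""
    if not -1 <= r <= 1:
        raise ValueError("a correlation coefficient must lie in [-1, 1]")
    if r == 0:
        return "Zero correlation"

    direction = "positive" if r > 0 else "negative"
    size = abs(r)
    if size == 1:
        return f"Perfect {direction} correlation"

    # Lower bounds of each band (assumption: gaps between the listed
    # bands are closed by using these values as thresholds).
    bands = [
        (0.90, "Very strong"),
        (0.70, "Strong"),
        (0.50, "Moderate"),
        (0.30, "Weak"),
    ]
    for lower, label in bands:
        if size >= lower:
            return f"{label} {direction} correlation"
    return f"Very weak {direction} correlation"


print(describe_correlation(0.8))
print(describe_correlation(-0.983))
```

Keeping the thresholds in one list makes it easy to swap in a different convention, since cut-offs for "strong" and "weak" vary between textbooks.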

Correlation coefficients according to the measurement level of the variables

Two different correlation techniques are available:
1- Pearson correlation coefficient, for quantitative variables
2- Spearman correlation coefficient, for ordinal scales

1- Pearson correlation coefficient for quantitative variables

Example
Find the correlation between the outside temperature (y) and the height in thousands of feet (x) of a plane at different times.
Height (x):       0   4   4   10   6
Temperature (y):  27  21  18  10   16
Calculate the coefficient of correlation between the height and the temperature.

Mathematical solution
No.   x    y    x²    y²     xy
1     0    27   0     729    0
2     4    21   16    441    84
3     4    18   16    324    72
4     10   10   100   100    100
5     6    16   36    256    96
∑     24   92   168   1850   352
x̄ = 4.8; ȳ = 18.4; Sx = 3.2496; Sy = 5.6071
r = (Σxy/n - x̄·ȳ) / (Sx·Sy) = (70.4 - 88.32) / (3.2496 × 5.6071) = -0.983
This means there is a very strong negative correlation between the height and the temperature.
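The column sums and the coefficient can be verified with a few lines of plain Python. This sketch uses the computational form of Pearson's r built from the sums in the table; it is an illustration, not the SPSS procedure itself.

```python
import math

height = [0, 4, 4, 10, 6]            # x, thousands of feet
temperature = [27, 21, 18, 10, 16]   # y

n = len(height)
sum_x = sum(height)
sum_y = sum(temperature)
sum_x2 = sum(x * x for x in height)
sum_y2 = sum(y * y for y in temperature)
sum_xy = sum(x * y for x, y in zip(height, temperature))

# Pearson's r from the raw sums:
# r = (n*Sxy - Sx*Sy) / sqrt((n*Sx2 - Sx^2) * (n*Sy2 - Sy^2))
r = (n * sum_xy - sum_x * sum_y) / math.sqrt(
    (n * sum_x2 - sum_x ** 2) * (n * sum_y2 - sum_y ** 2)
)

print(f"sums: {sum_x}, {sum_y}, {sum_x2}, {sum_y2}, {sum_xy}")
print(f"r = {r:.3f}")
```

The sums reproduce the ∑ row of the table, and the coefficient agrees with the value reported for this example (r = -0.983).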

SPSS Solution 1- Graphs > Legacy Dialogs > Scatter/Dot > Simple Scatter

Output: To add the regression line to the chart: double-click the chart > Add Fit Line at Total > Linear > Close

Output: the straight line is the best representation of these data. The next step >>

2- Analyze > Correlate > Bivariate

Bivariate Correlations window:

Output: From the correlation output, r = -0.983. This means there is a very strong negative correlation between the height and the temperature.

2- Spearman correlation coefficient for ordinal scales

Example
We have the grades of 5 students in two subjects:
Statistics:   A  C  D  F  B
Mathematics:  B  C  F  D  A
Find the correlation between the students' grades in statistics and mathematics.

Mathematical solution
Stat   Math   Rank of Stat   Rank of Math   d    d²
A      B      1              2              -1   1
C      C      3              3              0    0
D      F      4              5              -1   1
F      D      5              4              1    1
B      A      2              1              1    1
Total                                            4
r = 1 - 6Σd² / (n(n² - 1)) = 1 - (6 × 4) / (5 × 24) = 0.8
There is a strong positive correlation between the students' grades in statistics and mathematics.
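The rank-difference calculation can be reproduced as follows. This is a minimal sketch in plain Python; the `GRADE_RANK` mapping (A = 1 through F = 5) is an assumption chosen to match the ranks in the table, and the shortcut formula applies because no grade is repeated within a subject (no ties).

```python
# Rank assigned to each letter grade (assumed: A is ranked first).
GRADE_RANK = {"A": 1, "B": 2, "C": 3, "D": 4, "F": 5}

statistics_grades = ["A", "C", "D", "F", "B"]
mathematics_grades = ["B", "C", "F", "D", "A"]

# Difference in ranks for each student, and the sum of squared differences.
d = [
    GRADE_RANK[s] - GRADE_RANK[m]
    for s, m in zip(statistics_grades, mathematics_grades)
]
sum_d2 = sum(di ** 2 for di in d)

n = len(d)

# Spearman's rank correlation (no-ties formula):
# r_s = 1 - 6 * sum(d^2) / (n * (n^2 - 1))
r_s = 1 - 6 * sum_d2 / (n * (n ** 2 - 1))

print(f"d = {d}, sum of d^2 = {sum_d2}, r_s = {r_s:.1f}")
```

If grades were tied within a subject, average ranks would be needed and the shortcut formula would no longer be exact.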

Solution by SPSS: follow the same steps as in the previous example:

Analyze > Correlate > Bivariate

Output: From this table we find the same result: r = 0.8, so there is a strong positive correlation.

Common mistakes
Using the Pearson correlation coefficient for nonlinear data. The linearity of the relationship between the two phenomena must therefore be verified first. The Pearson correlation coefficient reflects the "linearity of the relationship".

Question
A national consumer magazine reported the following correlations:
1- The correlation between car weight and car reliability is -0.30.
2- The correlation between car weight and annual maintenance cost is 0.20.
Which of the following statements are true?
I. Heavier cars tend to be less reliable.
II. Heavier cars tend to cost more to maintain.
III. Car weight is related more strongly to reliability than to maintenance cost.

Statistical Humor
A ONE-WAY ANOVA shouted at a TWO-WAY ANOVA: "STOP! Turn around - You are going the wrong way!" The TWO-WAY ANOVA yelled back: "Sorry! I will turn when I see an interaction!"