Analyzing Stability in Colorado K-12 Public Schools

Slides:



Advertisements
Similar presentations
Multiple Regression II Fenster Multiple Regression Let’s go through an example using multiple regression and compare results between simple regression.
Advertisements

Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Hypothesis Testing making decisions using sample data.
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Quantitative Data Analysis: Hypothesis Testing
Chapter 13 Multiple Regression
© 2010 Pearson Prentice Hall. All rights reserved Single Factor ANOVA.
Korelasi Ganda Dan Penambahan Peubah Pertemuan 13 Matakuliah: I0174 – Analisis Regresi Tahun: Ganjil 2007/2008.
Interaksi Dalam Regresi (Lanjutan) Pertemuan 25 Matakuliah: I0174 – Analisis Regresi Tahun: Ganjil 2007/2008.
Regresi dan Rancangan Faktorial Pertemuan 23 Matakuliah: I0174 – Analisis Regresi Tahun: Ganjil 2007/2008.
Chapter 12 Multiple Regression
© 2003 Prentice-Hall, Inc.Chap 14-1 Basic Business Statistics (9 th Edition) Chapter 14 Introduction to Multiple Regression.
Biol 500: basic statistics
Lecture 23 Multiple Regression (Sections )
Stat 217 – Day 25 Regression. Last Time - ANOVA When?  Comparing 2 or means (one categorical and one quantitative variable) Research question  Null.
Correlation & Regression
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 12 Analyzing the Association Between Quantitative Variables: Regression Analysis Section.
Chapter 13: Inference in Regression
Hypothesis Testing in Linear Regression Analysis
Regression Analysis. Scatter plots Regression analysis requires interval and ratio-level data. To see if your data fits the models of regression, it is.
Multiple regression - Inference for multiple regression - A case study IPS chapters 11.1 and 11.2 © 2006 W.H. Freeman and Company.
Multiple Regression and Model Building Chapter 15 Copyright © 2014 by The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT OSMAN BIN SAIF Session 26.
Hypothesis Testing. Why do we need it? – simply, we are looking for something – a statistical measure - that will allow us to conclude there is truly.
Agresti/Franklin Statistics, 1 of 88 Chapter 11 Analyzing Association Between Quantitative Variables: Regression Analysis Learn…. To use regression analysis.
Chapter 6: Analyzing and Interpreting Quantitative Data
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Simple Linear Regression Analysis Chapter 13.
Hypothesis Testing. Suppose we believe the average systolic blood pressure of healthy adults is normally distributed with mean μ = 120 and variance σ.
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
Methods of Presenting and Interpreting Information Class 9.
Stats Methods at IC Lecture 3: Regression.
Outline Sampling Measurement Descriptive Statistics:
Inferential Statistics
Chapter 15 Multiple Regression Model Building
32931 Technology Research Methods Autumn 2017 Quantitative Research Component Topic 4: Bivariate Analysis (Contingency Analysis and Regression Analysis)
Step 1: Specify a null hypothesis
Chapter 14 Introduction to Multiple Regression
Regression and Correlation
Regression Analysis.
Chapter 14 Inference on the Least-Squares Regression Model and Multiple Regression.
Practical Statistics Abbreviated Summary.
Dr. Siti Nor Binti Yaacob
Chapter 9 Multiple Linear Regression
Inference and Tests of Hypotheses
Comparing Three or More Means
PCB 3043L - General Ecology Data Analysis.
Hypothesis Testing and Confidence Intervals (Part 1): Using the Standard Normal Lecture 8 Justin Kern October 10 and 12, 2017.
Multiple Regression Analysis and Model Building
APPROACHES TO QUANTITATIVE DATA ANALYSIS
Regression Analysis.
Dr. Siti Nor Binti Yaacob
By: Allison Wilson and Victoria Fernandez Period: 2nd
Stats Club Marnie Brennan
CHAPTER 26: Inference for Regression
Chapter 12 Inference on the Least-squares Regression Line; ANOVA
Comparing Groups.
Comparing Several Means: ANOVA
More About ANOVA BPS 7e Chapter 30 © 2015 W. H. Freeman and Company.
Hypothesis testing and Estimation
WHO MAKES THE INTERNET SO POPULAR?
Pemeriksaan Sisa dan Data Berpengaruh Pertemuan 17
Korelasi Parsial dan Pengontrolan Parsial Pertemuan 14
Chapter 7: The Normality Assumption and Inference with OLS
15.1 The Role of Statistics in the Research Process
Don’t Let Them Go Without a Fight How the Migrant Program Works to Prevent Dropouts Sarah Seamount Migrant Education Program Coordinator Vickie McCullough.
Chapter Nine: Using Statistics to Answer Questions
Practice As part of a program to reducing smoking, a national organization ran an advertising campaign to convince people to quit or reduce their smoking.
Essentials of Statistics for Business and Economics (8e)
InferentIal StatIstIcs
STATISTICS INFORMED DECISIONS USING DATA
Presentation transcript:

Analyzing Stability in Colorado K-12 Public Schools Final Presentation Shelby Harold INFO 3110

What is Stability? Stability is a measures the amount of students who stayed within a school district during the year. Students leave the school district for many reasons Moving houses, opting into a better district, dropping out, graduating early, etc. The stability rate is the percentage of students who remained in a district without interruption throughout the school year.  𝐶𝑜𝑢𝑛𝑡 𝑜𝑓 𝑆𝑡𝑢𝑑𝑒𝑛𝑡𝑠 𝑊ℎ𝑜 𝑅𝑒𝑚𝑎𝑖𝑛𝑒𝑑 𝑖𝑛 𝑡ℎ𝑒 𝐷𝑖𝑠𝑡𝑟𝑖𝑐𝑡÷ 𝑆𝑡𝑢𝑑𝑒𝑛𝑡𝑠 𝑎𝑡 𝑡ℎ𝑒 𝐵𝑒𝑔𝑖𝑛𝑛𝑖𝑛𝑔 𝑜𝑓 𝑡ℎ𝑒 𝑌𝑒𝑎𝑟 For example, if a school started with 100 students and 87 students remained in the district at the end of the year… The stability rate would be 87% Even if kids transferred in, it would be 87%

Why is it Important? Having a stable student base is essential for a school’s future success Schools that have higher stability rates perform better Teachers make better connections with the students and can tailor their teaching accordingly

Why is this important to me? My passion is reforming public K-12 schools in America. I am interested in what makes our schools work and what doesn't. If increasing stability is the way to better our public schools, then investigating what effects stability is important.

My Data Data collected and published by Colorado Department of Education Combination of 19 data sets Experimental Region: School Districts (179)

More Data Stuff SUBGROUPS Race Gender Grade Level Students with Disabilities English as Second Language Economically Disadvantaged Homeless Students VARIABLES Average Salary of Teachers Average Salary of Principals Funding per Pupil Percentage of Free and Reduced Lunch Graduation Rate Urban vs. Rural Average Home Value

Overall Aim of the Project Analyze what factors effect stability What subgroups are the most unstable? NOT analyzing the effects of stability, but rather what may affect it

Basic Descriptive Statistics of the Stability Rates in Colorado Mean: 81.52 SE Mean: .429 Standard Deviation: 5.744 Minimum: 63.4 Q1: 77.8 Median: 82.6 Q3: 85.6 Maximum: 93.7

Dot Plot of Overall Stability Rate The data looks pretty normally distributed so unfortunately there might not be a need for non-parametric statistics.

Is it really normally distributed? In Minitab, there is a normality test that determines how well a given set of data follows a normal distribution!! The p- value is very low so we can conclude that the data follows a normal distribution.

Correlation of Variables I constructed a correlation chart to determine if any of my variables were related. If two variables have too high of a correlation, it could distort regression results. There is no need for two variables that are highly correlated; one will do.

Pearson Correlation: -.66 Correlated Variables Pearson Correlation: .564 Pearson Correlation: -.66

Test 1: Is the mean stability rate between urban and rural schools different? Null: There is no statistically significant difference in the means of the urban and rural school district’s stability rates. Alternative: There is a statistically significant difference between the means of the urban and rural school district’s stability rates. Two Sample Test Binary Variable: Urban and Rural Continuous: Stability Rate

Urban schools look like they might be slightly more stable. Let’s see! Dot Plot Urban schools look like they might be slightly more stable. Let’s see!

Two Sample Test As predicted, the mean of the urban schools was higher. However, it is not statistically significant. * The p-value is too high and the confidence interval contains 0. Therefore, we fail to reject the null and must accept the alternative hypothesis. There is no difference in the means of urban and rural schools.

Test 2: Does funding have an effect on stability rate? Null: There is no correlation between funding per pupil and stability rate. Alternative: There is a correlation between funding per pupil and stability rate. Simple Linear Regression Continuous Input: Funding Per Pupil Continuous Output: Stability Rate In other words, does funding per pupil predict stability rate?

Scatter Plot It’s not looking so good  Let’s run the regression anyways!

Simple Linear Regression Output The p-value of .852 is way too high!! R-squared is super low Because the p-value is higher than .1, we fail to reject the null. We cannot say that there is a correlation between funding per pupil and stability. We cannot use funding as a predictor.

What single factor is the best predictor of stability? Since funding was not a good predictor, I wanted to investigate what the single best predictor among my variables is. So I ran simple linear regressions on stability rate against each of my variables. The p-values are as follows… Graduation Rate: 0 Average Principal Salary: 0.05 Average Teacher Salary: 0.011 Drop-out rate: 0 Free and Reduced Lunch Percentage: 0 Average Home Value: 0.1 Urban v. Rural: 0.186 Clearly, I picked the worst predictor in my first regression analysis! Graduation Rate, Drop-Out Rate, and FRL Percentage would all be good predictors of stability.

Test 3: What variables affect stability rate? Null: None of the variables have significant p-values. Alternative: At least one variable has a significant p-value Multiple Linear Regression Single continuous output: Stability rate Multiple continuous inputs: All other variables

Multiple Linear Regression JMP Multiple Linear Regression Output

Eliminating Variables I used backward stepwise to delete any insignificant variables. The output below shows what I was left with. The R-Squared Value is still not the best, but all the variables have significant p-values

Equation Using the significant variables, we can develop an equation to predict stability rates. Stability Rate= 80.062+(0.000141*Average Teacher Salary)+ (-1.0091*Total Dropout Rate)+(-0.0587*Free and Reduced Lunch Percentage)

Test 4: Which subgroups are the most unstable? Using ANOVA

Gender

Gender There is no significant difference in stability rate between male and female students.

Race

ANOVA Test for Race There could be a significant difference between races

Graphic Representation of Race

Notable Conclusions on Race Hispanic and White students could have a similar mean. The biggest difference was between White students and Native Hawaiian students. Overall, Native Hawaiians were the most unstable race.

Grade Level

Tukey Pairwise Comparisons for Grade

Notable Grade Level Conclusions Even though, some grades have different means, the majority of the grades could have similar means. The most unstable group is half-day kindergarten with a stability rate of 18.74% Seniors have the highest mean stability rate.

Types of Students

ANOVA Test for Types of Students ESL Students are in a group of their own Migrant and Homeless students could have similar means

Notable Type of Student Conclusions Most of the subgroups of students could have a mean similar to the overall stability rate. Homeless students and migrant students are far less stable than other types of students. It might be useful to partner with a homeless shelter to keep students engaged and in one location.

My Favorite Findings  The mean stability rate is only 81.5%! There is no difference in the means between urban and rural schools. I assumed urban schools would be much more unstable because of the types of jobs. I thought funding would have an impact on stability. If funding was low, students will leave the district. It turned out this was the worst predictor of stability. Grade level has very little to do with stability. Gender has no significant effect on stability. If school boards want to increase stability, their polices should target Native Hawaiians students, migrant students, and homeless students.