Using R and Longitudinal Data to Answer Policy Questions Jared Knowles Wisconsin Department of Public Instruction NCES DC-STATS 2012 Washington DC, July.

Slides:



Advertisements
Similar presentations
Second Information Technology in Education Study (SITES) A Project of the International Association for the Evaluation of Educational Achievement (IEA)
Advertisements

The Teacher Work Sample
Square Peg and Round Hole… As parents and educators, the change in grading systems requires a fundamental switch in our thinking… 4=A 1=F 2=D 3=B.
PD Plan Agenda August 26, 2008 PBTE Indicators Track
Model Evaluation Tools MET. What is MET Model Evaluation Tools ( MET )- a powerful and highly configurable verification package developed by DTC offering:
Data Mining Methodology 1. Why have a Methodology  Don’t want to learn things that aren’t true May not represent any underlying reality ○ Spurious correlation.
Comparing Growth in Student Performance David Stern, UC Berkeley Career Academy Support Network Presentation to Educating for Careers/ California Partnership.
Case Tools Trisha Cummings. Our Definition of CASE  CASE is the use of computer-based support in the software development process.  A CASE tool is a.
Oral Presentation Rubrics Standards-based Assessment of and for Learning.
Welcome to E-Prime E-Prime refers to the Experimenter’s Prime (best) development studio for the creation of computerized behavioral research. E-Prime is.
Data Analysis Statistics. Inferential statistics.
Robert M. Saltzman © DS 851: 4 Main Components 1.Applications The more you see, the better 2.Probability & Statistics Computer does most of the work.
1 The New York State Education Department New York State’s Student Reporting and Accountability System.
Data Mining By Andrie Suherman. Agenda Introduction Major Elements Steps/ Processes Tools used for data mining Advantages and Disadvantages.
Face Detection using the Viola-Jones Method
SharePoint 2010 Business Intelligence Module 6: Analysis Services.
How do you know?: Interpreting and Analyzing Data NCLC 203 New Century College, George Mason University April 6, 2010.
B. RAMAMURTHY EAP#2: Data Mining, Statistical Analysis and Predictive Analytics for Automotive Domain CSE651C, B. Ramamurthy 1 6/28/2014.
1 Beyond California Water Plan Update 2005 California Water and Environmental Modeling Forum Annual Meeting, March 3 rd, 2005.
1 Introduction to Modeling Languages Striving for Engineering Precision in Information Systems Jim Carpenter Bureau of Labor Statistics, and President,
Connecting the Dots BISD’S LEARNING PLATFORM AND ESTABLISHING EXPECTATIONS FOR LEARNING RUBRIC.
“Every Child a Graduate” Wisconsin Department of Public Instruction SPECIAL EDCATION FRAMEWORK FOR PROFESSIONAL DEVELOPMENT Life-long learning and continuous.
Whiteboard Zoom Out Surveying Year One of the Oklahoma Value-Added Model.
Evaluating the Vermont Mathematics Initiative (VMI) in a Value Added Context H. ‘Bud’ Meyers, Ph.D. College of Education and Social Services University.
STA Lecture 161 STA 291 Lecture 16 Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately)
PPT Presentation Template: This PPT includes all slides to present a district or building level overview of PVAAS. This was used with a district- wide.
The Role of Information in Systems for Learning Paul Nichols Charles DePascale The Center for Assessment.
User Study Evaluation Human-Computer Interaction.
Why Is It There? Getting Started with Geographic Information Systems Chapter 6.
Process Improvement with Solitaire Using the PC Solitaire game to learn basic (and advanced) techniques of Process Improvement (So easy, even a can do.
Welcome to... Introduction to A Framework for Teaching 10/12/2015pbevan 1.
Karen Seay PARENTAL INVOLVEMENT 101 – Writing a compliant policy and compact We’re all in this together:  State Department of Education 
Using Human Component Mapping TO ANALYSE & INTEGRATE HUMAN FACTORS ISSUES & RECORDS WITH RAILWAY HAZARD LOGS 1 Dr. Amanda C. Elliott, Simon Macmull & Harry.
WELCOME EPAS Reports: Maximizing the Impact on Student Achievement in Mathematics.
Developing a Statewide Early Warning System Doug White, DPI Jared Knowles, DPI.
AP STATISTICS LESSON 10 – 2 DAY 1 TEST OF SIGNIFICANCE.
Biostatistics Class 1 1/25/2000 Introduction Descriptive Statistics.
Introduction – Addressing Business Challenges Microsoft® Business Intelligence Solutions.
Weka: Experimenter and Knowledge Flow interfaces Neil Mac Parthaláin
Ch 6. The Evolution of Analytic Tools and Methods Taming The Big Data Tidal Wave 31 May 2012 SNU IDB Lab. Sengyu Rim.
FALCON Meeting #3 Preparation for Harnett County Schools Thursday, March 8, 2012.
Data Analysis Econ 176, Fall Populations When we run an experiment, we are always measuring an outcome, x. We say that an outcome belongs to some.
The Software Development Process
1 National Center on Educational Outcomes What’s so Difficult About Including Special Education Teachers and Their Students in Growth Models Used to Evaluate.
Introduction to Statistics Osama A Samarkandi, PhD, RN BSc, GMD, BSN, MSN, NIAC Deanship of Skill development Dec. 2 nd -3 rd, 2013.
Right Hand Side (Independent) Variables Ciaran S. Phibbs.
© 2015 The College Board The Redesigned SAT/PSAT Key Changes.
WEB FORM DESIGN. Creating forms for a web page For your web project you have to design a form for inclusion on your web site (the form information should.
Fidelity of Implementation A tool designed to provide descriptions of facets of a coherent whole school literacy initiative. A tool designed to provide.
Stuart Birnbaum Department of Geological Sciences The University of Texas at San Antonio Learning objectives and assessments June 15, 2015.
Brian Lukoff Stanford University October 13, 2006.
Researching Technology in South Dakota Classrooms Dr. Debra Schwietert TIE Presentation April 2010 Research Findings.
1 The Software Development Process ► Systems analysis ► Systems design ► Implementation ► Testing ► Documentation ► Evaluation ► Maintenance.
[School Name]’s Student Perception Survey Results This presentation is a template and should be customized to reflect the needs and context of your school.
Copyright © 2014 American Institutes for Research and Cleveland Metropolitan School District. All rights reserved. March 2014 Interpreting Vendor Assessment.
SAS Integration Trainer (SASIT) September Please Do the Following: Connect to the Internet Navigate to:
1 Innovative Teaching and Learning (ITL) Research Corinne Singleton SRI International.
Why Is It There? Chapter 6. Review: Dueker’s (1979) Definition “a geographic information system is a special case of information systems where the database.
California Assessment of STUDENT PERFORMANCE and PROGRESS
Criteria Rollout Meeting October 30, 2016
Chapter 16: Sample Size “See what kind of love the Father has given to us, that we should be called children of God; and so we are. The reason why the.
Evaluating state of the art in AI
Supplementary Table 1. PRISMA checklist
Prepared by Kimberly Sayre and Jinbo Bi
Data Science Process Chapter 2 Rich's Training 11/13/2018.
EVAAS Overview.
Smarter Balanced Assessments: What do families need to know?
Dr. Morgan C. Wang Department of Statistics
Continuous Improvement/eProve Regional Trainings
Interim Assessment Training NEISD Testing Services
Presentation transcript:

Using R and Longitudinal Data to Answer Policy Questions Jared Knowles Wisconsin Department of Public Instruction NCES DC-STATS 2012 Washington DC, July 12, 2012

Overview Why R? Examples of R Analyses? Share R code across states Develop joint methods Produce reports

Why R? R is free R is open source R is best in class and state of the art R is free

R works for government R is transparent R is reproducible R is accurate R works! Today!

R

Google Scholar Hits R has recently passed Stata on Google Scholar hits and it is catching up to the two major players SPSS and SAS

Google Scholar Hits These links come from the explosion of add-on packages to R

R has an Active Web presence Usage of the R listserv for help has really exploded recently

R Examples Read in data

Simple Diagnostics plot of chunk unamed-chunk-1

Advanced Diagnostics plot of chunk unamed-chunk-2

More advanced Grade 3 Score Cohort Trends

Can this generate information? Graphics help explain, but are still descriptive R can help on two fronts: 1.R can do advanced analytics that provide insight 2.R can graphically depict those analytics in simple ways that are intuitive to policy makers Oh yeah…? Prove it. 1.BLBC study in Wisconsin 2.Regression Trees 3.Machine Learning Algorithms

BLBC in Wisconsin We fit statistical models to all the ESL students in Wisconsin modeling their future test score as best as possible through each strategy described above (on all three major assessments) We also include an indicator of whether the student was eligible for BLBC instruction or not We compare to see if the average student receiving BLBC did better than the average student receiving other ESL services, all else equal We report the range of uncertainty around this difference between student groups and compare them to see if any meaningful differences emerge Due to our large sample size we expect our results to be biased in favor of finding statistically significant results, so we also examine the magnitude of findings to see if they are substantively significant; i.e. is the difference big enough to matter in the lives of students?

Evaluations of Policy Results are presented in effect sizes, or standard deviation units of change in test scores. 0.1 is small, 0.2 to 0.4 is reasonable and is about a year of education in most cases. Bigger than 0.4 is huge.

Results I The bars represent the 95% confidence internal around the estimate. The VAM model is consistently statistical significant, not overlapping 0, and negative. The length of the bars represent the uncertainty about the estimate. But the mean effect size is quite small, less than 0.1 standard deviations in most cases. This represents a Year-to-year change in student’s score Between BLBC and non-BLBC instruction.

Results I (graphic)

Results II Language is different. Wisconsin has a large sample of both Hmong and Spanish speakers and they have different results when analyzed separately

Conclusions and Next Steps In Wisconsin BLBC does not have the negative effects found in recent research on California, though a slight negative effect may exist in some cases Unlike other states where BLBC has been studied, Wisconsin has substantially different results between language groups on mathematics and possibly on English proficiency achievement There is still a lot of imprecision in the estimates used here and more precision would be helpful, but effects are not substantively large in terms of relative student performance, even in the uppe r and lower bounds

Next Steps Get more data over more years and use a more precise estimation technique to reduce uncertainty about effects Explore the variation across BLBC programs in addition to the mean effect Learn more about the non-cognitive non-academic outcomes for BLBC in order to understand the costs and benefits of BLBC programs more fully Estimate a “treatment-on-the-treated” parameter to more directly compare to prior research Survey teachers and merge teacher/program practice data with student outcomes to begin exploring the effective components of BLBC and ESL programs

Inference Trees Get more data over more years and use a more precise estimation technique to reduce uncertainty about effects Explore the variation across BLBC programs in addition to the mean effect Learn more about the non-cognitive non-academic outcomes for BLBC in order to understand the costs and benefits of BLBC programs more fully Estimate a “treatment-on-the-treated” parameter to more directly compare to prior research Survey teachers and merge teacher/program practice data with student outcomes to begin exploring the effective components of BLBC and ESL programs

Inference Tree Example plot of chunk parttree

R is a powerful platform Can Standardize and Share / Compare Results Execute the same code on each other’s data Compare results Compare methods of analysis and improve them Build a professional community Collaborate

Some code sharing exists DPI has begun working with the Strategic Data Project at Harvard to prepare their toolkit using R Currently written in Stata Goal is to package the toolkit into R functions that can be applied to any dataset that has the required elements This work has begun with the creation of a few functions in R and some documentation Can be found online at GitHub [github.com/jknowles/SDP- Toolkit-for-R] pfpie/index.php/sdp/strategic- data-project-the-vision

Race/Ethnicity Example Consider this example data from the Strategic Data Project Toolkit: Student 1 and Student 2 in this data have different races in different years This doesn’t happen often in most of our data systems, but it does happen, especially across different datasets For research this can cause problems and requires different business rules

What business rules do we use? Ad hoc and up to the researcher Need standards Need commonality Need consistency Need tools that make those things easy to do!

What to do Should this student be declared H, the modal race? Should this student be declared B, the first occurring race? Should this student be flagged as inconsistent? Should this student be coded as multi-racial?

Fix the data We can do the modal category easily in R using a simple function

Fixing the data in a few simple steps

Some code sharing exists DPI has begun working with the Strategic Data Project at Harvard to prepare their toolkit using R Currently written in Stata Goal is to package the toolkit into R functions that can be applied to any dataset that has the required elements This work has begun with the creation of a few functions in R and some documentation Can be found online at GitHub [github.com/jknowles/SDP- Toolkit-for-R] pfpie/index.php/sdp/strategic- data-project-the-vision

What happened We implemented two business rules on over 59,000 observations in a few seconds on a few lines of code First, the modal race is chosen for multiple race categories per student If a tie exists (more than 1 mode), we map the most recent race These business rules can be readily changed, i.e. we could use the first race or a multi-race code for students with multiple modes

What happened (continued) This script can be run every time data is extracted from the warehouse to do work on It can be run by every analyst on every machine because R is free and easy to deploy! Consistency and repeatability. It can be swapped out with a different business rule implemented just as easily for different purposes flexibility It can be mapped and translated onto CEDS elements

Next Steps Once we clean up the data, analytics can be shared Doing analytics is a simple next step in R R has best in class machine learning algorithms used to classify data and predict R is the tool of choice for data science algorithms

A Data Mining Example If we are interested in pure predictive analytics, R provides hundreds of best in class algorithms and methods to evaluate them This is done primarily through the caret package, which provides an easy to use framework for comparing these algorithms These models can be used to predict “classes” of students, predict student scores, or predict anything else of interest

Do analytics on fixed data

A Data Mining Example If we are interested in pure predictive analytics, R provides hundreds of best in class algorithms and methods to evaluate them This is done primarily through the caret package, which provides an easy to use framework for comparing these algorithms These models can be used to predict “classes” of students, predict student scores, or predict anything else of interest

Machine Learning

Predictions

Deviance

Deviance (II)

The best part R is a programming language and can be used to produce reports R can produce HTML, PDF, or other formats of reports Examples: Dropout risk reports for each high school NSC reports by school district

The best part (II) R can do this by simply building a template and running analytics on the appropriate data subset, automatically R can even reports, seriously Your data analysts have access to a full programming language once they understand R, flexibility far above and beyond other analytics packages for a slightly steeper learning curve

How to learn Online with tutorials DPI R Bootcamp in August PD workshops elsewhere

Online Tutorials Visit this presentation online and look at these great resources for learning that are free and open source as well R Features List [ source-r/r-language-features/] Video Tutorials [ R Tutorials from Around the World [pairach.com/2012/02/26/r-tutorials-from-universities-around-the- world/] R for SPSS/SAS Users [r4stats.com/add-ons]

DPI R Bootcamp DPI is offering a bootcamp on R August 2nd and 3rd. Slots are limited for this two full days of R training. Training materials will be made available online. As they are developed, they can be viewed at For more information, visit the website We will probably do another bootcamp soon, a few slots still available for this bootcamp. Interested? See me.

Session Info This document is produced with knitr version Here is my session info:

Jared E. Knowles Wisconsin Department of Public Instruction Phone