Fundraising Analytics to identify potential prospects using SAS 12.1

Slides:



Advertisements
Similar presentations
1 Incorporating Statistical Process Control and Statistical Quality Control Techniques into a Quality Assurance Program Robyn Sirkis U.S. Census Bureau.
Advertisements

Data Analytics : A powerful insight into your donors’ giving potential Insight SIG 19th February, 2013.
J. Milito & Associates, Inc. We have been successfully conducting telephone fundraising for the past 15 years We work exclusively with Arts, Academic,
Departments of Medicine and Biostatistics
Introductions John White, Director of Product Management
April 29 - May 1, 2015 Better Donor Engagement Through Cluster Analysis.
Computer Science Universiteit Maastricht Institute for Knowledge and Agent Technology Data mining and the knowledge discovery process Summer Course 2005.
Beyond Opportunity; Enterprise Miner Ronalda Koster, Data Analyst.
Enterprise systems infrastructure and architecture DT211 4
1 Chapter 1: Introduction 1.1 Introduction to SAS Enterprise Miner.
Chapter 1: Introduction
April 11, 2008 Data Mining Competition 2008 The 4 th Annual Business Intelligence Symposium Hualin Wang Manager of Advanced.
First Steps in Predictive Modeling & Analytics Presented By: Mark Mathyer, Greater Chicago Food Depository Tré Geoghegan, Museum of Science and Industry.
Factors that Associated with Stress in Nursing Faculty in Thailand
A N A L Y T I C S E R V I C E S Expanding the Scope of Prospect Research: Data Mining and Data Modeling Chad Mitchell Blackbaud Analytics September 2,
Data Mining Chun-Hung Chou
Understanding Data Analytics and Data Mining Introduction.
© 2005 Bentz Whaley Flessner Profile, Identify and Track Major and Planned Giving Prospects in Team Approach Joshua Birkholz.
Chapter 7: Generating Funds Part 2 Meg Giddings. 3 Types of Individual Fundraising A) Annual Giving: campaigns run each year soliciting past and new donors.
by B. Zadrozny and C. Elkan
Abstract Background Significance Proposed Methods Research Trajectory Aims Children with Complex Chronic Conditions: A Formative Study to Support Development.
Can we help? the impact on entrepreneurial performance of publicly funded business advisory services. Eileen Fischer Schulich School of Business York University.
Copyright © 2010, SAS Institute Inc. All rights reserved. Applied Analytics Using SAS ® Enterprise Miner™
Patterns of Event Causality Suggest More Effective Corrective Actions Abstract: The Occurrence Reporting and Processing System (ORPS) has used a consistent.
Modeling and Money: The Two DO Mix TAIR February 1, 2006.
 Mail Order Company in USA › Would like to find out if there is a way › To reduce mailing cost › By analyzing the past data.
Copyright © 2012, SAS Institute Inc. All rights reserved. ANALYTICS IN BIG DATA ERA ANALYTICS TECHNOLOGY AND ARCHITECTURE TO MANAGE VELOCITY AND VARIETY,
Jennifer Lewis Priestley Presentation of “Assessment of Evaluation Methods for Prediction and Classification of Consumer Risk in the Credit Industry” co-authored.
Lecture DSCI 4520/5240 DATA MINING MYRAW Nonprofit donor data MYRAW Overview Determine who is likely to donate to a non-profit organization campaign.
Copyright © 2001, SAS Institute Inc. All rights reserved. Data Mining Methods: Applications, Problems and Opportunities in the Public Sector John Stultz,
2015 NEMA Conference Major Gifts for Small Shops Laura Ewing-Mahoney Co-Founder and Principal.
Which socio-demographic living arrangement helps to reach 100? Michel POULAIN & Anne HERM Orlando 8 January 2014.
Multivariate Analysis - Introduction. What is Multivariate Analysis? The expression multivariate analysis is used to describe analyses of data that have.
Prepared by Fayes Salma.  Introduction: Financial Tasks  Data Mining process  Methods in Financial Data mining o Neural Network o Decision Tree  Trading.
Prediction of Box Office Gross Revenue
Energy Consumption Forecast Using JMP® Pro 11 Time Series Analysis
Chapter 1: Statistics, Data and Statistical Thinking
QUALITY OF CARE TRENDS FOR CALIFORNIA CHILDREN
DATA COLLECTION METHODS IN NURSING RESEARCH
Enhancing the digital experience at the Smithsonian
Quality of Electronic Emergency Department Data: How Good Are They?
KAIR 2013 Nov 7, 2013 A Data Driven Analytic Strategy for Increasing Yield and Retention at Western Kentucky University Matt Bogard Office of Institutional.
Huyen Nguyen, Dung Phan, and Girish Shirodkar
Age at First Measles-Mumps-Rubella Vaccination in Children with Autism and School-Matched Control Subjects William W. Thompson, PhD Presented at the.
Understanding Non Response in the 1958 Birth Cohort
1% had incomes above $171,650 5% had incomes above $93,900
Creation of synthetic microdata in 2021 Census Transformation Programme (proof of concept) Robert Rendell.
Customer Segmentation Based on RFM and Predicting Defaulters
Campaign Fundamentals
Impact of State Reporting Laws on Central Line– Associated Bloodstream Infection Rates in U.S. Adult Intensive Care Units Hangsheng Liu, Carolyn T. A.
Trena M. Ezzati-Rice, Frederick Rohde, Robert Baskin
Antidepressant Use Among Working Age Canadians:
Predicting the Market Value of the Property Using JMP® Pro 11
A Longitudinal Study of Medical Student Primary Care Intent
Delwin Derksen Carlene Gilbert Kim Luchsinger Beth Mammenga Jon White
Sports & Entertainment Marketing II
Partners for Cancer Care & Prevention Fundraiser
Advanced Analytics Using Enterprise Miner
Predicting Government Spending on Professional Services
The European Statistical Training Programme (ESTP)
Initiating a Research Effort
Analytics: Its More than Just Modeling
Study Design/Methods Used
Clinical prediction models
Multivariate Analysis - Introduction
“The Spread of Physical Activity Through Social Networks”
Chapter 5: The analysis of nonresponse
Chapter 1: Statistics, Data and Statistical Thinking
Karen Wooten, MA Elizabeth Luman, MS Lawrence Barker, PhD
Presentation transcript:

Fundraising Analytics to identify potential prospects using SAS 12.1 Ramcharan Kakarla MS in MIS and SAS and OSU Data Mining Certificate, Oklahoma State University, Stillwater, OK 74078 Data has also been collected externally from the US census at a county level. External factors were considered in the following ratios: Health Behaviors 30%, Clinical Care 20%, Social and Economic Environment 40%, Physical Environment 10%. Different external data fields were combined to form a single external data set at a county level. Internal and External Data sets are merged to create the final dataset. The final dataset contained 221 variables with 247,000 records. Results Abstract Fundraising is an important activity for any foundation or charitable trust. It plays a pivotal role in development of the university propelling the growth and research. In fact it is often noted that the best universities around the world have deep roots in the history of Alumni giving. Solicitation involves cost; therefore it is important to keep a check on the dollar amount raised. The primary objective of the project is to distinguish the characteristics of donors from non-donors in the alumni using SAS Enterprise Miner 12.1. This enables the university foundation to minimize the solicitations count and raise maximum money. The predictive model estimates the likelihood of donation based on different inputs such as Age, Marital Status, Number of education records, Alumni pairs, degrees, scholarships, awards, events and so on related to the donor characteristics. The data for the project has been obtained from XYZ University Foundation Alumni database and enhanced with external data resulting in 221 variables and 247,000 records. Models such as Regression, Neural Networks, LARS, Decision Trees and Ensemble models have been built to predict the likelihood of donation. Age, Marital Status and Attendance at university events emerged as significant variables in the analysis. It is found that donation patterns vary widely across age groups based on marital status. Age, Martial Status and attendance to events conducted by XYZ university were found to be the most influencing factors in distinguishing donors from non donors for Annual Giving. Single are least likely to give. Widowed and divorced are most likely to make a donation starting from their late 30s. More Results Methodology Conclusions SAS 12.1 was used to analyze the data. The modeling approach followed for the project is SEMMA (Sample, Explore, Modify, Model and Assess). The data was portioned into two stratified samples (Training, Validation). Exploration is used for detecting trends and refining the data. The main objective of the project was to discover the factors that influence the annual giving patterns. The facts discovered through the process not only give insights to the strategic management but also provide a decision making process backed up by the data. SAS Enterprise miner is a very powerful tool that helped in finding these patterns given the amount of data thrown in with larger number of variables The variables are modified and transformed to adjust skewness and kurtosis values. Complete data for all the fields was available for only 10% of the data. Tree based Imputation methods were used to make necessary imputations. Directly related variables were rejected as part of analysis. Models ranging from decision trees, neural networks, logistic regressions, Dmine regressions were applied as part of the analysis. References Birkholz, Joshua. 2008. Fundraising Analytics: Using Data to Guide Strategy. Hoboken, New Jersey : John Wiley & Sons, Inc., 2008 Sargeant, Adrian. Marketing management for nonprofit organizations. Oxford: Oxford University Press, 1999 Baker, Michael John, and Susan J. Hart, eds. The marketing book. Routledge, 2008. Sun, Xiaogeng, Sharon C. Hoffman, and Marilyn L. Grady. "A multivariate causal model of alumni giving: Implications for alumni fundraisers." International Journal of Educational Advancement 7.4 (2007): 307-332. Background & Data Preparation Data Exploration & Model Building The project is focused on annual giving prospects of XYZ foundation. Annual Giving concentrates on donor acquisition. It inculcates the habit of donating on a regular basis. The main objective is to broaden the base of constituents in this area by building sustained relationships. XYZ foundation wants to make use of in-house data structure to identify the influencing factors for annual giving. Data has been obtained from XYZ University foundation Alumni database. Several tables were joined together before making up the final internal dataset. Data was spread across 9 tables with multiple instances and transaction histories. The information is flattened out using various transformations, recoding and binning of the variables. Few data fields had a substantial amount of missing values. The internal data encompassed details such as date of birth, gender, marital status, student activities, graduation year, solicitations, education details and other demographic details. Initial Data Exploration was done to identify the relations between the target and inputs. StatExplore was useful in identifying the usefulness of variables against the targets. Highly correlated input variables were removed. Clustering node was useful in identifying and picking up the important variables from the initial list of variables. After reducing the input space, the filtered set of variables were used to make the predictions about the responses. Tree models in general were more robust and predicted the responses well in this case. Dmine regression was found to be the best model based on Validation Misclassification rate. Acknowledgement I would like to thank Dr. Goutam Chakraborty for his generous support and supervision. Contact Your Comments and Questions are valued and encouraged. Ramcharan Kakarla, Oklahoma State University, OK email: ramcharan.kakarla@okstate.edu

Fundraising Analytics to identify potential prospects using SAS 12.1 Ramcharan Kakarla MS in MIS and SAS and OSU Data Mining Certificate, Oklahoma State University, Stillwater, OK 74078 Results Married are likely to make donations around age groups of 45 and above. It is also found that if the constituent is a volunteer there is a greater chance of donation The chances of donation are likely to go up if the constituent has attended an event conducted in the past 6 to 12 months 2 in every 3 are likely to make a donation if the constituent and spouse are from the university (Alumni Pair) The chances of donations are directly related to the number of educational records of the constituent Couples of same gender have a high likelihood of giving Scholarship is not a strong distinguisher among the constituents who donate and those who don’t Direct Mail Solicitations are found to be most influencing followed by email solicitations statistically Counties with better median incomes have a better prospect base As percent of smokers increase in a county the chances of donation are likely to go down The best months in terms of donation are December, January and May