CS548 Showcase Using SPSS for Data Mining Ahmedul Kabir.



References
- 01.ibm.com/software/analytics/spss/
- reg_spss.htm (for a good interpretation of the results)

What is SPSS
- Software package used for statistical analysis of data.
- Originally produced by SPSS Inc.
- SPSS used to stand for "Statistical Package for the Social Sciences".
- Later changed to "Statistical Product and Service Solutions".
- Later acquired by IBM; now known as IBM SPSS Statistics.

More on SPSS software
- Current version is 22.0
- SPSS is commercial software
- Statistics 17.0 (basic package) is freely available to WPI students
- Several specialized packages can be bought:
  - SPSS Data Collection (for surveys)
  - SPSS Modeler (for data mining)
  - SPSS Analytic Catalyst (for big data), etc.

File formats
- Basic format is .SAV
- Supports other common formats such as .XLSX, .CSV, .DAT, etc.
- An SPSS syntax file (.SPS) can be used to convert other formats to SPSS format

Reasons for NOT using SPSS
- Expensive (at least, not free!)
- Basic package is not tailored for data mining
- Heavy software

Why use it then?
- Very rich collection of statistical tests and methods
- Outputs an extensive set of metrics and statistically important factors
- Support available
- Well known in non-CS fields

Demo (Data view)

Demo (Variable view)

Available Features

Demo (Linear Regression)

Demo Output

Predicted Score = … * Age … * Level of Education – 0.22 * Years with current employer + …
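A prediction equation of this form is a constant plus each predictor multiplied by its unstandardized coefficient from the regression output. The following is a minimal Python sketch of that arithmetic, not the presenter's actual model. Only the -0.22 coefficient for years with current employer comes from the slide. The constant, the other two coefficients, the variable names, and the example case are hypothetical placeholders.

```python
# Unstandardized coefficients as they might be read from a regression
# coefficients table. Only -0.22 comes from the slide; the constant and the
# other two values are hypothetical placeholders for illustration.
COEFFICIENTS = {
    "age": 0.05,                   # hypothetical
    "education_level": 0.10,       # hypothetical
    "years_with_employer": -0.22,  # from the slide
}
INTERCEPT = 1.0                    # hypothetical constant term


def predict_score(case, coefficients=COEFFICIENTS, intercept=INTERCEPT):
    """Return intercept + sum(B_i * x_i) for one case (a dict of predictor values)."""
    return intercept + sum(b * case[name] for name, b in coefficients.items())


# Hypothetical case: predictor values for a single record.
example_case = {"age": 40, "education_level": 2, "years_with_employer": 10}
print(round(predict_score(example_case), 2))
```

With a negative coefficient such as -0.22, each additional year with the current employer lowers the predicted score by 0.22, holding the other predictors fixed.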

Thank You! Questions?