RESEARCH HUB AT THE UNIVERSITY LIBRARIES PENN STATE UNIVERSITY TOUR OF STATISTICAL PACKAGES.

Slides:



Advertisements
Similar presentations
Summary Statistics/Simple Graphs in SAS/EXCEL/JMP.
Advertisements

Statistical Methods Lynne Stokes Department of Statistical Science Lecture 7: Introduction to SAS Programming Language.
Tutorial 8: Developing an Excel Application
Tutorial 12: Enhancing Excel with Visual Basic for Applications
Microsoft Excel 2013 An Overview. Environment Quick Access Toolbar Customizable toolbar for one-click shortcuts Tabs Backstage View Tools located outside.
Financial Data Calculator© Produced by: Mathematical Investment Decisions, Inc. 95 West Gate Drive – 2 nd Floor Cherry Hill, NJ Web site:
Software for data management: The contribution of Stata Dr Karen Robson, Senior Research Fellow, The Geary Institute, University College Dublin, Ireland.
Srinivasulu Rajendran Centre for the Study of Regional Development (CSRD) School of Social Sciences (SSS) Jawaharlal Nehru University (JNU) New Delhi -
Welcome to E-Prime E-Prime refers to the Experimenter’s Prime (best) development studio for the creation of computerized behavioral research. E-Prime is.
Ann Arbor ASA ‘Up and Running’ Series: SPSS Prepared by volunteers of the Ann Arbor Chapter of the American Statistical Association, in cooperation with.
Introduction to Statistical Computing in Clinical Research Biostatistics 212 Course director: Mark Pletcher Teaching Assistant: Lee Zane.
SADC Course in Statistics Adding a statistics package Module I3, Session 13.
A Simple Guide to Using SPSS© for Windows
Lecture 2 LISAM. Statistical software.. LISAM What is LISAM? Social network for Creating personal pages Creating courses  Storing course materials (lectures,
1 Business Intelligence: Report Creation and Automation Using Business Objects Dylan Black University of Wisconsin – Platteville
SPSS 1: An Introduction to the Statistical Package SPSS Suzie Cro MRC Clinical Trials Unit.
Version 4 for Windows NEX T. Welcome to SphinxSurvey Version 4,4, the integrated solution for all your survey needs... Question list Questionnaire Design.
Introduction to SPSS Short Courses Last created (Feb, 2008) Kentaka Aruga.
Introduction to SAS Math 3200 Jan Jimin Ding.
May 21, 2009 Stata for micro targeting using C++ and ODBC Masahiko Aida.
What is R Muhammad Omer. What is R  R is the programing language software for statistical computing and data analysis  The R language is extensively.
Air Quality Data Analysis Using Open Source Tools
XP New Perspectives on Microsoft Access 2002 Tutorial 71 Microsoft Access 2002 Tutorial 7 – Integrating Access With the Web and With Other Programs.
Introduction to SPSS (For SPSS Version 16.0)
Introduction to R Statistical Software Anthony (Tony) R. Olsen USEPA ORD NHEERL Western Ecology Division Corvallis, OR (541)
TrendReader Standard 2 This generation of TrendReader Standard software utilizes the more familiar Windows format (“tree”) views of functions and file.
SENG521 (Fall SENG 521 Software Reliability & Testing Software Reliability Tools (Part 8a) Department of Electrical & Computer.
Lecture 1: Introduction Lecture series based on the text: Essential MATLAB for Engineers and Scientists By Hahn & Valentine
Biostatistics, statistical software II. A brief survey of statistical program systems Krisztina Boda PhD Department of Medical Informatics, University.
11 Chapter 3: Getting Started with Tasks 3.1 Introduction to Tasks and Wizards 3.2 Creating a Frequency Report 3.3 Generating HTML, PDF, and RTF Output.
Advanced Statistics for Interventional Cardiologists.
Tutor: Prof. A. Taleb-Bendiab Contact: Telephone: +44 (0) CMPDLLM002 Research Methods Lecture 9: Quantitative.
 Overview of SPSS  Interface  Getting Started  Managing Data  Descriptive Statistics  Basic Analysis  Additional Resources.
Data Analysis Using SPSS
Objectives Understand what MATLAB is and why it is widely used in engineering and science Start the MATLAB program and solve simple problems in the command.
SPSS Presented by Chabalala Chabalala Lebohang Kompi Balone Ndaba.
Ranjeet Department of Physics & Astrophysics University of Delhi Working with Origin.
Chapter 1 Introduction to SAS ® Enterprise Guide ®
Introduction to STATA for Clinical Researchers Jay Bhattacharya August 2007.
1 Computer Programming (ECGD2102 ) Using MATLAB Instructor: Eng. Eman Al.Swaity Lecture (1): Introduction.
CERN - IT Department CH-1211 Genève 23 Switzerland t DB Development Tools Benthic SQL Developer Application Express WLCG Service Reliability.
ISU Basic SAS commands Laboratory No. 1 Computer Techniques for Biological Research Animal Science 500 Ken Stalder, Professor Department of Animal Science.
Introduction to SPSS. Object of the class About the windows in SPSS The basics of managing data files The basic analysis in SPSS.
Introduction to Statistical Computing in Clinical Research Biostatistics 212.
Developed By Information Technology Services University Of Saskatchewan.
An Overview of SAS University Edition Cheng Lei Department of Electrical and Computer Engineering University of Victoria Mar 12, 2015.
WRITING REPORTS Introduction Section 0 Lecture 1 Slide 1 Lecture 6 Slide 1 INTRODUCTION TO Modern Physics PHYX 2710 Fall 2004 Intermediate 3870 Fall 2015.
1 An Introduction to SPSS for Windows Jie Chen Ph.D. 6/4/20161.
SPSS- Tutorial The following power-point slides show you how to use some of the features in SPSS. A survey of 20 randomly selected companies asked them.
BOĞAZİÇİ UNIVERSITY DEPARTMENT OF MANAGEMENT INFORMATION SYSTEMS MATLAB AS A DATA MINING ENVIRONMENT.
Comparison of different output options from Stata
Analysis Introduction Data files, SPSS, and Survey Statistics.
1.Introduction to SPSS By: MHM. Nafas At HARDY ATI For HNDT Agriculture.
1 FREE SAS SOFTWARE. 2 FREE SOFTWARE Free SAS ® software. SAS STUDIO; An interactive, online community. Superior training and documentation. And the analytical.
Data Analysis with SPSS Lee Pierce Keith Mulbery Jason Archibald.
1 EPIB 698C Lecture 1 Instructor: Raul Cruz-Cano
Based on Learning SAS by Example: A Programmer’s Guide Chapters 1 & 2
Online Programming| Online Training| Real Time Projects | Certifications |Online Classes| Corporate Training |Jobs| CONTACT US: STANSYS SOFTWARE SOLUTIONS.
With the support of the LPP programme of the European Union 1 This project has been funded with support from the European Commission. This publication.
N U I T, Research Support Our remit is to support researchers and includes support in: Statistical Computing Numerical Analysis (Condor, HPC, cloud computing,
A quick guide to other statistical software
CST 1101 Problem Solving Using Computers
Statistics in SPSS Lecture 2
By Dr. Madhukar H. Dalvi Nagindas Khandwala college
DEPARTMENT OF COMPUTER SCIENCE
Lecture 1: Introduction
Welcome to E-Prime E-Prime refers to the Experimenter’s Prime (best) development studio for the creation of computerized behavioral research. E-Prime is.
CSCI N207 Data Analysis Using Spreadsheet
Analytics: Its More than Just Modeling
Using SPSS Note: The use of another statistical package such as Minitab is similar to using SPSS.
Presentation transcript:

RESEARCH HUB AT THE UNIVERSITY LIBRARIES PENN STATE UNIVERSITY TOUR OF STATISTICAL PACKAGES

OVERVIEW Explore six different common statistical software packages Overview Common fields Pros and cons General usage Examples Where can we use these on campus? Additional resources

PACKAGES R SAS Minitab JMP STATA SPSS Others not explored: Excel, MATLAB, Stat-Ease, SQL, Nvivo, AMOS, S-plus

WHERE CAN WE USE THESE ON CAMPUS? R is free and can be downloaded in both permanent and portable forms online All those explored here can be found at all labs on campus Find labs at Nvivo (not explored) is only found in Hammond 317 and Sparks 6 The following can be found on WebApps: Excel Minitab SAS JMP MATLAB

ADDITIONAL RESOURCES Research Hub: Training and tutorials Consulting for data, statistics, and GIS Research guides Data management toolkit Other services Quick tutorials in Minitab, SAS, R, and SPSS: Statistical Consulting Center: Survey Research Center: HHD Methodology Consulting Group: Penn State Census Research Data Center (coming soon)

EXPLORING R

R: OVERVIEW Free, open-source software; similar to S-plus Multiple add-ons and extensions available, including integration with LaTeX ( a word processor) via RStudio, and Excel via RExcel Extensive online help manuals and forums Used by many statisticians and computer scientists for data mining, data analysis, and development of statistical methodology Case-sensitive language Common fields: Statistical science Computational biology Computer science Quantitative finance Engineering

R: PROS AND CONS Pros: Widely used in both industry and academia Flexible and customizable analyses and graphics Great for: Data manipulation, editing, and coding Data mining Simulations Survival analysis Linear and nonlinear modeling Data warehousing Multivariate analysis Nonparametric methods Hypothesis testing Categorical analysis Time series analysis Sample size calculation/power analysis Optimization Cons: Scripting programming language Mediocre graphics Not as useful for: Graphical analysis Data summary Exploratory analysis Quality assessment and improvement Design of experiments

R: USAGE Data can be read in through code or created Variables and functions can be created and renamed Multiple data sets can be handled at once Editor window is used to write and save commands Console window reads commands and displays output, which is best saved by copying and pasting into a word processing document Graphs are outputted in separate window, which is overwritten for each new graph unless otherwise indicated in commands Workspaces can be saved, meaning data sets and variables do not need to be recreated (especially useful if data creation and manipulation take a long time to run)

R: EXAMPLES Read in data set from a text file Create a variable Find online help Run a t-test Create a histogram

R: EXAMPLES Read in data set from a text file

R: EXAMPLES Create a variable

R: EXAMPLES Find online help

R: EXAMPLES Run a t-test

R: EXAMPLES Create a histogram

EXPLORING SAS

SAS: OVERVIEW Major statistical software in many industries Multiple add-ons and extensions available, including integration of SQL programming language and integration with JMP Extensive online help manuals and forums Used by many statisticians and computer scientists for data mining, data analysis, and development of statistical methodology Not case-sensitive language Offers various certifications, which many employers value highly Common fields: Statistical science Sociology Manufacturing Pharmaceutical science Agriculture Computer science Quantitative finance Engineering

SAS: PROS AND CONS Pros: Widely used in both industry and academia High-performance architecture that supports computationally-intensive algorithms Flexible and customizable analyses and graphics Great for: Data manipulation, editing, and coding Data mining Graphical analysis Data summary Exploratory analysis Simulations Forecasting Survival analysis Linear and nonlinear modeling Quality assessment and improvement Data warehousing Multivariate analysis Nonparametric methods Hypothesis testing Categorical analysis Time series analysis Sample size calculation/power analysis Design of experiments Optimization Cons: Scripting programming language Expensive Some versions are not 100% compatible Not as useful for: Simple analysis and manipulation

SAS: USAGE Data can be read in through a command or imported through menu-driven prompts Variables and functions can be created and renamed Multiple data sets can be handled at once and are stored in various workspaces (“libraries”) Four types of commands: DATA step (read & edit data); Procedure steps (run built-in functions); macros (create and run own function); ODS statements (set output settings, styles, etc.) Editor window is used to write and save commands Log window reads commands and displays any errors or comments Output window displays some output created by commands Results viewer window displays most output, including graphs Can save only commands, only data, or whole project

SAS: EXAMPLES Import data from a text file Display data set Create new data set and add a variable Run a regression with diagnostic plots

SAS: EXAMPLES Import data from a text file

SAS: EXAMPLES Import data from a text file

SAS: EXAMPLES Display data set

SAS: EXAMPLES Create new data set and add a variable

SAS: EXAMPLES Run a regression with diagnostic plots

SAS: EXAMPLES Run a regression with diagnostic plots

EXPLORING MINITAB

MINITAB: OVERVIEW Menu-driven statistical software, but does have scripting language available for typing commands or creating macros Used in most Six Sigma courses and workshops Help documentation located in software as well as online Used by many analysts to quantitatively make decisions Common fields: Social science Marketing Education Sociology Manufacturing Agriculture Pharmaceutical science Engineering

MINITAB: PROS AND CONS Pros: Commonly used in industry and some academic settings Easy-to-use menu-driven software Clear output and graphics with some interactive features Has an “Assistant” feature that includes flow- charts and takes users step-by-step to analyze data properly Used in most undergraduate statistics courses; there are example data sets included in software Great for: Data manipulation, editing, and coding Graphical analysis Exploratory data analysis Data summary Forecasting Survival analysis Linear and nonlinear modeling (standard) Quality assessment and improvement Hypothesis testing Categorical analysis Time series analysis Design of experiments Optimization Cons: Limited options for analyses Can only analyze one data set at a time Does not work as well with large data sets Not as much help available as some other packages Not as useful for: Simulations Data mining Data warehousing Multivariate analysis Nonparametric methods Sample size calculation/power analysis Advanced or complex modeling

MINITAB: USAGE Data can be typed in, copied and pasted from a text or Excel file, or imported through menu-driven prompts New variables can be added to worksheet or created using formulas Worksheets contain raw data and only one worksheet can be active at a time Can create and save macros and/or commands Session window displays output Graphs and other visual charts are shown in individual windows Project manager contains outline that helps you to jump to particular output Worksheet can be saved separately, but saving whole project will save both worksheet and output

MINITAB: EXAMPLES Copy data into Minitab from a text file Create a new variable using formula Use Assistant to do a graphical analysis Create a factorial design for an experiment

MINITAB: EXAMPLES Copy data into Minitab from a text file

MINITAB: EXAMPLES Create a new variable using formula

MINITAB: EXAMPLES Use Assistant to do a graphical analysis

MINITAB: EXAMPLES Use Assistant to do a graphical analysis

MINITAB: EXAMPLES Use Assistant to do a graphical analysis

MINITAB: EXAMPLES Create a factorial design for an experiment

MINITAB: EXAMPLES Create a factorial design for an experiment

EXPLORING JMP

JMP: OVERVIEW Menu-driven statistical software, but does have scripting language available for typing commands or creating macros Can integrate with SAS, including running SAS commands, importing or exporting SAS data sets, and opening SAS projects Help documentation located in software as well as online Common fields: Statistical science Manufacturing Pharmaceutical science Engineering

JMP: PROS AND CONS Pros: Easy-to-use menu-driven software Many menu option windows are interactive and intuitive Powerful software with more options than other menu-driven software Output and graphs are very customizable and interactive, with options even after running the analysis Great for: Data manipulation, editing, and coding Graphical analysis Exploratory data analysis Data summary Forecasting Survival analysis Linear and nonlinear modeling (standard) Quality assessment and improvement Multivariate analysis Categorical analysis Nonparametric methods Time series analysis Sample size calculation/power analysis Design of experiments Optimization Cons: Not as widely used as some other packages but still very powerful Can only analyze one data set at a time Does not work as well with large data sets Not as much help available as some other packages Not as useful for: Simulations Data mining Data warehousing Hypothesis testing Advanced or complex modeling

JMP: USAGE Data can be typed in, copied and pasted from a text or Excel file, imported from SAS, or converted from other files (such as a.txt, etc.) New variables can be added to worksheet or created using formulas Data tables contain raw data and only one data table can be active at a time Can create and save macros and/or commands Log window allows you to input commands and view output Script window contains the commands used to run the same analysis done through the menu-driven prompts Each data table will create its own output window for graphs and other output Data tables and projects are saved separately Graphics and other output can be saved into a Journal, which is saved separately and can be opened in Word, etc., making it convenient to store results

JMP: EXAMPLES Convert text file into a JMP data table Summarize group means Change table values from mean values to standard deviation values Fit a binary logistic regression model

JMP: EXAMPLES Convert text file into a JMP data table

JMP: EXAMPLES Summarize group means

JMP: EXAMPLES Summarize group means

JMP: EXAMPLES Change table values from mean values to standard deviation values

JMP: EXAMPLES Fit a binary logistic regression model

EXPLORING STATA

STATA: OVERVIEW Utilizes both menu-driven selections and scripting commands Multiple versions available depending on needs (commercial, educational, etc.) Extensive help documentation and technical support Contains both basic and advanced statistical methods Not case-sensitive language Common fields: Economics Sociology Political science Pharmaceutical Epidemiology

STATA: PROS AND CONS Pros: Somewhat common in both industry and academia Somewhat flexible and customizable Contains up-to-date advanced methods Quality graphics Great for: Data manipulation, editing, and coding Graphical analysis Data summary Exploratory analysis Data mining Simulations Survival analysis Linear and nonlinear modeling Data warehousing Multivariate analysis Nonparametric methods Hypothesis testing Categorical analysis Time series analysis Sample size calculation/power analysis Cons: Scripting programming language Can only analyze one data set at a time Does not work as well with large data sets Not as useful for: Quality assessment and improvement Design of experiments Optimization

STATA: USAGE Data can be typed in, read in through code, copied and pasted from a text or Excel file, or imported and converted from other files (such as a.txt, etc.) Command window is used to write and run commands Review window displays previous analysis, which can be selected to run again Project window displays all input and output, including graphs Store and edit data in the Data Editor, which can be saved on its own Log will copy and automatically save the project for you (must start and close log before and after the analyses you want to save)

STATA: EXAMPLES Copy data from a text file into STATA Recode variable Create a frequency table using commands Run a Wilcoxon Rank-Sum test using menu options

STATA: EXAMPLES Copy data from a text file into STATA

STATA: EXAMPLES Recode variable

STATA: EXAMPLES Create a frequency table using commands

STATA: EXAMPLES Run a Wilcoxon Rank-Sum test using menu options

STATA: EXAMPLES Run a Wilcoxon Rank-Sum test using menu options

EXPLORING SPSS

SPSS: OVERVIEW Menu-driven statistical software, but does have scripting language available for typing commands or creating macros Used in conjunction with many common survey platforms, and is the leading software for analyzing survey data Help documentation located in software as well as online Plug-ins available for other programming languages, such as JAVA, Python, R, and VB Used by many analysts to quantitatively make decisions Common fields: Social science Marketing Education Sociology Healthcare Government

SPSS: PROS AND CONS Pros: Commonly used in industry, especially those that utilize survey data Easy-to-use menu-driven software Output and graphics are clear and well- organized Separate “Data” and “Variable” tabs in data worksheet make it easy to switch from raw data to variable information (labels, codes, variable type, etc.) Can use other programing languages (Python, R, JAVA, VB) with plug-ins Great for: Data manipulation, editing, and coding Graphical analysis Exploratory data analysis Data summary Data warehousing Forecasting Linear and nonlinear modeling (standard) Quality assessment and improvement Hypothesis testing Multivariate analysis Nonparametric methods Categorical analysis Time series analysis Cons: Limited options for analyses Can only analyze one data set at a time Not as much help available as some other packages Not as useful for: Simulations Data mining Survival analysis Sample size calculation/power analysis Advanced or complex modeling Design of experiments Optimization

SPSS: USAGE Data can be typed in, copied and pasted from a text or Excel file, imported through menu-driven prompts, or read in from a ASCII file using Syntax editor New variables can be added to worksheet or created using formulas Datasets contain raw data and only one dataset can be active at a time Can create and save macros and/or commands Output window displays output, including graphs Output can be copied and pasted into other documents Project manager contains outline that helps you to jump to particular output Dataset and Outputs are saved separately Optional syntax window can read and run commands and can also be saved separately

SPSS: EXAMPLES Cody data from text file into SPSS spreadsheet Edit variable names and information Create a contingency table Fit a linear model

SPSS: EXAMPLES Cody data from text file into SPSS spreadsheet

SPSS: EXAMPLES Edit variable names and information

SPSS: EXAMPLES Edit variable names and information

SPSS: EXAMPLES Create a contingency table

SPSS: EXAMPLES Create a contingency table

SPSS: EXAMPLES Fit a linear model

SPSS: EXAMPLES Fit a linear model