Statistical Modeling with SAS/STAT Cheng Lei Department of Electrical and Computer Engineering University of Victoria April 9, 2015.

Slides:



Advertisements
Similar presentations
Uncertainty Quantification & the PSUADE Software
Advertisements

Experimental Design, Response Surface Analysis, and Optimization
Prepared by Diane Tanner University of North Florida Chapter 10 1 Cost Estimation.
Building and Testing a Theory Steps Decide on what it is you want to explain or predict. 2. Identify the variables that you believe are important.
Regression Analysis. Unscheduled Maintenance Issue: l 36 flight squadrons l Each experiences unscheduled maintenance actions (UMAs) l UMAs costs $1000.
BA 555 Practical Business Analysis
GRA 6020 Multivariate Statistics Ulf H. Olsson Professor of Statistics.
Multivariate Data Analysis Chapter 4 – Multiple Regression.
CHAPTER 4 ECONOMETRICS x x x x x Multiple Regression = more than one explanatory variable Independent variables are X 2 and X 3. Y i = B 1 + B 2 X 2i +
Met 2212 Multivariate Statistics
©2003 Prentice Hall Business Publishing, Cost Accounting 11/e, Horngren/Datar/Foster Determining How Costs Behave Chapter 10 2/07/05.
GRA 6020 Multivariate Statistics Ulf H. Olsson Professor of Statistics.
Arizona State University DMML Kernel Methods – Gaussian Processes Presented by Shankar Bhargav.
Discriminant Analysis Testing latent variables as predictors of groups.
Determining How Costs Behave
Quantitative Business Analysis for Decision Making Multiple Linear RegressionAnalysis.
3. Multiple Regression Analysis: Estimation -Although bivariate linear regressions are sometimes useful, they are often unrealistic -SLR.4, that all factors.
1. An Overview of the Data Analysis and Probability Standard for School Mathematics? 2.
Simple Linear Regression. Types of Regression Model Regression Models Simple (1 variable) LinearNon-Linear Multiple (2
Inference for regression - Simple linear regression
Cost Behavior Analysis
Machine Learning1 Machine Learning: Summary Greg Grudic CSCI-4830.
Learning Objective 1 Explain the two assumptions frequently used in cost-behavior estimation. Determining How Costs Behave – Chapter10.
Chapter 12: Linear Regression 1. Introduction Regression analysis and Analysis of variance are the two most widely used statistical procedures. Regression.
Copyright © 2003 Pearson Education Canada Inc. Slide Chapter 10 Determining How Costs Behave.
© 2001 Prentice-Hall, Inc. Statistics for Business and Economics Simple Linear Regression Chapter 10.
1 Dr. Jerrell T. Stracener EMIS 7370 STAT 5340 Probability and Statistics for Scientists and Engineers Department of Engineering Management, Information.
Prediction of Malignancy of Ovarian Tumors Using Least Squares Support Vector Machines C. Lu 1, T. Van Gestel 1, J. A. K. Suykens 1, S. Van Huffel 1, I.
Repeated Measurements Analysis. Repeated Measures Analysis of Variance Situations in which biologists would make repeated measurements on same individual.
EMBC2001 Using Artificial Neural Networks to Predict Malignancy of Ovarian Tumors C. Lu 1, J. De Brabanter 1, S. Van Huffel 1, I. Vergote 2, D. Timmerman.
Clustering Procedure Cheng Lei Department of Electrical and Computer Engineering University of Victoria April 16, 2015.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Regression Analysis Part B Calculation Procedures Read Chapters 3, 4 and 5 of Forecasting and Time Series, An Applied Approach.
Multiple Regression Petter Mostad Review: Simple linear regression We define a model where are independent (normally distributed) with equal.
An Overview of SAS University Edition Cheng Lei Department of Electrical and Computer Engineering University of Victoria Mar 12, 2015.
Multivariate Dyadic Regression Trees for Sparse Learning Problems Xi Chen Machine Learning Department Carnegie Mellon University (joint work with Han Liu)
Chapter 13 Multiple Regression
GoldSim Technology Group LLC, 2006 Slide 1 Sensitivity and Uncertainty Analysis and Optimization in GoldSim.
Linear Discriminant Analysis and Its Variations Abu Minhajuddin CSE 8331 Department of Statistical Science Southern Methodist University April 27, 2002.
Chapter 11 Statistical Techniques. Data Warehouse and Data Mining Chapter 11 2 Chapter Objectives  Understand when linear regression is an appropriate.
Multivariate Data Analysis Chapter 1 - Introduction.
© John M. Abowd 2007, all rights reserved General Methods for Missing Data John M. Abowd March 2007.
1 Inferences about a Mean Vector Shyh-Kang Jeng Department of Electrical Engineering/ Graduate Institute of Communication/ Graduate Institute of Networking.
Chapter 10: Determining How Costs Behave 1 Horngren 13e.
Mixed Cost Analysis 3 Fixed And Variable Costs Cost Behavior – Mixed Costs y x Cost Activity level y x Cost Activity level a y x Cost Activity level.
Week 41 How to find estimators? There are two main methods for finding estimators: 1) Method of moments. 2) The method of Maximum likelihood. Sometimes.
1 Statistics & R, TiP, 2011/12 Neural Networks  Technique for discrimination & regression problems  More mathematical theoretical foundation  Works.
© 2009 Pearson Prentice Hall. All rights reserved. CHAPTER 10 Determining How Costs Behave.
1 Chapter 8: Model Inference and Averaging Presented by Hui Fang.
Stats Term Test 4 Solutions. c) d) An alternative solution is to use the probability mass function and.
Blackbox classifiers for preoperative discrimination between malignant and benign ovarian tumors C. Lu 1, T. Van Gestel 1, J. A. K. Suykens 1, S. Van Huffel.
Classification Cheng Lei Department of Electrical and Computer Engineering University of Victoria April 24, 2015.
McGraw-Hill/Irwin © 2003 The McGraw-Hill Companies, Inc.,All Rights Reserved. Part Four ANALYSIS AND PRESENTATION OF DATA.
Prepared by Diane Tanner University of North Florida ACG Cost Estimation 4-1.
Stochasticity and Probability. A new approach to insight Pose question and think of the answer needed to answer it. Ask: How do the data arise? What is.
Determining How Costs Behave
The simple linear regression model and parameter estimation
Multiple Random Variables and Joint Distributions
Linear Regression.
CJT 765: Structural Equation Modeling
Determining How Costs Behave
Ch3: Model Building through Regression
Probability & Statistics Probability Theory Mathematical Probability Models Event Relationships Distributions of Random Variables Continuous Random.
Regression Models - Introduction
The Regression Model Suppose we wish to estimate the parameters of the following relationship: A common method is to choose parameters to minimise the.
BASIC REGRESSION CONCEPTS
Multivariate Methods Berlin Chen
Name:_________________
A Gentle Tutorial of the EM Algorithm and its Application to Parameter Estimation for Gaussian Mixture and Hidden Markov Models Jeff A. Bilmes International.
A Gentle Tutorial of the EM Algorithm and its Application to Parameter Estimation for Gaussian Mixture and Hidden Markov Models Jeff A. Bilmes International.
Presentation transcript:

Statistical Modeling with SAS/STAT Cheng Lei Department of Electrical and Computer Engineering University of Victoria April 9, 2015

Outline ❖ Overview of SAS/STAT ❖ Statistical Models ❖ Classes of Statistical Models ❖ Next step

SAS/STAT Overview Over 70 procedures Most of the procedures dedicated to solving problems in statistical modeling The Output Delivery System (ODS) included to draw all kinds of graphs and produce different format result reports (PDF, HTML, CSV, and etc.)

Statistical Models Deterministic and Stochastic Models Model-Based and Design-Based randomness Model Specification

Deterministic & Stochastic Models Deterministic Models The relationship between inputs and outputs are theoretically in deterministic fashion Very important theoretical tools But impractical for experimental data

Deterministic & Stochastic Models Stochastic Models The outputs are uncertainly and affected by some parameters Parameters are unknown constants and needed to be estimated based on the assumptions about the model and the data

Model-Based & Design- Based Randomness Model-Based Randomness Innate randomness Source of random variation comes from the unknown variation in the response Design-Based Randomness Induced randomness Random variation in the data is induced by random selection

Model Specification Model selection diagnosis discrimination

Classes of Statistical Models Linear & nonlinear Models Univariate & Multivariate Models Regression Models & Models with Classification Effects Fixed, Random, and Mixed Models Generalized Linear Models Latent Variable Models Bayesian Models

Univariate & Multivariate Models Univariate Models Each response variable is modeled separately i.e.: Mutilvariate Models Multiple response variables are modeled jointly i.e.:

Latent Variable Models Involve variables not directly observed in the research Make a hypothesis the factors as the potential ones Apply regression models Apply the evaluation methods to validate the model

Next week’s work Cluster Procedures Classifier Procedures Output Delivery System Ways to plot graphs Ways to form result reports

Thank You!!!