Stat 112: Notes 1 Main topics of course: –Simple Regression –Multiple Regression –Analysis of Variance –Chapters 3-9 of textbook Readings for Notes 1:

Slides:



Advertisements
Similar presentations
Stat 112: Lecture 7 Notes Homework 2: Due next Thursday The Multiple Linear Regression model (Chapter 4.1) Inferences from multiple regression analysis.
Advertisements

Probabilistic & Statistical Techniques Eng. Tamer Eshtawi First Semester Eng. Tamer Eshtawi First Semester
Class 16: Thursday, Nov. 4 Note: I will you some info on the final project this weekend and will discuss in class on Tuesday.
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
Statistics for the Social Sciences Psychology 340 Spring 2005 Prediction cont.
Stat 112: Lecture 15 Notes Finish Chapter 6: –Review on Checking Assumptions (Section ) –Outliers and Influential Points (Section 6.7) Homework.
Lecture 23: Tues., Dec. 2 Today: Thursday:
Class 15: Tuesday, Nov. 2 Multiple Regression (Chapter 11, Moore and McCabe).
Lecture 16 – Thurs., March 4 Chi squared test for M&M experiment Simple linear regression (Chapter 7.2) Next class after spring break: Inference for simple.
Class 5: Thurs., Sep. 23 Example of using regression to make predictions and understand the likely errors in the predictions: salaries of teachers and.
Statistics 350 Lecture 16. Today Last Day: Introduction to Multiple Linear Regression Model Today: More Chapter 6.
Lecture 23: Tues., April 6 Interpretation of regression coefficients (handout) Inference for multiple regression.
Lecture 25 Regression diagnostics for the multiple linear regression model Dealing with influential observations for multiple linear regression Interaction.
Class 6: Tuesday, Sep. 28 Section 2.4. Checking the assumptions of the simple linear regression model: –Residual plots –Normal quantile plots Outliers.
Class 3: Thursday, Sept. 16 Reliability and Validity of Measurements Introduction to Regression Analysis Simple Linear Regression (2.3)
Examining Relationship of Variables  Response (dependent) variable - measures the outcome of a study.  Explanatory (Independent) variable - explains.
Gordon Stringer, UCCS1 Regression Analysis Gordon Stringer.
Stat 112: Lecture 8 Notes Homework 2: Due on Thursday Assessing Quality of Prediction (Chapter 3.5.3) Comparing Two Regression Models (Chapter 4.4) Prediction.
Chapter Topics Types of Regression Models
Lecture 24: Thurs., April 8th
RESEARCH STATISTICS Jobayer Hossain Larry Holmes, Jr November 6, 2008 Examining Relationship of Variables.
Stat Today: Multiple comparisons, diagnostic checking, an example After these notes, we will have looked at (skip figures 1.2 and 1.3, last.
Stat 112: Lecture 13 Notes Finish Chapter 5: –Review Predictions in Log-Log Transformation. –Polynomials and Transformations in Multiple Regression Start.
Stat Notes 5 p-values for one-sided tests Caution about forecasting outside the range of the explanatory variable (Chapter 3.7.2) Fitting a linear.
Stat Notes 4 Chapter 3.5 Chapter 3.7.
Stat 112 Notes 11 Today: –Fitting Curvilinear Relationships (Chapter 5) Homework 3 due Friday. I will Homework 4 tonight, but it will not be due.
Statistics 350 Lecture 17. Today Last Day: Introduction to Multiple Linear Regression Model Today: More Chapter 6.
Stat 112: Lecture 16 Notes Finish Chapter 6: –Influential Points for Multiple Regression (Section 6.7) –Assessing the Independence Assumptions and Remedies.
Least Squares Regression
1 1 Slide Simple Linear Regression Chapter 14 BA 303 – Spring 2011.
Simple Linear Regression
Statistics for the Social Sciences Psychology 340 Fall 2013 Tuesday, November 19 Chi-Squared Test of Independence.
Introduction to Linear Regression and Correlation Analysis
Relationship of two variables
New Seats – Block 1. New Seats – Block 2 Warm-up with Scatterplot Notes 1) 2) 3) 4) 5)
Statistics for the Social Sciences Psychology 340 Fall 2013 Correlation and Regression.
Stat 112 Notes 15 Today: –Outliers and influential points. Homework 4 due on Thursday.
Research Project Statistical Analysis. What type of statistical analysis will I use to analyze my data? SEM (does not tell you level of significance)
Multivariate Analysis. One-way ANOVA Tests the difference in the means of 2 or more nominal groups Tests the difference in the means of 2 or more nominal.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Multiple Regression Petter Mostad Review: Simple linear regression We define a model where are independent (normally distributed) with equal.
Stat 112 Notes 16 Today: –Outliers and influential points in multiple regression (Chapter 6.7)
Stat 112: Notes 2 Today’s class: Section 3.3. –Full description of simple linear regression model. –Checking the assumptions of the simple linear regression.
Regression Lesson 11. The General Linear Model n Relationship b/n predictor & outcome variables form straight line l Correlation, regression, t-tests,
Stat 112: Notes 1 Main topics of course: –Simple Regression –Multiple Regression –Analysis of Variance –Chapters 3-9 of textbook Readings for Notes 1:
Stat 112 Notes 10 Today: –Fitting Curvilinear Relationships (Chapter 5) Homework 3 due Thursday.
Stat 112 Notes 5 Today: –Chapter 3.7 (Cautions in interpreting regression results) –Normal Quantile Plots –Chapter 3.6 (Fitting a linear time trend to.
Chapter 10: Determining How Costs Behave 1 Horngren 13e.
Stat 112 Notes 6 Today: –Chapter 4.1 (Introduction to Multiple Regression)
© 2001 Prentice-Hall, Inc.Chap 13-1 BA 201 Lecture 18 Introduction to Simple Linear Regression (Data)Data.
Stat 112 Notes 6 Today: –Chapters 4.2 (Inferences from a Multiple Regression Analysis)
Stat 112 Notes 14 Assessing the assumptions of the multiple regression model and remedies when assumptions are not met (Chapter 6).
Lecturer: Ing. Martina Hanová, PhD.. Regression analysis Regression analysis is a tool for analyzing relationships between financial variables:  Identify.
Stat 112 Notes 11 Today: –Transformations for fitting Curvilinear Relationships (Chapter 5)
1 Objective Given two linearly correlated variables (x and y), find the linear function (equation) that best describes the trend. Section 10.3 Regression.
Stat 112 Notes 8 Today: –Chapters 4.3 (Assessing the Fit of a Regression Model) –Chapter 4.4 (Comparing Two Regression Models) –Chapter 4.5 (Prediction.
Predicting Energy Consumption in Buildings using Multiple Linear Regression Introduction Linear regression is used to model energy consumption in buildings.
The simple linear regression model and parameter estimation
CHAPTER 3 Describing Relationships
Chapter 5 LSRL.
Chapter 3.2 LSRL.
(Residuals and
Stat 112 Notes 4 Today: Review of p-values for one-sided tests
Simple Linear Regression
Least Squares Regression Line LSRL Chapter 7-continued
Simple Linear Regression
Chapter 5 LSRL.
Correlation and Regression
11C Line of Best Fit By Eye, 11D Linear Regression
Presentation transcript:

Stat 112: Notes 1 Main topics of course: –Simple Regression –Multiple Regression –Analysis of Variance –Chapters 3-9 of textbook Readings for Notes 1: Chapter Also, Chapter 2 contains review of material from Stat 111.

Monitoring Tiger Prey Abundance The Siberian (Amur) tiger is a species of tigers found in the Russian Far East. Tigers in general are in trouble. At the beginning of the 20 th century, there were around 100,000 tigers. Today, there are less than 6000 tigers in the world and there are only about 400 Siberian tigers. The Sika deer is a staple of the Siberian tiger diet. It is also hunted by the local people. To balance the needs of the local people and at the same time ensure there are adequate prey for tigers, local government managers need accurate estimates of the number of Sika deer in an area.

Estimating Deer Abundance Counting Method: The number of deer in a plot can be determined accurately but with considerable time and work. It requires 3- 5 expert field workers to monitor the plot and to classify whether deer tracks are moving into or out of the plot. Can “total tracks counted” be used to estimate the number of deer in the plot? This is much easier to collect.

Deer Density vs. Tracks Counted Study was done in which density was determined by expert field workers over a range of plots. How would we estimate the deer density if we counted 1 track per squared kilometer?

Simple Regression Model How would we estimate the deer density if we counted 1 track per squared kilometer? Idea: Estimate the mean deer density when we count 1 track per squared kilometer. Simple Regression Setup: –Y=outcome (density per km squared) –X=explanatory variable (tracks counted per km squared –Note: outcome is sometimes called dependent variable and explanatory variable is sometimes called independent or predictor variable Simple Regression Model: Model for the mean (expected value) of Y given X, denoted

Simple Linear Regression Model

Using the Simple Linear Regression Model for Estimating Deer Density A;dklsfkaj;s

Estimating the Slope and Intercept

Simple Linear Regression Using JMP Use Analyze, Fit Y by X. Put response variable in Y and explanatory variable in X (make sure X is continuous by clicking on the X column, clicking Cols and Column Info and checking that the Modeling Type is Continuous). Click on fit line under red triangle next to Bivariate Fit of Y by X.

Residuals

Root Mean Square Error Technical Note: RMSE^2 is average squared residual. RMSE is close to but not exactly average absolute residual

Poverty and MDs Do states with more poverty tend to have fewer doctors? Which states have an unusually high number of doctors given their poverty rate or an unusally low number of doctors given their poverty rate.

Residuals in JMP Saving the residuals in JMP: –To save the residuals, after fitting the line using Fit Y by X, click the red triangle next to linear fit and click save residuals. A column with the residuals is created on the data spreadsheet. –The residuals can be sorted by clicking Sorting the residuals: –Click the table menu, then click sort, click the name of the column with the residuals, click by and then click sort. Labeling observations: –To label an observation in the graph, click the row with the observation and then click the rows menu and label. By default, JMP will use the observation number to label the observation. To make JMP use state to label the observation, click the state column, click the Cols menu and click label

Residuals for Poverty-MD Data

Summary for Notes 1 Regression Model: Model for the mean of an outcome Y given a value of the explanatory variable X, E(Y|X). Simple Linear Regression Model: Regression Models are useful for: –Predicting Y from X –Understanding the association between Y and X. –Identifying observations that are unusual in their relationship between Y and X (large magnitude of residuals).