Linear regression issues in astronomy Eric Feigelson Summer School in astrostatistics References Isobe, Feigelson, Akritas & Babu, ApJ 364, 105 1990 Feigelson.

Slides:



Advertisements
Similar presentations
Regression Eric Feigelson Lecture and R tutorial Arcetri Observatory April 2014.
Advertisements

Lesson 10: Linear Regression and Correlation
Forecasting Using the Simple Linear Regression Model and Correlation
Extension The General Linear Model with Categorical Predictors.
Conclusion to Bivariate Linear Regression Economics 224 – Notes for November 19, 2008.
Homoscedasticity equal error variance. One of the assumption of OLS regression is that error terms have a constant variance across all value so f independent.
Regression Analysis Simple Regression. y = mx + b y = a + bx.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Data Analysis: Bivariate Correlation and Regression CHAPTER sixteen.
Learning Objectives Copyright © 2004 John Wiley & Sons, Inc. Bivariate Correlation and Regression CHAPTER Thirteen.
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: What it Is and How it Works. Overview What is Bivariate Linear Regression? The Regression Equation How It’s Based on r Assumptions.
1-1 Regression Models  Population Deterministic Regression Model Y i =  0 +  1 X i u Y i only depends on the value of X i and no other factor can affect.
Regression Diagnostics - I
Topics: Regression Simple Linear Regression: one dependent variable and one independent variable Multiple Regression: one dependent variable and two or.
© 2000 Prentice-Hall, Inc. Chap Forecasting Using the Simple Linear Regression Model and Correlation.
Correlation and Regression Analysis
Regression Eric Feigelson. Classical regression model ``The expectation (mean) of the dependent (response) variable Y for a given value of the independent.
Ordinary Least Squares
Introduction to Linear Regression and Correlation Analysis
Econometrics: The empirical branch of economics which utilizes math and statistics tools to test hypotheses. Special courses are taught in econometrics,
Chapter 14 – Correlation and Simple Regression Math 22 Introductory Statistics.
Learning Objective Chapter 14 Correlation and Regression Analysis CHAPTER fourteen Correlation and Regression Analysis Copyright © 2000 by John Wiley &
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
Managerial Economics Demand Estimation. Scatter Diagram Regression Analysis.
Regression Analysis Week 8 DIAGNOSTIC AND REMEDIAL MEASURES Residuals The main purpose examining residuals Diagnostic for Residuals Test involving residuals.
TODAY we will Review what we have learned so far about Regression Develop the ability to use Residual Analysis to assess if a model (LSRL) is appropriate.
M.Sc. in Economics Econometrics Module I Topic 7: Censored Regression Model Carol Newman.
Tutorial 4 MBP 1010 Kevin Brown. Correlation Review Pearson’s correlation coefficient – Varies between – 1 (perfect negative linear correlation) and 1.
REGRESSION DIAGNOSTICS Fall 2013 Dec 12/13. WHY REGRESSION DIAGNOSTICS? The validity of a regression model is based on a set of assumptions. Violation.
Chapter Thirteen Copyright © 2006 John Wiley & Sons, Inc. Bivariate Correlation and Regression.
Introduction to Biostatistics and Bioinformatics Regression and Correlation.
Multivariate Data Analysis Chapter 2 – Examining Your Data
Psychology 202a Advanced Psychological Statistics November 12, 2015.
Regression Analysis. 1. To comprehend the nature of correlation analysis. 2. To understand bivariate regression analysis. 3. To become aware of the coefficient.
DTC Quantitative Research Methods Regression I: (Correlation and) Linear Regression Thursday 27 th November 2014.
Tutorial 5 Thursday February 14 MBP 1010 Kevin Brown.
Commentary on Chris Genovese’s “Nonparametric inference and the Dark Energy equation of state” Eric Feigelson (Penn State) SCMA IV.
Correlation  We can often see the strength of the relationship between two quantitative variables in a scatterplot, but be careful. The two figures here.
Quantitative Methods. Bivariate Regression (OLS) We’ll start with OLS regression. Stands for  Ordinary Least Squares Regression. Relatively basic multivariate.
1/61: Topic 1.2 – Extensions of the Linear Regression Model Microeconometric Modeling William Greene Stern School of Business New York University New York.
732G21/732G28/732A35 Lecture 3. Properties of the model errors ε 4. ε are assumed to be normally distributed
Lecturer: Ing. Martina Hanová, PhD.. Regression analysis Regression analysis is a tool for analyzing relationships between financial variables:  Identify.
Part II Exploring Relationships Between Variables.
Regression Overview. Definition The simple linear regression model is given by the linear equation where is the y-intercept for the population data, is.
Quantitative Methods Residual Analysis Multiple Linear Regression C.W. Jackson/B. K. Gordor.
Bivariate & Multivariate Regression Analysis
Chapter 10 CORRELATION.
REGRESSION DIAGNOSTIC II: HETEROSCEDASTICITY
The Correlation Coefficient (r)
IB Math SL 21A: Correlation.
Multivariate Regression
Chapter 12: Regression Diagnostics
Chapter 3: TWO-VARIABLE REGRESSION MODEL: The problem of Estimation
CHAPTER fourteen Correlation and Regression Analysis
Charles University Charles University STAKAN III
BIVARIATE REGRESSION AND CORRELATION
Multiple Regression A curvilinear relationship between one variable and the values of two or more other independent variables. Y = intercept + (slope1.
Microeconometric Modeling
The greatest blessing in life is
CH2. Cleaning and Transforming Data
Regression is the Most Used and Most Abused Technique in Statistics
Simple Linear Regression
MULTIVARIATE REGRESSION MODELS
Regression Assumptions
Checking Assumptions Primary Assumptions Secondary Assumptions
Graph Review Skills Needed Identify the relationship in the graph
Microeconometric Modeling
Regression Assumptions
The Correlation Coefficient (r)
Presentation transcript:

Linear regression issues in astronomy Eric Feigelson Summer School in astrostatistics References Isobe, Feigelson, Akritas & Babu, ApJ 364, Feigelson & Babu, ApJ 397,

Structural regression Seeking the intrinsic relationship between two properties without specifying ‘dependent’ and ‘independent’ variables OLS(Y|X) OLS(X|Y) (inverse regr) Four symmetrical regression lines

Analytical formulae for slopes of the 6 OLS lines

Example: Faber-Jackson relation between diameter and stellar velocity dispersion of elliptical galaxies

Heteroscedastic measurement errors in both variables Homoscedastic functional Deeming (Vistas Astr 1968) Fuller “Measurement Error Models” (1987) Heteroscedastic functional York (Can J Phys 1966) ODRPACK Boggs et al. (ACM Trans Math Soft 1990) Heteroscedastic structural BCES (Akritas & Bershady ApJ 1996)

Y = observed data V = measurement errors X = intrinsic variables e = intrinsic scatter Regression model BCES slope estimator BCES slope variance Regression with measurement errors and intrinsic scatter Akritas & Bershady, ApJ 470,

Truncation due to flux limits Econometrics: Tobit & LIMDEP models (Amemiya, Advanced econometrics 1985; Maddala, Limited-dependent & Quantitative Variables in Econometrics 1983) Astronomy: Malmquist bias in Hubble diagram (Deeming, Vistas Astr 1968, Segal, PNAS 1975)

Censoring due to non-detections Correlation coefficients: Generalized Kendall’s  (Brown, Hollander & Korwar 1974) Linear regression with normal residuals: EM Algorithm (Wolynetz Appl Stat 1979) Linear regression with Kaplan-Meier residuals: Buckley & James (Biometrika 1979) Schmitt (ApJ 1985) Presented for astronomy by Isobe, Feigelson & Nelson (ApJ 1986) Implemented in Astronomy Survival Analysis (ASURV) package

Conclusions Bivariate linear regression in astronomy can be surprisingly complex. Pay attention to precise question being asked, and details of situation. Several codes available through Functional vs. structural regression Symmetrical vs. dependent regression Weighting by measurement error Truncation & censoring due to flux limits Other topics not considered here (some covered later in the Summer School): Robust & rank regression techniques to treat outliers Goodness-of-fit, model selection and parsimony Nonlinear regression Multivariate regression