WHY ARE YOU USING THAT REGRESSION? Western Mensurationist Meeting Jim Flewelling July, 2003.

Slides:



Advertisements
Similar presentations
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Advertisements

Kin 304 Regression Linear Regression Least Sum of Squares
FTP Biostatistics II Model parameter estimations: Confronting models with measurements.
6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
11 Simple Linear Regression and Correlation CHAPTER OUTLINE
Linear regression models
The General Linear Model. The Simple Linear Model Linear Regression.
Simple Linear Regression
Lecture 8 Relationships between Scale variables: Regression Analysis
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
Chapter 10 Simple Regression.
Statistics II: An Overview of Statistics. Outline for Statistics II Lecture: SPSS Syntax – Some examples. Normal Distribution Curve. Sampling Distribution.
1 Chapter 3 Multiple Linear Regression Ray-Bing Chen Institute of Statistics National University of Kaohsiung.
SIMPLE LINEAR REGRESSION
Nonparametric, Model-Assisted Estimation for a Two-Stage Sampling Design Mark Delorey Joint work with F. Jay Breidt and Jean Opsomer September 8, 2005.
SIMPLE LINEAR REGRESSION
Analysis of Individual Variables Descriptive – –Measures of Central Tendency Mean – Average score of distribution (1 st moment) Median – Middle score (50.
Applications of Nonparametric Survey Regression Estimation in Aquatic Resources F. Jay Breidt, Siobhan Everson-Stewart, Alicia Johnson, Jean D. Opsomer.
11-1 Empirical Models Many problems in engineering and science involve exploring the relationships between two or more variables. Regression analysis.
Business Statistics - QBM117 Statistical inference for regression.
Correlation and Regression Analysis
Simple Linear Regression and Correlation
Introduction to Regression Analysis, Chapter 13,
Simple Linear Regression Analysis
Variance and covariance Sums of squares General linear models.
Correlation & Regression
Regression and Correlation Methods Judy Zhong Ph.D.
SIMPLE LINEAR REGRESSION
Introduction to Linear Regression and Correlation Analysis
Regression Analysis Regression analysis is a statistical technique that is very useful for exploring the relationships between two or more variables (one.
Inference for regression - Simple linear regression
Statistics & Biology Shelly’s Super Happy Fun Times February 7, 2012 Will Herrick.
+ Chapter 12: Inference for Regression Inference for Linear Regression.
1 Ratio estimation under SRS Assume Absence of nonsampling error SRS of size n from a pop of size N Ratio estimation is alternative to under SRS, uses.
Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.
Application of Stochastic Frontier Regression (SFR) in the Investigation of the Size-Density Relationship Bruce E. Borders and Dehai Zhao.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Chapter 14 Inference for Regression AP Statistics 14.1 – Inference about the Model 14.2 – Predictions and Conditions.
Lecture 8 Simple Linear Regression (cont.). Section Objectives: Statistical model for linear regression Data for simple linear regression Estimation.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
4 Hypothesis & Testing. CHAPTER OUTLINE 4-1 STATISTICAL INFERENCE 4-2 POINT ESTIMATION 4-3 HYPOTHESIS TESTING Statistical Hypotheses Testing.
1 11 Simple Linear Regression and Correlation 11-1 Empirical Models 11-2 Simple Linear Regression 11-3 Properties of the Least Squares Estimators 11-4.
Stat 112: Notes 2 Today’s class: Section 3.3. –Full description of simple linear regression model. –Checking the assumptions of the simple linear regression.
Copyright © 2013, 2009, and 2007, Pearson Education, Inc. Chapter 13 Multiple Regression Section 13.3 Using Multiple Regression to Make Inferences.
Chapter 10 For Explaining Psychological Statistics, 4th ed. by B. Cohen 1 A perfect correlation implies the ability to predict one score from another perfectly.
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
Copyright © 2010 Pearson Education, Inc Chapter Seventeen Correlation and Regression.
Modeling regional variation in the self-thinning boundary line Aaron Weiskittel Sean Garber Hailemariam Temesgen.
Annualized diameter and height growth equations for plantation grown Douglas- fir, western hemlock, and red alder Aaron Weiskittel 1, Sean Garber 1, Greg.
Simple linear regression. What is simple linear regression? A way of evaluating the relationship between two continuous variables. One variable is regarded.
Statistics 350 Lecture 2. Today Last Day: Section Today: Section 1.6 Homework #1: Chapter 1 Problems (page 33-38): 2, 5, 6, 7, 22, 26, 33, 34,
The “Big Picture” (from Heath 1995). Simple Linear Regression.
Simple linear regression. What is simple linear regression? A way of evaluating the relationship between two continuous variables. One variable is regarded.
Perspectives from a REIT Growth and Yield Modeling Architecture and Treatment Response Models – The Rayonier Approach J.P. MCTAGUE Western Mensurationists.
Stats Methods at IC Lecture 3: Regression.
Estimating standard error using bootstrap
Joonghoon Shin Oregon State University
Chapter 4: Basic Estimation Techniques
Modeling in R Sanna Härkönen.
REGRESSION G&W p
BIVARIATE REGRESSION AND CORRELATION
Simple Linear Regression - Introduction
CHAPTER 29: Multiple Regression*
The Regression Model Suppose we wish to estimate the parameters of the following relationship: A common method is to choose parameters to minimise the.
Simple Linear Regression
SIMPLE LINEAR REGRESSION
Statistics II: An Overview of Statistics
SIMPLE LINEAR REGRESSION
Regression Models - Introduction
Presentation transcript:

WHY ARE YOU USING THAT REGRESSION? Western Mensurationist Meeting Jim Flewelling July, 2003

FOCUS POPULATIONS –VARIANCE IN RELATIONSHIPS OBJECTIVES –USE OF REGRESSION TECHNIQUES are SECONDARY

TWO WORLDS SURVEY SAMPLING –Fixed Populations –Objective refers to Population REGRESSION ANALYSIS –Relationships between variables –Objectives refer to individuals or populations

SURVEY SAMPLING Fixed Population. Specified probability-sampling processes. Estimation of population parameters –unbiased estimators.

SURVEY SAMPLING “If we are to infer from sample to population, the selection process is an integral part of the inference.” - Stuart (1984, p. 4)

REGRESSIONS IN SURVEY SAMPLING AUXILIARY INFORMATION (X) –known for population. –Increased precision. MODEL-ASSISTED ESTIMATORS COMMON (Särndal et al.,1992) MODEL-BASED ESTIMATORS

MODEL-ASSISTED SURVEY SAMPLING Ratio of Means Estimator: Asymptotically unbiased, whether or not y proportional to x. Could be used to estimate individual y’s. No claim of unbiasedness here.

MODEL-BASED SURVEY SAMPLING Assumptions from Regression Analysis. –True model –E(e|x) = 0 –Errors are independent. Random selection avoids a source of bias. Inference from regression theory, not the distribution of samples. Theory from Royall (1970).

REGRESSION ANALYSIS Least Squares - Legrendre (1805) and Gauss. Sir Francis Galton (1877, 1885): Offspring of seeds “did not tend to resemble their parent seeds in size, but to be always more mediocre [i.e., more average] than they - to be smaller than the parents, if the parents were large … the mean filial regression towards mediocrity was directly proportional to the parental deviation from it.” (quoted from Draper & Smith)

LEAST SQUARES REGRESSION Var < Var(y)

GEOMETRIC MEAN REGRESSION Preserves Variance Discussion by Ricker (1984)

HEIGHT-AGE CURVES Site Curves (Curtis) Site Index Prediction Functions Geometric Mean Regression Stochastic Differential Equation Height Growth Models Percentile Models

Site Curves and SI Prediction Functions Curtis et al. (1974) Site Curve - Yield table construction –H = f(A, SI). SI Prediction Function - Site Classification –SI = f(A, H).

SITE CURVES, SI PREDICTION, and GMR SI = H (index age) H A = H (age A) 3 Lines: All at mean (H A, SI) Slope =  SI /  HA  { , 1, 1/  } Straight-line assumption valid for bivariate normal.

Stochastic Differential Equation (Garcia, 1979) dH/dt = (b/c)H{(a/H) c -1} –b is plot-specific, (a, c) are global. Integrates to Chapman-Richards. Add Wiener process error to growth. Add measurement errors at intervals. Fit with Maximum Likelihood. It’s a growth-model; also base-age invariant site curve.

Height Growth Model Family of H-A curves. From any one age, predict height difference to next or previous age. Parameters adjusted to minimize errors in predicted growth. (Bonnor et al, 1995), Flewelling et al (2001). Crude, ignores measurement errors, and correlations between periods. Flexible model form. It’s a growth model - attempts to model H-A trajectories of plots. Base-age invariant.

Percentile Models Concept by Pienaar and Clutter (Clutter et al, 1983). Example by Bi (2002). Extends to irregular data. (Flewelling, 1982, unpublished). Current econometrics theory, rich history.

Percentile Models Pienaar and Clutter: Percentiles as a labeling device: “useful in illustrating the fact that index age is not a fundamental or required concept in the use of site index to express site quality.”

Percentile Models, Example Bi et al ( 2002) Temporary plots (age and site assumed orthogonal). H(t) assumed to have normal distribution. Q 0.75 and Q 0.25, fit as functions of t. –methodology from Koenker and Bassett (1978) Mean H(t) fit with weighted regression.

Percentile Model, Irregular data. Sectioned tree data, height every year. Younger ages: full data set. Older ages: reduced data set. Establish tree percentiles at young age. Reassign censored percentiles older ages. Compute (and model) means and standard deviations from heights and percentiles.

Percentile models, econometrics Koenker (2000): wonderful discussion of least squares, alternative methods, and statistical history. Minimization of summed absolute errors dates from 1760’s.

Height-Age Curves. Questions Should height growth models be the same as constant percentile curves? Are regressions from one age to another wanted? Is there any use for an index age other than as a label?

POPULATIONS WHICH PROJECTION IS WANTED?

TREE GROWTH MODELS  DBH Mortality fractions. What ensures that the variance of projected stand table is correct? –Need variance models as constraints? –Different fitting techniques? –Good luck and occasional checking?

RIGHT INDEPENDENT VARIABLES? Regional H-DBH Curves. Biased by Age or position in stand. Alternative: local curves, another variable.

Bayesian Regression Neglected in Forestry? Empirical Bayes used in volume equations (Green and Strawderman, 1985). Taper and volume equations by forest district (McTague, Stansfield and Lan, 1992). Other opportunities?

Bayesian Opportunity Fit y = a 0 + a 1 x 1 + a 2 x 2 + a 3 x 3 + ….. – Often by species or other category. – Coefficients tested and omitted if non- significant. –Or, selected coefficients fit in common for all species. –Bayesian regression or other methods better?

OTHER REGRESSION TECHNIQUES ML with better error characterization. Mixed models. Systems: Seemingly unrelated regression, 2SLS, 3SLS …….. Generally are more efficient, better estimates of parameter variance, possibly avoid some biases. Necessary? Imputation?

SUMMARY What does population look like? What should be described? What techniques allow that?

REFERENCES Bi, H., A.D. Kozek and I.S. Ferguson Quantile-based site index curves: a brief introductory note. Proc of IUFRO Symposium on Statistics and Technology in Forestry, Sept 8-12, 2002 Blacksburg. [ May be a related 2003 paper in J of Agr, Biological, and Environmental Statistics.] Bonnor, G.M., R. J. DeJong, P. Boudewyn and J. Flewelling A guide to the STIM growth model. Nat. Res. Canada. Info Rpt X-353. Clutter, J.L., J.C. Fortson, L.V. Pienaar, G.H. Brister and R.L. Bailey Timber management: a quantitative approach. Krieger Publ., Malamar, FL. 333 p. Curtis, R.O., D.J. Demars, F.R. Herman Which dependent variable is site index - height - age relationships? For. Sci. 20: Draper, N. R. and H. Smith Applied Regression Analysis. Wiley. New York. 706 p. Flewelling, J Dominant height trends for plantations of loblolly pine at the Mississippi/Alabama region of Weyerhaeuser Company. Research Rpt /3. Weyerhaeuser Forestry Research, Hot Springs. (unpublished) Flewelling, J., R. Collier, B. Gonyea, D. Marshall and E. Turnblom Height-age curves for planted stands of Douglas fir, with adjustments for density. SMC Working Paper No. 1, Univ. of WA, Seattle.

REFERENCES Garcia, O., A stochastic Differential Equation Model for height growth of forest stands. Biometrics 39: Green, E. and W.E. Strawderman The use of Bayes/Empirical Bayes Estimation in Individual Tree Volume Equation Development. For. Sci. 31: Koenker, R Galton, Edgeworth, Frisch, and prospects for quantile regression in econometrics. J of Econometrics 95: Koenker, R.W. and G.W. Basset Regression Quantiles. Econometrica 50, McTague, J.P., W.F. Stansfield, Z. Lan Southwestern ponderosa pine, Douglas fir and white fir volume and taper functions. Report to USFS. Northern Arizona University. Ricker, W.E. Computation and uses of central trend lines. Can. J. Zool. 62: Royall, R.M On finite population sampling theory under certain linear regression models. Biometrika 57: Särndal, C., B. Swensson, J. Wretman Model assisted survey sampling. Springer- Verlag, New York. 694 p. Stuart, A The ideas of sampling. Macmillan, New York. 91 p.

COMMENTS? QUESTIONS?