Feiran Jiao and Barbara Monaco DETERMINING EFFECTS OF HABITAT AND TIME ON THE DIETS OF ANCIENT NATIVE AMERICANS.

Slides:



Advertisements
Similar presentations
Simple linear models Straight line is simplest case, but key is that parameters appear linearly in the model Needs estimates of the model parameters (slope.
Advertisements

Computational Statistics. Basic ideas  Predict values that are hard to measure irl, by using co-variables (other properties from the same measurement.
Inference for Regression
Analysis of variance (ANOVA)-the General Linear Model (GLM)
Regression Analysis Once a linear relationship is defined, the independent variable can be used to forecast the dependent variable. Y ^ = bo + bX bo is.
Factorial and Mixed Factor ANOVA and ANCOVA
Generalized Linear Models (GLM)
Some Terms Y =  o +  1 X Regression of Y on X Regress Y on X X called independent variable or predictor variable or covariate or factor Which factors.
Chapter 13 Multiple Regression
Multiple Regression Predicting a response with multiple explanatory variables.
LINEAR REGRESSION: Evaluating Regression Models Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Assumptions for Linear Regression Evaluating a Regression Model.
LINEAR REGRESSION: Evaluating Regression Models. Overview Standard Error of the Estimate Goodness of Fit Coefficient of Determination Regression Coefficients.
Lecture 18: Thurs., Nov. 6th Chapters 8.3.2, 8.4, Outliers and Influential Observations Transformations Interpretation of log transformations (8.4)
Basics of ANOVA Why ANOVA Assumptions used in ANOVA
Part I – MULTIVARIATE ANALYSIS
Chapter 12 Multiple Regression
Class 5: Thurs., Sep. 23 Example of using regression to make predictions and understand the likely errors in the predictions: salaries of teachers and.
Lesson #32 Simple Linear Regression. Regression is used to model and/or predict a variable; called the dependent variable, Y; based on one or more independent.
Lecture 24: Thurs. Dec. 4 Extra sum of squares F-tests (10.3) R-squared statistic (10.4.1) Residual plots (11.2) Influential observations (11.3,
Copyright (c) Bani K. Mallick1 STAT 651 Lecture #13.
Copyright (c) Bani K. Mallick1 STAT 651 Lecture # 12.
This Week Continue with linear regression Begin multiple regression –Le 8.2 –C & S 9:A-E Handout: Class examples and assignment 3.
Business Statistics - QBM117 Statistical inference for regression.
Two-Way Analysis of Variance STAT E-150 Statistical Methods.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 13-1 Chapter 13 Introduction to Multiple Regression Statistics for Managers.
Multiple Linear Regression Response Variable: Y Explanatory Variables: X 1,...,X k Model (Extension of Simple Regression): E(Y) =  +  1 X 1 +  +  k.
Chapter 12: Analysis of Variance
Lecture 15 Basics of Regression Analysis
HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2010 by Hawkes Learning Systems/Quant Systems, Inc. All rights reserved. Chapter 14 Analysis.
© 2002 Prentice-Hall, Inc.Chap 14-1 Introduction to Multiple Regression Model.
Analysis of Covariance Harry R. Erwin, PhD School of Computing and Technology University of Sunderland.
 Combines linear regression and ANOVA  Can be used to compare g treatments, after controlling for quantitative factor believed to be related to response.
BPS - 3rd Ed. Chapter 211 Inference for Regression.
Hydrologic Modeling: Verification, Validation, Calibration, and Sensitivity Analysis Fritz R. Fiedler, P.E., Ph.D.
OPIM 303-Lecture #8 Jose M. Cruz Assistant Professor.
23-1 Analysis of Covariance (Chapter 16) A procedure for comparing treatment means that incorporates information on a quantitative explanatory variable,
Chapter 14 Introduction to Multiple Regression
Regression Analysis. Scatter plots Regression analysis requires interval and ratio-level data. To see if your data fits the models of regression, it is.
Regression For the purposes of this class: –Does Y depend on X? –Does a change in X cause a change in Y? –Can Y be predicted from X? Y= mX + b Predicted.
Soc 3306a Lecture 9: Multivariate 2 More on Multiple Regression: Building a Model and Interpreting Coefficients.
Xuhua Xia Polynomial Regression A biologist is interested in the relationship between feeding time and body weight in the males of a mammalian species.
MBP1010H – Lecture 4: March 26, Multiple regression 2.Survival analysis Reading: Introduction to the Practice of Statistics: Chapters 2, 10 and 11.
Chapter 10: Analyzing Experimental Data Inferential statistics are used to determine whether the independent variable had an effect on the dependent variance.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
MGS3100_04.ppt/Sep 29, 2015/Page 1 Georgia State University - Confidential MGS 3100 Business Analysis Regression Sep 29 and 30, 2015.
Review of Building Multiple Regression Models Generalization of univariate linear regression models. One unit of data with a value of dependent variable.
BPS - 3rd Ed. Chapter 161 Inference about a Population Mean.
Slide 1 DSCI 5340: Predictive Modeling and Business Forecasting Spring 2013 – Dr. Nick Evangelopoulos Lecture 2: Review of Multiple Regression (Ch. 4-5)
Regression Analysis © 2007 Prentice Hall17-1. © 2007 Prentice Hall17-2 Chapter Outline 1) Correlations 2) Bivariate Regression 3) Statistics Associated.
Multiple Regression. Simple Regression in detail Y i = β o + β 1 x i + ε i Where Y => Dependent variable X => Independent variable β o => Model parameter.
Copyright © 2011 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Model Building and Model Diagnostics Chapter 15.
Environmental Modeling Basic Testing Methods - Statistics III.
Chapter 22: Building Multiple Regression Models Generalization of univariate linear regression models. One unit of data with a value of dependent variable.
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 14-1 Chapter 14 Introduction to Multiple Regression Statistics for Managers using Microsoft.
ANOVA, Regression and Multiple Regression March
Heart Disease Example Male residents age Two models examined A) independence 1)logit(╥) = α B) linear logit 1)logit(╥) = α + βx¡
Statistics for Managers Using Microsoft Excel, 5e © 2008 Prentice-Hall, Inc.Chap 14-1 Statistics for Managers Using Microsoft® Excel 5th Edition Chapter.
Chapter 9 Minitab Recipe Cards. Contingency tests Enter the data from Example 9.1 in C1, C2 and C3.
Maths Study Centre CB Open 11am – 5pm Semester Weekdays
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
BPS - 5th Ed. Chapter 231 Inference for Regression.
The “Big Picture” (from Heath 1995). Simple Linear Regression.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 10: Comparing Models.
Stats Methods at IC Lecture 3: Regression.
12 Inferential Analysis.
Joanna Romaniuk Quanticate, Warsaw, Poland
12 Inferential Analysis.
2k Factorial Design k=2 Ex:.
Species diversity: rarefaction, evenness and indices
Presentation transcript:

Feiran Jiao and Barbara Monaco DETERMINING EFFECTS OF HABITAT AND TIME ON THE DIETS OF ANCIENT NATIVE AMERICANS

Specific Goals: Identify significant spatial (geography, habitat) and temporal variability in prey species use, as well as any association between time and space patterning Evaluate how well variability in data is explained by cultural (pop. changes, landuse practices, overhunting, new technology, etc.) and/or natural (climate change, vegetation differences) factors We focused on the first of Matt’s goals and looked at the variables Habitat (mountains, plains, alluvial valley) and Time Group (Paleoindian, Archaic, Woodlands, Late Prehistoric) to determine whether or not they significantly affect various responses regarding dietary habits. CLIENT’S ORIGINAL GOALS

 Simpson’s Index (D):[0,1] Measure of Evenness where  Margalef’s Diversity Index (0, ∞) Simple species richness index that attempts to compensate for sampling effects  Large Body Size [0,1] Proportion of bones that were classified as belonging to a large animal (mammoth, buffalo) Our data initially contained measures on three types of Function Groups (Kill, Camp, and Other). To match with Matt’s discipline, we eliminated the Kill and Other sites, analyzing only the Camp sites. RESPONSES

 Our initial analysis was to perform a Two-Way ANOVA for the three responses.  However, as you can see from the residual plots, the assumptions for the ANOVA are violated. INITIAL ANALYSIS

FINAL RESULTS  Since the assumptions for the Two-Way ANOVA using the original data were clearly violated we attempted to transform the data:  Simpson’s Index:  Margalef’s Diversity Index:  We could find no transformations that improved the residual plots enough that we could trust the p-values and hypothesis performed by the Two-Way ANOVA for Large Body Size

Anova Table (Type III tests) Response: neg.ln.D Sum Sq Df F value Pr(>F) (Intercept) as.factor(Time.Group) as.factor(Habitat) as.factor(Time.Group):as.factor(Habitat) Residuals We have a significant interaction between Time Group and Habitat, thus our next step is to look at contrasts and interaction plots between these twelve group means. TRANSFORMED SIMPSON’S INDEX

Parameter Estimate Error t Value Pr > |t| valleyPaleo - mountainPaleo valleyPaleo - plainsPaleo mountainPaleo - plainsPaleo valleyArchaic - mountainArchaic valleyArchaic - plainsArchaic mountainArchaic - plainsArchaic valleyWood - mountainWood valleyWood - plainsWood mountainWood - plainsWood valleyLate - mountainLate valleyLate - plainsLate mountainLate - plainsLate Since we are making 12 comparisons, we need to adjust the cut- off alpha level using a Bonferroni correction. So we will take alpha and divide it by 12, so the new cut-off will be alpha= Thus the only comparisons that are significantly different are Valley – Plains (Paleoindian), and Mountain-Plains(Paleoindian). SAS CONTRASTS

Anova Table (Type III tests) Response: sqrt.marg Sum Sq Df F value Pr(>F) (Intercept) <0.001 as.factor(Time.Group) as.factor(Habitat) as.factor(Time.Group):as.factor(Habitat) Residuals We have a significant interaction between Time Group and Habitat, thus our next step is to look at contrasts and interaction plots between these twelve group means. TRANSFORMED MARGALEF’S INDEX

Standard Parameter Estimate Error t Value Pr > |t| valleyPaleo - mountainPaleo valleyPaleo - plainsPaleo mountainPaleo - plainsPaleo <.0001 valleyArchaic - mountainArchaic valleyArchaic - plainsArchaic mountainArchaic - plainsArchaic valleyWood - mountainWood valleyWood - plainsWood mountainWood - plainsWood valleyLate - mountainLate valleyLate - plainsLate mountainLate - plainsLate Since we are making 12 comparisons, we need to adjust the cut- off alpha level using a Bonferroni correction. So we will take alpha and divide it by 12, so the new cut-off will be alpha= Thus the only comparisons that are significantly different are Valley – Plains (Paleoindian), and Mountain-Plains(Paleoindian), which are the same significant difference that we found for the transformed Simpson’s Index. SAS CONTRASTS

Since the Large Body Size variable had such a large proportions of 0’s and 1’s, it appeared to follow a Beta Density. We plotted the density histograms for each of the twelve group means (Habitat*Time Group) and fitted a Beta density to the data. We excluded the Woodlands since we had so few observations for each Habitat and felt that would not give an accurate fit to the density. We also dropped Plains/Rolling Hills for a similar reason. LARGE BODY SIZE

BETA REGRESSION

Since Beta Regression is not a commonly used (if ever) in Matt’s discipline, if he decides to move on with the Large Body Size analysis we have proposed then this would require additional work and collaboration between him and Feiran. NEXT STEPS

Hill, M. E. (2008). Variation in paleoindian fauna use on the great plains and rocky mountains of north america. Quaternary International, 191, Magurran, A. E. (2004). Measuring Biological Diversity. Blackwell Science Ltd. REFERENCES

QUESTIONS?