Maximum Entropy spatial modeling with imperfect data.

Slides:

Advertisements

Similar presentations

Mustafa Cayci INFS 795 An Evaluation on Feature Selection for Text Clustering.

Advertisements

Evaluating Classifiers

Evaluation of segmentation. Example Reference standard & segmentation.

© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/ Other Classification Techniques 1.Nearest Neighbor Classifiers 2.Support Vector Machines.

POINT ESTIMATION AND INTERVAL ESTIMATION

Curva ROC figuras esquemáticas Curva ROC figuras esquemáticas Prof. Ivan Balducci FOSJC / Unesp.

Receiver Operating Characteristic (ROC) Curves

Lecture 22: Evaluation April 24, 2010.

PROJECTING THE ENVIRONMENTAL NICHE FOR SUMMERTIME COCCOLITHOPHORE BLOOMS IN THE NORTH ATLANTIC ABSTRACT Coccolithophore blooms are one of the few phytoplankton.

CJT 765: Structural Equation Modeling Class 3: Data Screening: Fixing Distributional Problems, Missing Data, Measurement.

Maxent interface.

Integrating Bayesian Networks and Simpson’s Paradox in Data Mining Alex Freitas University of Kent Ken McGarry University of Sunderland.

The University of Texas at Austin, CS 395T, Spring 2008, Prof. William H. Press 1 Computational Statistics with Application to Bioinformatics Prof. William.

Assessing and Comparing Classification Algorithms Introduction Resampling and Cross Validation Measuring Error Interval Estimation and Hypothesis Testing.

Classification and risk prediction

Model Evaluation Metrics for Performance Evaluation

Darlene Goldstein 29 January 2003 Receiver Operating Characteristic Methodology.

Maximum Entropy Model LING 572 Fei Xia 02/07-02/09/06.

ROC & AUC, LIFT ד"ר אבי רוזנפלד.

Maxent Implements “Maximum Entropy” modeling –Entropy = randomness –Maximizes randomness by removing patterns –The pattern is the response Website with.

CS Bayesian Learning1 Bayesian Learning. CS Bayesian Learning2 States, causes, hypotheses. Observations, effect, data. We need to reconcile.

1 Chapter 20 Two Categorical Variables: The Chi-Square Test.

BASIC STATISTICS: AN OXYMORON? (With a little EPI thrown in…) URVASHI VAID MD, MS AUG 2012.

Chapter 1 Basics of Probability.

Basics of fMRI Inference Douglas N. Greve. Overview Inference False Positives and False Negatives Problem of Multiple Comparisons Bonferroni Correction.

Copyright © 2010, 2007, 2004 Pearson Education, Inc. Section 9-2 Inferences About Two Proportions.

Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.

Topics: Statistics & Experimental Design The Human Visual System Color Science Light Sources: Radiometry/Photometry Geometric Optics Tone-transfer Function.

9th International Symposium on Wild Boar and others Suids, Hannover 2012 Factors influencing wild boar presence in agricultural landscape: a habitat suitability.

Data Analysis 1 Mark Stamp. Topics  Experimental design o Training set, test set, n-fold cross validation, thresholding, imbalance, etc.  Accuracy o.

Ryan DiGaudio Modified from Catherine Jarnevich, Sunil Kumar, Paul Evangelista, Jeff Morisette, Tom Stohlgren Maxent Overview.

Montane Frogs in Rainforest 2013, Marcio et al., Understanding the mechanisms underlying the distribution of microendemic montane frogs (Brachycephalus.

Candidate KBA Identification: Modeling Techniques for Field Survey Prioritization Species Distribution Modeling: approximation of species ecological niche.

1 CS 391L: Machine Learning: Experimental Evaluation Raymond J. Mooney University of Texas at Austin.

CpSc 810: Machine Learning Evaluation of Classifier.

Why Model? Make predictions or forecasts where we don’t have data.

Museum and Institute of Zoology PAS Warsaw Magdalena Żytomska Berlin, 6th September 2007.

Estimating Component Availability by Dempster-Shafer Belief Networks Estimating Component Availability by Dempster-Shafer Belief Networks Lan Guo Lane.

BPS - 5th Ed. Chapter 221 Two Categorical Variables: The Chi-Square Test.

Niches, Interactions and Movements. Calculating a Species Distribution Range Jorge Soberon M. A. Townsend Peterson.

Enrique Martínez-Meyer

Evaluating Results of Learning Blaž Zupan

Chapter 11 Statistical Techniques. Data Warehouse and Data Mining Chapter 11 2 Chapter Objectives  Understand when linear regression is an appropriate.

Computational Intelligence: Methods and Applications Lecture 16 Model evaluation and ROC Włodzisław Duch Dept. of Informatics, UMK Google: W Duch.

Model Evaluation l Metrics for Performance Evaluation –How to evaluate the performance of a model? l Methods for Performance Evaluation –How to obtain.

Remote-sensing and biodiversity in a changing climate Catherine Graham SUNY-Stony Brook Robert Hijmans, UC-Berkeley Lianrong Zhai, SUNY-Stony Brook Sassan.

Ryan DiGaudio Modified from Catherine Jarnevich, Sunil Kumar, Paul Evangelista, Jeff Morisette, Tom Stohlgren Maxent Overview.

July 3 rd, 2014 Charlotte Germain-Aubrey ECOLOGICAL NICHE MODELING: PRACTICAL.

Professor William H. Press, Department of Computer Science, the University of Texas at Austin1 Opinionated in Statistics by Bill Press Lessons #50 Binary.

Lecture 3: MLE, Bayes Learning, and Maximum Entropy

Testing Predictive Performance of Ecological Niche Models A. Townsend Peterson, STOLEN FROM Richard Pearson.

BPS - 5th Ed. Chapter 221 Two Categorical Variables: The Chi-Square Test.

 1 Species Richness 5.19 UF Community-level Studies Many community-level studies collect occupancy-type data (species lists). Imperfect detection.

Downloading the MAXENT Software

Use of Maxent for predictive habitat mapping of CWC in the Bari canyon Bargain Annaëlle Foglini Federica, Bonaldo Davide, Pairaud Ivane & Fabri Marie-Claire.

Biostatistics Class 2 Probability 2/1/2000.

7. Performance Measurement

Lecture 1.31 Criteria for optimal reception of radio signals.

Introduction to species distribution Models

Probability and Statistics

Chapter 7. Classification and Prediction

Evaluating Results of Learning

Active Learning Lecture Slides

Data Mining Classification: Alternative Techniques

Combining Species Occupancy Models and Boosted Regression Trees

Chapter 9 Hypothesis Testing.

Review of Hypothesis Testing

Pattern Recognition and Image Analysis

More on Maxent Env. Variable importance:

Presentation transcript:

Maximum Entropy spatial modeling with imperfect data

Spatial modeling with imperfect data Example: species distribution models What’s wrong with available data and why we want to use it anyway Comparison of different approaches Maximum Entropy approach: Maxent Model evaluation: AUC

Building models with imperfect data Example: constructing (or reconstructing) species distributions Paleoecology, conservation, speciation, invasion Often data is presence-only (Elith et al. 2006): – Museum records – Herbaria – Fossil locals – Reported sightings Sparse data Spatial bias Temporal bias Uncertainty in absence records

Building models with imperfect data Elith et al. 2006: Evaluated methods for modeling species distributions using presence-only data Compared 16 methods for 226 species in 6 geographic regions Models were built using presence-only data and climate and environmental layers Evaluated against independent presence- absence datasets

Building models with imperfect data Elith et al. 2006: Models that use only occurrences: – Envelope BIOCLIM – Distance based DOMAIN, LIVES Models that characterize background (psuedo-absence): – GLM, GAM, MARS, GARP, MAXENT, BRT, GDM, MARS- COMM Some models implemented in several ways

Building models with imperfect data Elith et al. 2006:

Building models with imperfect data Elith et al. 2006:

Building models with imperfect data Elith et al. 2006:

Maxent: A maximum entropy approach Occurrence is a Lat-Long pair denoting location of observation\collection Layers that inform the model are from same geographic area in raster format Model represents approximation of the realized niche for species Assumed that the realized niche and the fundamental niche for the species coincide Increasing sampling in larger geographic area (and thus including more variation in environmental conditions encountered by the species) may increase the fraction of fundamental niche represented by occurrences

Fundamental Niche

Realized Niche

Fundamental Niche

Maxent methods The approximation of an unknown probability distribution should satisfy any known constraints, and subject to those constraints should have maximum entropy (Jaynes, 1957) Maximum Entropy is an epistemic approach to Bayes’ rule The Monkey Example: – A team of monkeys are employed to create images by throwing balls at a grid of bins – Every so often the grid is removed and replaced by a new one – Eventually the monkeys will create multiple copies of each possible arrangement

Maxent methods The Monkey Example (cont.): – Given some evidence about true grid some of the monkey’s grids can be ruled out – Those left constitute the feasible set, and that which appears most often is a reasonable choice – Assuming the monkeys are not biased, this choice is consistent with the data but noncommittal about information we do not have

Maxent methods π is the unknown probability distribution over a finite set X (the set of pixels or points in the study area) The distribution defines a non-negative probability π(x) to each point x These probabilities sum to 1 Best approximation of π is the probability distribution π(hat) The entropy of π(hat) is:

Maxent methods Constraints on π(hat) for layers informing model: – Linear features- continuous variables should be close to their observed values (their mean at occurrence localities) – Quadratic features- variance of continuous variables should be close to observed values – Product features- covariance of two continuous variables should be close to observed values – Threshold feature- proportion of model that has values above a threshold for a continuous variable should be close to observed proportion – Binary feature- the proportion of each category in a categorical feature should be close to the observed proportions

Maxent methods Regularization parameter B j governs how close the constraints need to match the observed value (without regularization they must be equal) Program allows a user-specified proportion of occurrence locals to be reserved from model training for model testing absences can be randomly selected (pseudo-absences for presence only) or specified by user (if P-A data available) Model will run for either a set number of iterations or until the gain from each iteration falls below a set threshold

Maxent example: brown-throated three-toed sloth, Bradypus variegatus

Log contribution of each variable to the raw prediction value

Maxent example : brown-throated three-toed sloth, Bradypus variegatus

Other Maxent Applications (Siva 1990)

Model Evaluation Area under ROC curve (AUC) Receiver Operating Characteristic Contingency Table: Actual Value (Data) Predicted Outcome (Model) Presence (pos) Absence (neg) Presence (pos) True Positive (TP) False Positive (FP) Absence (neg) False Negative (FN) True Negative (TN)

Model Evaluation Area under ROC curve (AUC) Sensitivity- True Positive Rate (TPR) Actual Value (Data) Predicted Outcome (Model) Presence (pos) Absence (neg) Presence (pos) True Positive (TP) False Positive (FP) Absence (neg) False Negative (FN) True Negative (TN)

Model Evaluation Area under ROC curve (AUC) Specificity- True Negative Rate (TNR) Actual Value (Data) Predicted Outcome (Model) Presence (pos) Absence (neg) Presence (pos) True Positive (TP) False Positive (FP) Absence (neg) False Negative (FN) True Negative (TN)

Model Evaluation Area under ROC curve (AUC) Specificity- True Negative Rate (TNR) ROC is Sensitivity by (1- Specificity)=(FPR) Actual Value (Data) Predicted Outcome (Model) Presence (pos) Absence (neg) Presence (pos) True Positive (TP) False Positive (FP) Absence (neg) False Negative (FN) True Negative (TN)

Model Evaluation Area under ROC curve (AUC) An example: TPR = 63/100 =.63 FPR = 28/100 =.28 Actual Value (Data) Predicted Outcome (Model) P=100N=100 P=91TP=63FP=28 N=109FN=37TN=72

Image from wikipedia A P=100N=100 P=91TP=63FP=28 N=109FN=37TN=72 B P=100N=100 P=154TP=77FP=77 N=46FN=23TN=23 C P=100N=100 P=112TP=24FP=88 N=88FN=76TN=12

Image from wikipedia A P=100N=100 P=91TP=63FP=28 N=109FN=37TN=72 B P=100N=100 P=154TP=77FP=77 N=46FN=23TN=23 C P=100N=100 P=112TP=24FP=88 N=88FN=76TN=12 C’ P=100N=100 P=112TP=88FP=24 N=88FN=12TN=76

TPTN FP FN

TPTN FP FN TP TN FN

TPTN FP FN TP TNFP

AUC > 0.5 Higher Predictive Power AUC = 0.5 Random Chance AUC < 0.5 Worse than Random False Positive Rate True Positive Rate 1 1 False Positive Rate True Positive Rate 1 0 False Positive Rate True Positive Rate Model Evaluation Area under ROC curve (AUC)

References: Elith, J., Graham, C. H., Anderson, R. P., Dudı´k, M., Ferrier, S., Guisan, A., Hijmans, R. J.,Huettmann, F., Leathwick, J. R., Lehmann, A., Li, J., Lohmann, L. G., Loiselle, B. A., Manion, G.,Moritz, C., Nakamura, M., Nakazawa, Y., Overton, J. McC., Peterson, A. T., Phillips, S. J.,Richardson, K. S., Scachetti-Pereira, R., Schapire, R. E., Sobero´n, J., Williams, S., Wisz, M. S. and Zimmermann, N. E Novel methods improve prediction of species’ distributions from occurrence data. Ecography 29: Jaynes, E.T., Information theory and statistical mechanics. Phys. Rev. 106, Lobo, J. M., Jiménez-Valverde, A. and Real R AUC a misleading measure of the performance of predictive distribution models. Global Ecology and Biogeography Phillips, S. J., Dudik, M. and Schapire, R. E A maximum entropy approach to species distribution modeling. Proceedings of the 21 st International Conference on Machine Learning, Banff, Canada Phillips, S. J., Anderson, R. P. and Schapire, R. E Maximum entropy modeling of species geographic distributions. Ecological Modeling 190: Siva, D. S., Bayesian Inductive Inference Maximum Entropy & Neutron Scattering. Los Alamos Science, Summer: 180 – 206 Maxent program website (its free):