Downscaling of European land use projections for the ALARM toolkit Joint work between UCL : Nicolas Dendoncker, Mark Rounsevell, Patrick Bogaert BioSS:

Slides:



Advertisements
Similar presentations
Pattern Recognition and Machine Learning
Advertisements

Exploratory methods to analyse output from complex environmental models Exploratory methods to analyse output from complex environmental models Adam Butler,
Classification. Introduction A discriminant is a function that separates the examples of different classes. For example – IF (income > Q1 and saving >Q2)
INTRODUCTION TO MACHINE LEARNING Bayesian Estimation.
Unsupervised Learning
Data Mining Methodology 1. Why have a Methodology  Don’t want to learn things that aren’t true May not represent any underlying reality ○ Spurious correlation.
TRIM Workshop Arco van Strien Wildlife statistics Statistics Netherlands (CBS)
K Means Clustering , Nearest Cluster and Gaussian Mixture
Chapter 4: Linear Models for Classification
Paper Discussion: “Simultaneous Localization and Environmental Mapping with a Sensor Network”, Marinakis et. al. ICRA 2011.
Using the Maryland Biological Stream Survey Data to Test Spatial Statistical Models A Collaborative Approach to Analyzing Stream Network Data Andrew A.
2D1431 Machine Learning Boosting.
Fitting models to data. Step 5) Express the relationships mathematically in equations Step 6)Get values of parameters Determine what type of model you.
Expectation-Maximization
Maximum Entropy Model LING 572 Fei Xia 02/07-02/09/06.
Ordinary Kriging Process in ArcGIS
Radial Basis Function Networks
1 Bayesian methods for parameter estimation and data assimilation with crop models Part 2: Likelihood function and prior distribution David Makowski and.
Mapping the future Converting storylines to maps Nasser Olwero GMP, Bangkok April
1 Naïve Bayes Models for Probability Estimation Daniel Lowd University of Washington (Joint work with Pedro Domingos)
Machine Learning1 Machine Learning: Summary Greg Grudic CSCI-4830.
A Decision Support Demonstrator for Abiotic Damage to Trees, using a WWW Interface Ari Talkkari Roger Dunham.
Center for Watershed Protection USDA Forest Service, Northeastern Area, State and Private Forestry How to estimate future forest cover in a watershed.
COMMON EVALUATION FINAL PROJECT Vira Oleksyuk ECE 8110: Introduction to machine Learning and Pattern Recognition.
AQUIFER VULNERABILITY ASSESSMENT: PROJECT REVIEW COLUMBIA COUNTY June 11, 2009 Columbia County, Florida Advanced GeoSpatial Inc. Alan Baker, P.G. James.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Deterministic vs. Random Maximum A Posteriori Maximum Likelihood Minimum.
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 3: LINEAR MODELS FOR REGRESSION.
Modeling the Greenhouse gases of cropland/grassland At European scale N. Viovy, S. Gervois, N. Vuichard, N. de Noblet-Ducoudré, B. Seguin, N. Brisson,
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
Museum and Institute of Zoology PAS Warsaw Magdalena Żytomska Berlin, 6th September 2007.
17 May 2007RSS Kent Local Group1 Quantifying uncertainty in the UK carbon flux Tony O’Hagan CTCD, Sheffield.
Center for Radiative Shock Hydrodynamics Fall 2011 Review Assessment of predictive capability Derek Bingham 1.
Integration of biosphere and atmosphere observations Yingping Wang 1, Gabriel Abramowitz 1, Rachel Law 1, Bernard Pak 1, Cathy Trudinger 1, Ian Enting.
Statistical approach Statistical post-processing of LPJ output Analyse trends in global annual mean NPP based on outputs from 19 runs of the LPJ model.
Regional Air Quality Plans: assessment of plan efficacy through integrated assessment modelling and other local models G. Vialetto 1, G. Calori 3, I. D’Elia.
BioSS reading group Adam Butler, 21 June 2006 Allen & Stott (2003) Estimating signal amplitudes in optimal fingerprinting, part I: theory. Climate dynamics,
Slide 1 Marc Kennedy, Clive Anderson, Anthony O’Hagan, Mark Lomas, Ian Woodward, Andreas Heinemayer and John Paul Gosling Quantifying uncertainty in the.
Forcing climatic factors Biodiversity & water quality Socio/Econ/Cult state Social perception | AQUATIC ECOSYSTEM SERVICES Methods (a) Original SAFER Methodologies.
1  The Problem: Consider a two class task with ω 1, ω 2   LINEAR CLASSIFIERS.
Remote-sensing and biodiversity in a changing climate Catherine Graham SUNY-Stony Brook Robert Hijmans, UC-Berkeley Lianrong Zhai, SUNY-Stony Brook Sassan.
Eurostat Accuracy of Results of Statistical Matching Training Course «Statistical Matching» Rome, 6-8 November 2013 Marcello D’Orazio Dept. National Accounts.
Attractiveness Mapping Modeling Land Use Preference.
Exploring Microsimulation Methodologies for the Estimation of Household Attributes Dimitris Ballas, Graham Clarke, and Ian Turton School of Geography University.
Welcome to the PRECIS training workshop
Machine Learning 5. Parametric Methods.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: MLLR For Two Gaussians Mean and Variance Adaptation MATLB Example Resources:
Impacts of Landuse Management and Climate Change on Landslides Susceptibility over the Olympic Peninsula of Washington State Muhammad Barik and Jennifer.
Multiple Sequence Alignment Vasileios Hatzivassiloglou University of Texas at Dallas.
Naïve Bayes Classifier April 25 th, Classification Methods (1) Manual classification Used by Yahoo!, Looksmart, about.com, ODP Very accurate when.
What if? prospects based on Corilis Alex Oulton, Manuel Winograd Ronan Uhel & Jean-Louis Weber LAND QUICK SCAN INTERFACE: Challenges and needs Internal.
Neural Networks The Elements of Statistical Learning, Chapter 12 Presented by Nick Rizzolo.
Modelling with CORILIS Change in land cover patterns, landscape ecological potential & “temperatures” on N2000, river basins and UMZ Wire frame and examples.
Deforestation Part 2: Top-down Modelling Pedro R. Andrade Münster, 2013.
Introduction to emulators Tony O’Hagan University of Sheffield.
Adam Butler, Stijn Bierman & Glenn Marion Biomathematics & Statistics Scotland CEH Bush, April 2008 ALARM: a statistical perspective.
Ch 1. Introduction Pattern Recognition and Machine Learning, C. M. Bishop, Updated by J.-H. Eom (2 nd round revision) Summarized by K.-I.
Unsupervised Learning Part 2. Topics How to determine the K in K-means? Hierarchical clustering Soft clustering with Gaussian mixture models Expectation-Maximization.
Migration Modelling using Global Population Projections
SLCP Benefits Toolkit:
Chapter 7. Classification and Prediction
Ruth Doherty, Edinburgh University Adam Butler & Glenn Marion, BioSS
Copenhagen 31 January 2008 Wind energy potential in Europe Methodology Hans Eerens MNP Netherlands.
Downscaling sea level rise in the Mediterranean Sea under different future climate change scenarios ( ) Kareem M. Tonbol (Ph.D.) Assistant Professor.
Adam Butler, Biomathematics and Statistics Scotland
Adam Butler & Glenn Marion, Biomathematics & Statistics Scotland •
Generalized Spatial Dirichlet Process Models
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
EC Workshop on European Water Scenarios Brussels 30 June 2003
A new way of looking at emission uncertainties
Presentation transcript:

Downscaling of European land use projections for the ALARM toolkit Joint work between UCL : Nicolas Dendoncker, Mark Rounsevell, Patrick Bogaert BioSS: Adam Butler, Glenn Marion Athens ALARM meeting, January 2007

Overview Current day land use data are available at a relatively fine spatial resolution (e.g. CORINE), but land use projections within ALARM are generated at a much coarser resolution There is a need to convert these projections onto finer spatial scales in a way that properly reflects the statistical properties of high-resolution land use maps Using the downscaling method developed by Dendoncker et al. (2006) we aim to generate CORINE-scale projections of European land use under the ALARM scenarios

Assumptions Downscaling introduces additional error into land use projections - the unavoidable price of looking at a high spatial resolution The method relies on the assumption that the overall frequencies associated with different land types will change in the future, but that the spatial structure of the landspace will not change Current day land use is assumed to be known without error It is explicitly assumed that urban areas will remain urban

Outputs Land use maps at a CORINE scale (250 x 250m) for years under each of the ALARM scenarios, in terms of four broad land use types: urban, arable, grassland & forestry Maps could potentially be adapted to provide information on more detailed land classes (e.g. forest & grassland types, individual crops) – but this would require additional assumptions &/or need to make use of additional data Generic tools for applying land use downscaling to other data, & training materials to illustrate how the downscaling methods work

Some potential uses …as projected environmental data for local field studies e.g. FSN …as fine-scale inputs to mechanistic ecological models e.g. LPJ …as land use inputs for climate envelope analyses …as a resource for future ecological research, via the toolkit

Unresolved issues How can we best deal with land use classes that are only present in future scenarios e.g. surplus land, biofuels? How can we best deal with protected areas e.g. NATURA sites? How should we visualise downscaled land use maps, and how can we best incorporate these into the toolkit? Should we try to quantify & represent the uncertainties involved in land use projection? If so, how?

Example: Luxembourg

Statistical methodology Developed at UCL by Dendoncker et al. (2006): 1Fit a multinomial autologistic regression model to current CORINE land use data, which is at high spatial resolution 2Use ALARM scenarios data to calculate the marginal probabilities that will be associated with different land use classes in future; ensure that these vary smoothly over space 3Combine these sources of information using Bayes Theorem, in order to estimate the conditional probabilities associated with each of the land use classes at high spatial resolution 4Take the projected land class for a CORINE cell to be the class that has the highest conditional probability for that cell

1. Current land use Data: x ik = 1 if CORINE cell i has land use class k, 0 otherwise Model: we assume the probability that cell i belongs to class k conditionally upon the values of x Ik for all other cells I is where n ik denotes the number of cells in the neighbourhood of cell i that belong to class k and where  k are unknown parameters The marginal probability associated with class k is equal to

2. Future land use Data: f jk = fraction of ALARM cell j that is projected to have land use class k (for a particular future year in a particular scenario) We assume the marginal probability that CORINE cell i will have land class k in future is equal to where d ij is the distance between the midpoints of cells i and j Using a weighted sum ensures smoothness; the value of q controls how smooth we would like the probability surface to be If cell i has the same midpoint as ALARM cell j then m ik = f jk

3. Bayes theorem Using Bayes Theorem we can calculate the future conditional probability that CORINE cell i belongs to land use class k as: For each cell i we need to rescale the C ik so that they sum to one, & so are valid probabilities; however this means that the marginal probabilities will no longer be equal to the values m ik that we computed using the ALARM scenario data We can use an iterative procedure to ensure that the rescaled conditional probabilities also respect these marginal probabilities; we alternate between: until the marginal probabilities have approximately converged to m ik after T steps

4. Prediction Finally, we predict that CORINE cell i will belong to the class that has the highest associated conditional probability, so that:

Computation The procedure is not inherently expensive to run, but the vast size of the baseline CORINE dataset means that computational issues will be the key technical problem in applying it at a pan- European scale Currently implemented using a mixture of: SAS, to fit the multinomial autologistic regression model matlab, for the remaining steps We are looking at the feasability of porting the code to R, with most of the heavy internal computation being done in Fortran90, so that it could easily be integrated into the toolkit Could calculations be done online, via the ALARM map portal?