A new tool for fundamental niche modelling Renato De Giovanni Centro de Referência em Informação Ambiental, CrIA.

Slides:



Advertisements
Similar presentations
MicroKernel Pattern Presented by Sahibzada Sami ud din Kashif Khurshid.
Advertisements

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
TSpaces Services Suite: Automating the Development and Management of Web Services Presenter: Kevin McCurley IBM Almaden Research Center Contact: Marcus.
Centro de Referência em Informação Ambiental, CRIA Sidnei de Souza Abril 2006 mapcria web service.
SpeciesLink The Brazilian experience on setting up a network Renato De Giovanni Centro de Referência em Informação Ambiental, CrIA.
OpenCV Introduction Hang Xiao Oct 26, History  1999 Jan : lanched by Intel, real time machine vision library for UI, optimized code for intel 
GARP Genetic Algorithm for Rule-set Production
T-FLEX DOCs PLM, Document and Workflow Management.
1 Richard White Design decisions: architecture 1 July 2005 BiodiversityWorld Grid Workshop NeSC, Edinburgh, 30 June - 1 July 2005 Design decisions: architecture.
Maxent interface.
 Introduction Originally developed by Open Software Foundation (OSF), which is now called The Open Group ( Provides a set of tools and.
CIM2564 Introduction to Development Frameworks 1 Overview of a Development Framework Topic 1.
SpeciesLink A System for integrating distributed primary biodiversity data Vanderlei Perez Canhos Centro de Referência em Informação Ambiental, CrIA.
1 Location-Based Services Using GSM Cell Information over Symbian OS Final Year Project LYU0301 Mok Ming Fai (mfmok1) Lee Kwok Chau (leekc1)
October 7, 2010Neural Networks Lecture 10: Setting Backpropagation Parameters 1 Creating Data Representations On the other hand, sets of orthogonal vectors.
Face Processing System Presented by: Harvest Jang Group meeting Fall 2002.
.NET Mobile Application Development Remote Procedure Call.
November 2011 At A Glance GREAT is a flexible & highly portable set of mission operations analysis tools that increases the operational value of ground.
Introduction to the course January 9, Points to Cover  What is GIS?  GIS and Geographic Information Science  Components of GIS Spatial data.
L EC. 01: J AVA FUNDAMENTALS Fall Java Programming.
DIRAC API DIRAC Project. Overview  DIRAC API  Why APIs are important?  Why advanced users prefer APIs?  How it is done?  What is local mode what.
.NET, and Service Gateways Group members: Andre Tran, Priyanka Gangishetty, Irena Mao, Wileen Chiu.
Gravity Control™: Is a new generation graphic user interface for searching, sorting and managing large amounts of data from different sources. Makes interaction.
ROOT: A Data Mining Tool from CERN Arun Tripathi and Ravi Kumar 2008 CAS Ratemaking Seminar on Ratemaking 17 March 2008 Cambridge, Massachusetts.
Fall, Privacy&Security - Virginia Tech – Computer Science Click to edit Master title style Design Extensions to Google+ CS6204 Privacy and Security.
4-1 INTERNET DATABASE CONNECTOR Colorado Technical University IT420 Tim Peterson.
INCOFISH WP3 - Campinas, April 2006 WEB Tools and Data Cleaning Alexandre Marino Centro de Referência em Informação Ambiental, CrIA.
CS 390- Unix Programming Environment CS 390 Unix Programming Environment Topics to be covered: Distributed Computing Fundamentals.
Lecture 15 Introduction to Web Services Web Service Applications.
Pipelines and Scientific Workflows with Ptolemy II Deana Pennington University of New Mexico LTER Network Office Shawn Bowers UCSD San Diego Supercomputer.
Chapter 6 SAS ® OLAP Cube Studio. Section 6.1 SAS OLAP Cube Studio Architecture.
A performance evaluation approach openModeller: A Framework for species distribution Modelling.
Real-Time Cyber Physical Systems Application on MobilityFirst Winlab Summer Internship 2015 Karthikeyan Ganesan, Wuyang Zhang, Zihong Zheng.
Centro de Referência em Informação Ambiental, CRIA Dora Ann Lange Canhos March, 2007 mapcria web service openModeller Incofish & CRIA.
Eurostat Expression language (EL) in Eurostat SDMX - TWG Luxembourg, 5 Jun 2013 Adam Wroński.
I Power Higher Computing Software Development The Software Development Process.
CE Operating Systems Lecture 3 Overview of OS functions and structure.
Museum and Institute of Zoology PAS Warsaw Magdalena Żytomska Berlin, 6th September 2007.
OpenModeller framework for ecological niche modelling CRIA, INPE, Poli-USP.
Enrique Martínez-Meyer
Development of a Distributed MATLAB Environment with Real-Time Data Visualization Authors: Joseph Diamond, Richard McEver Affiliation: Dr. Jian Huang,
GEON2 and OpenEarth Framework (OEF) Bradley Wallet School of Geology and Geophysics, University of Oklahoma
An introduction to data exchange protocols in TDWG Renato De Giovanni TDWG 2008.
BOĞAZİÇİ UNIVERSITY DEPARTMENT OF MANAGEMENT INFORMATION SYSTEMS MATLAB AS A DATA MINING ENVIRONMENT.
1 Circuitscape Design Review Presentation Team Circuitscape Mike Schulte Sean Collins Katie Rankin Carl Reniker.
OpenModeller A framework for biological/environmental modelling Inter-American Workshop on Environmental Data Access Campinas - SP, Brazil March 2004.
SIMO Python/XML Simulator Current situation 28/10/2005 SIMO Seminar Antti Mäkinen Dept. of Forest Resource Management / University of Helsinki.
Workforce Scheduling Release 5.0 for Windows Implementation Overview OWS Development Team.
Shell Interface Shell Interface Functions Data. Graphical Interface Graphical Interface Command-line Interface Command-line Interface Experiments Private.
Our project main purpose is to develop a tool for a combinatorial game researcher. Given a version of combinatorial puzzle game and few more parameters,
Species Distribution Modeling Alexandre Copertino Jardim Antonio Miguel Vieira Monteiro Karla Donato Fook Lúbia Vinhas Silvana Amaral Scientific Workshop.
Getting Started with.NET Getting Started with.NET/Lesson 1/Slide 1 of 31 Objectives In this lesson, you will learn to: *Identify the components of the.NET.
Slide 1 Service-centric Software Engineering. Slide 2 Objectives To explain the notion of a reusable service, based on web service standards, that provides.
1 openModeller Presentation Plan: Overview of openModeller OMWS: an open standard for distributed ecological niche modelling openModeller in relation to.
G.v. Bochmann, revised Jan Comm Systems Arch 1 Different system architectures Object-oriented architecture (only objects, no particular structure)
1 Circuitscape Capstone Presentation Team Circuitscape Katie Rankin Mike Schulte Carl Reniker Sean Collins.
Example projects using metadata and thesauri: the Biodiversity World Project Richard White Cardiff University, UK
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
OSSIM Technology Overview Mark Lucas. “Awesome” Open Source Software Image Map (OSSIM)
Staging of the Ecological Niche Modeling Mammal Prototype Project Deana Pennington University of New Mexico December 14, 2004.
OPERATING SYSTEMS (OS) By the end of this lesson you will be able to explain: 1. What an OS is 2. The relationship between the OS & application programs.
Lifemapper 2.0 Using and Creating Geospatial Data and Open Source Tools for the Biological Community Aimee Stewart, CJ Grady, Dave Vieglais, Jim Beach.
ASPOC Presentation for the CAA Operations Review-1 Klaus Torkar and Harald Jeszenszky IWF/OAW Graz ESTEC, May 2006.
Expanding and Scaling Lifemapper Computations Using CCTools
On Maxent Jorge Soberon University of Kansas.
Dot Net Application PROF. S. LAKSHMANAN, DEPT. OF B. VOC. (SD & SA),
Realtime Recognition of Orchestral Instruments
Web Servers (IIS and Apache)
PyWBEM Python WBEM Client: Overview #2
Eurostat Unit B3 – IT and standards for data and metadata exchange
Presentation transcript:

A new tool for fundamental niche modelling Renato De Giovanni Centro de Referência em Informação Ambiental, CrIA

openModeller Definition History Motivation and features Design Interfaces and additional tools Algorithms Future plans

Definition openModeller is an open source C++ library completely dedicated to static spatial distribution modelling. Applications Biology: Fundamental niche modelling. Geology ? Demography ? Others ?

openModeller’s history apr 2003: Initial design of a new modelling environment at CRIA as a natural consequence of previous experiences with other tools (DesktopGarp). oct 2003: First working prototype as part of the speciesLink project (Fapesp). dec 2003: Released all source code (sourceforge). feb 2004: Partnership with BDWorld (CSM / GRID component). apr 2004: Partnership with University of Kansas (GARP / BTRA). jan 2005: Released first graphical user interface (Tim Sutton & Peter Brewer). may 2005: Basis of a new thematic project funded by Fapesp (4y).

Main Motivation Facilitate and speed up modelling tasks, offering at the same time a homogeneous environment to carry out experiments with different algorithms. Main features Platform independent. Enables the existence of multiple interfaces on top of it. Accepts different formats of georeferenced maps. Accepts different coordinate systems and projections for each map and for the whole set of occurrence points. Accepts different cell sizes and extents for each map. Allows the different algorithms to use exactly the same input and the same working environment, therefore enabling fair comparison between all results. Isolates algorithm logic from other issues related to maps, georeferencing, input and output formats, etc. Offers a collaborative and transparent environment for all interested developers.

Architecture overview open Modeller GARP Bioclim CSM pluggable algorithms API others... API Console interfaces SOAP server SWIG wrapper others... drivers pointsmaps (GDAL, proj4, etc) (diff. formats) (diff. coord systems )

Interfaces and additional tools Command line / Console suite –om_console –om_viewer (X11) –om_niche (X11) SWIG wrapper –Python SOAP interface (prototype server and sample client) Web interface Graphical User Interface (Linux, Windows, Mac OS)

Console interface >> om_console request.txt WKT Coord System = Species file = Species = Map = Mask = Output map = Output mask = Output format = Output file = Algorithm = Parameter =

Console interface

Tool for visualizing maps >> om_viewer -r request.txt

Tool for visualizing models >> om_niche request.txt

Web Interface

Graphical User Interface

Development of algorithms Metadata definitions (name, version, author, description, bibliographic references, parameters). Method to initialize the algorithm. Method to generate the model. Method to calculate the probability of occurrence given a certain vector of environmental values.

Algorithms: Building models open Modeller Algorithm API Sampler gives the algorithm vectors of environmental values from a set of occurrence points: Ex: [20˚, 115 mm], [22˚, 100 mm] Algorithm uses the values to build a distribution model and stores an internal representation of it.

Algorithms: Generating distribution maps open Modeller Algorithm For each cell of the resulting map, openModeller asks the probability of presence sending the vector of environmental values as a parameter. Ex: probability for [30˚, 90 mm] ? Algorithm answers with a probability of presence. Ex: prob = F( [30˚, 90 mm] ) = 0.8

Algorithms Bioclim Climate Space Model (Broken Stick cutoff method) GARP (incl. best subset procedures) Distance algorithms –Distance to average –Minimum distance

Algorithms - Bioclim Assumes normal distribution for each environmental variable. Envelopes are represented by the interval [m - c*s, m + c*s], where 'm' is the mean; 'c' is the cutoff parameter; and 's' is the standard deviation. Besides the envelope, each environmental variable has additional upper and lower limits taken from the maximum and minimum values related to the set of occurrence points. Points are classified as: suitable, marginal or unsuitable. fig. 1: cutoff = fig. 2: cutoff = 0.99

Algorithms - GARP Genetic Algorithm for Rule-set Production: models are represented by a set of rules generated by a genetic algorithm. Non-deterministic: produces a different model each time the algorithm is run. fig. 1: model 1 fig. 2: model 2fig. 3: model 3

Algorithms – GARP with Best subsets procedure fig. 1: sample model Runs several GARP models and chooses the best ones according to omission and commission erros. Resulting model is the overlapping of models that were selected in the previous step.

Algorithms – distance to average Normalizes environmental values and parameter. Calculates the mean point in environmental space considering all presence points. Probabily of presence is proportional to the Euclidean distance from the average point (linear decay). Parameter determines the maximum accepted distance. fig. 1: parameter = 0.1 fig. 2: parameter = 0.3

Algorithms – Minimum distance Normalizes environmental values and parameter. Probabily of presence is proportional to the Euclidean distance from the closest point (linear decay). Parameter determines the maximum accepted distance. fig. 1: parameter = 0.05fig. 2: parameter = 0.1

Use case – Byrsonima subterranea Brad. & Markgr. = original point = 4 new points

Scope issues & known limitations Works only with static models – dynamic modelling is currently outside the scope of this tool. None of the algorithms can handle categorical maps (although the library is already prepared to deal with them). None of the algorithms can handle absence points (except GARP), and none of the high level interfaces is prepared to receive absence points as an additional parametrer. Produces only bi-dimensional maps – not prepared to produce models in three dimensions (especially considering aquatic environments). Still not sufficiently documented! Still not sufficiently tested!

Future plans Implementation of other algorithms: neural nets, cellular automata, GLM, GAM, GRASP, Domain… Development of new components to help on pre-processing and post-analysis. Finalize Web and SOAP interfaces. Develop SWIG interfaces for other programming languages. Improve documentation. Implementation of a new and advanced graphical user interface.

New version of the graphical interface

Institutions & People Tim Sutton Peter Brewer Ricardo S. Pereira Kevin Ruland Jens Oberender Mauro Muñoz Renato De Giovanni

Thank you openmodeller. sf. net renato (at) cria. org. br