Introduction to Kepler Deana Pennington, PhD University of New Mexico LTER Network Office, Sevilleta LTER PI CI-Team: Advancing CI-Based Science through Education, Training, and Mentoring of Science Communities CoPI Science Environment for Ecological Knowledge (SEEK) project July 10, 2007
Science Environment for Ecological Knowledge (SEEK) project National Science Foundation-funded large ITR Partnership for Biodiversity Informatics (PBI)
Kepler website:
Which version? Beta is stable, but with more limited functionalityBeta is stable, but with more limited functionality Beta does not work with Java 1.6 – use 1.5 or install kepler-1.0.beta3-jre, which includes a compatible versionBeta does not work with Java 1.6 – use 1.5 or install kepler-1.0.beta3-jre, which includes a compatible version Nightly build has most current functionality – but some is undocumentedNightly build has most current functionality – but some is undocumented Nightly build is sometimes broken – but builds from the prior 3 nights are also availableNightly build is sometimes broken – but builds from the prior 3 nights are also available
Opening Kepler Installed version: StartStart My ProgramsMy Programs KeplerKeplerNightly: Kepler.batKepler.bat
Director/Actor Metaphor Directors define the model of computation to be used in the workflowDirectors define the model of computation to be used in the workflow Every workflow must specify a directorEvery workflow must specify a director Actor Actors know HOW to act..know their part Directors know WHEN they should act
Actors & ports actor name data ports Input data parameters Output data Atomic actor 1 input port2 output ports Reads a dataset Initializing parameters function
Composite Actors Composite actor Open Actor input output
Graph Editor Model building area (Canvas) Library of components Navigation area Tool bar Search
Discovery DataData ActorsActors DirectorsDirectors
Kepler EarthGrid
EarthGrid registered data show up in KEPLER Grid query Grid get
Kepler understands EML data*
Kepler: web services access
ENM in Kepler: Conceptual Workflow IPCC future climate scenarios (S = 21) GARP model training & prediction of present distribution (P) Sample Data 2 sets Calculate model error Select best models (m) Testing data sample set Combine prediction results => probability map Dispersal analysis For each S integrated with T 2 2 Predict future distribution (F) from model For each P & F prediction from models (m) = 22 For each model in m For i = 1 to n n = # of models MaNIS Species Locations (L) Hydro1k topographic layers (T) n = 4 Convex Hull Mask IPCC present climate layers (C) n = 7 Restructure Rescale Projection Extent Grain Input Parameters Append layers Append datasets Store points as ASCII For each T For each C Rescale values For each C, S & T EcoGrid query through Kepler Legend 1 1 Restructure For each S Filter out If n < X, where n = count of occurrences X is user defined Convert layers to binary Occurrence Points Gridded layers: Climate Topography Modeling Prediction GIS: GDAL/Java Kepler Native CR GIS/R
ENM in Kepler: Executable Workflow Top level
Kepler Self-Directed Tutorial