Introduction for BEAM Ecological Niche Modeling Working Meeting Deana Pennington University of New Mexico December 14, 2004.

SEEK Project
- NSF-funded Information Technology Research (ITR)
- 5 years (starting year 3)
- 50+ researchers and developers
- 9 institutions

Grand Challenges in Ecology
- Alterations in biodiversity: exotic species, infectious disease
- Altered biogeochemical cycles at multiple spatial scales
- Climate change and variability, including ecosystem response to change
- Coupled human-natural ecosystems

Ecoinformatics
- 200+ years of data collection in the US, 300+ globally
- Large and widely distributed data sets
- Data heterogeneity (text, Excel, GIS, DB, etc.)
- New data collection techniques: in situ sensor arrays
- Remotely sensed imagery
- Scaling issues: space, time, levels (taxon)
Tackling these questions will require the use of all of the information available to us.
Biodiversity and ecosystem informatics R&D has been identified as a critical national priority:
- Computer-mediated collaboration
- New tools for synthetic understanding

Science and Technology
- Data-intensive: data mining, bio-inspired algorithms, exp. data analysis, visualization
- Compute-intensive: parallel processing, high throughput, grid technologies
- Domain-intensive: user interfaces, human cognition, ontologies, semantic mediation
- Analysis & Modeling
- EcoGrid

Technologic Systems for Scientists
- Data-intensive
- Compute-intensive
- Domain-intensive
- Science-focused
- Technology-enabled Science
- Kepler Workflow System

Informatics and the Research Cycle
[Diagram labels: Mental Model; Research Design; Collect Data; Conduct Analyses; Share Results; Metadata; Scientific Workflow]
- Data-intensive: data mining, bio-inspired algs., exp. data analysis, visualization
- Compute-intensive: parallel processing, high throughput, grid technologies
- Domain-intensive: user interfaces, human cognition, ontologies, sem. mediation
- Inductive, Descriptive Statistics
- Deductive, Prescriptive Mechanistic

Source: NIH BIRN (Jeffrey Grethe, UCSD)

Promoter Identification Workflow (PIW) Source: Matt Coleman (LLNL)

Species Distribution Workflow
[Workflow diagram. Step labels: EcoGrid Query; Physical Transformation; Scaling; Layer Integration; Sample Data (+ A1, + A2, + A3); Model Calculation; Validation; Map Generation; User; Generate Metadata; Archive to EcoGrid. Data labels: EcoGrid DataBase; species pres. & abs. points; env. layers; integrated layers; training sample; test sample; GARP rule set; model quality parameters; native range prediction map; selected prediction maps.]
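The chain of steps named on this slide (sample data, model calculation, validation, map generation) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: it does not implement GARP, it substitutes a simple environmental-envelope model as a stand-in, and every function name, the record layout, and the alternating train/test split are hypothetical choices rather than anything taken from SEEK or Kepler.

```python
# Minimal sketch of a species distribution workflow.
# Assumption: a simple min/max "envelope" model stands in for GARP.
# Assumption: each record is {"env": {variable: value}, "present": bool}.

def split_sample(points):
    """Split records into training and test samples by alternating records."""
    return points[::2], points[1::2]

def fit_envelope(presences):
    """Record the min/max of each environmental variable at presence points."""
    variables = presences[0]["env"].keys()
    return {
        v: (min(p["env"][v] for p in presences),
            max(p["env"][v] for p in presences))
        for v in variables
    }

def predict(model, env):
    """Predict presence when every variable falls inside the envelope."""
    return all(lo <= env[v] <= hi for v, (lo, hi) in model.items())

def validate(model, test_points):
    """Return the fraction of test records the model labels correctly."""
    correct = sum(predict(model, p["env"]) == p["present"] for p in test_points)
    return correct / len(test_points)

def prediction_map(model, grid):
    """Apply the model to every cell of a 2-D grid of environmental values."""
    return [[predict(model, cell) for cell in row] for row in grid]

def run_workflow(points, grid):
    """Chain the steps: sample, model calculation, validation, map generation."""
    train, test = split_sample(points)
    model = fit_envelope([p for p in train if p["present"]])
    return model, validate(model, test), prediction_map(model, grid)
```

In a workflow system each of these functions would be a separate component, with the occurrence records and environmental layers arriving from data queries instead of in-memory lists.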

ENM workflows
- Climate change
- Species invasion
- Macroanalysis
- Cross-validation
- Calibration
- Environmental monitoring
- Time-specific predictions
- Zoonotic disease

Past year: Conceptual Workflows. Executable Workflows:
- Scripting/visual modeling: single environment, single platform
- Workflows: cross-platform, cross-environment, distributed data & analyses

Data & Analysis Sharing: EcoGrid

What is a workflow?
- Reporting
- Sharing
- Research design
- Data integration
- Analysis integration (data transformation)
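One common way to think about a workflow is as a directed graph of steps, where each step runs once the steps it depends on have produced their outputs. The sketch below illustrates that idea with Python's standard `graphlib` module (Python 3.9+). It is not how Kepler or Ptolemy II schedule execution; the `run_workflow` helper and the step names are hypothetical.

```python
from graphlib import TopologicalSorter

def run_workflow(steps, dependencies):
    """Run each step after its upstream steps, passing their results along.

    steps: maps a step name to a function taking a dict of upstream results.
    dependencies: maps a step name to the names of the steps it depends on.
    Returns the results of every step and the order in which they ran.
    """
    results = {}
    order = list(TopologicalSorter(dependencies).static_order())
    for name in order:
        inputs = {dep: results[dep] for dep in dependencies.get(name, ())}
        results[name] = steps[name](inputs)
    return results, order

# Hypothetical example: two data sources, an integration step,
# an analysis step, and a reporting step.
steps = {
    "load_a": lambda _: [1, 2, 3],
    "load_b": lambda _: [10, 20, 30],
    "integrate": lambda i: list(zip(i["load_a"], i["load_b"])),
    "analyze": lambda i: sum(a * b for a, b in i["integrate"]),
    "report": lambda i: f"total = {i['analyze']}",
}
dependencies = {
    "integrate": ["load_a", "load_b"],
    "analyze": ["integrate"],
    "report": ["analyze"],
}
results, order = run_workflow(steps, dependencies)
```

Declaring the dependencies separately from the step functions is what lets the same steps be rearranged, shared, or replaced without rewriting the rest of the analysis.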

Starting point: Ptolemy II (Edward Lee et al.)

Kepler Additions
- Grid-enabled data and analysis sharing: local, shared, web application, web service
- Statistical library: R (open source)
- GIS library: GDAL/GRASS (open source)
- Domain-specific functionality (GARP, etc.)

Kepler Contributors, Projects, Sponsors
- Ilkay Altintas (SDM)
- Chad Berkley (SEEK)
- Shawn Bowers (SEEK)
- Tobin Fricke (ROADNet)
- Jeffrey Grethe (BIRN)
- Christopher H. Brooks (Ptolemy II)
- Zhengang Cheng (SDM)
- Dan Higgins (SEEK)
- Efrat Jaeger (GEON)
- Matt Jones (SEEK)
- Edward A. Lee (Ptolemy II)
- Kai Lin (GEON)
- Ashraf Memon (GEON)
- Bertram Ludaescher (BIRN, GEON, SDM, SEEK)
- Steve Mock (NMI)
- Steve Neuendorffer (Ptolemy II)
- Jing Tao (SEEK)
- Mladen Vouk (SDM)
- Xiaowen Xin (SDM)
- Yang Zhao (Ptolemy II)
- Bing Zhu (SEEK)
- E-Science Link-up Project Recommended for NEON

Agenda
Goal: To give you the knowledge and training needed to begin to develop grid-enabled applications in Kepler
- Prototype project: ENM Mammal Project
- Resource sharing and grid technologies (Tues am)
- Metadata requirements (Tues pm)
- Kepler training (Wed am)
- Kepler applications in ENM
  - What you can do now (or very soon) (Wed am/pm)
  - What expanded functionality needs to be added (Thurs am/pm)
- Feedback and planning (Thurs pm)

Important Disclaimer
Kepler is a CIS research project in its EARLY stage; there are many, many things still to be done. If:
- Something crashes... it's a work in progress
- Something looks weird... it's a work in progress
- Something doesn't work... it's a work in progress
- Something should be done a different way... it's a work in progress
Best to keep a sense of humor.

Acknowledgements
This material is based upon work supported by the National Science Foundation under awards for SEEK and (AWSFL008-DS3) for GEON, by the Department of Energy under Contract No. DE-FC02-01ER25486 for SciDAC/SDM, and by DARPA under Contract No. F C-1703 for Ptolemy. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation (NSF).
The National Center for Ecological Analysis and Synthesis, a Center funded by NSF (Grant Number ), the University of California, and the UC Santa Barbara campus.
The Andrew W. Mellon Foundation.
PBI Collaborators: NCEAS, University of New Mexico (Long Term Ecological Research Network Office), San Diego Supercomputer Center, University of Kansas (Center for Biodiversity Research)
Kepler contributors: SEEK, Ptolemy II, SDM/SciDAC, GEON