SEEK EcoGrid l Integrate diverse data networks from ecology, biodiversity, and environmental sciences l Metacat, DiGIR, SRB, Xanthoria,... l EML is the.

1 SEEK EcoGrid
- Integrate diverse data networks from ecology, biodiversity, and environmental sciences
- Metacat, DiGIR, SRB, Xanthoria, ...
- EML is the core for data documentation
- Open programming interface

2 EcoGrid client interactions

3 Aims of EcoGrid
- Which, Where, How, Who?
- Share Data and Information
- Relate Data from multiple projects/groups
- Crosswalks across data structures
- Develop Eco-related Finding Aids for Data
- Global User: Authenticate and Authorize
- Provide an infrastructure for “Archivable Collection-building” for SEEK scientists
- Facilitate the A&M layer and the SMS layer

4 Challenges of EcoGrid
- Data & User Diversity
- 6000+ datasets & 1500+ scientists
- Themes, methods, units, structures
- Small data sizes but high complexity (metadata)
- Multiple Data Organizations
- Biodiversity Surveys
- Population data
- GIS, Satellite Images, Weather Data, …
- Ontologies & Taxonomies
- Data Discovery: no single place to find data
- Data Entropy – rapid decline of information on data
- Autonomy with Centralized access
- Leverage Computational Grid work

5 Existing Services
- Metacat – syntactic and semantic metadata querying/inserting/updating/deleting, user registration/authentication, data replication, data/metadata versioning; supports any XML-based metadata
- Xanthoria – common-schema mediator (currently 8 sites); metadata query/insert/update/delete for any XML schema against the underlying metadata database (SQL, native XML)

6 Existing Systems
- DiGIR – querying arbitrary XML-describable resources (underlying data sources can be of any type: RDB, XMLDB)
- ClimDB – integrating climate data in diverse formats (using wrapping at the data source). Access through the web; common schema identified beforehand (tabular description)
- HyperLTER – summary ontology stored as metadata for images; image extraction (geographic subsetting, band-level subsetting); integration with MODIS images, hyperspectral images, TM images, air photos, …

7 Existing Systems
- VegBank – three databases: co-occurrence records, a concept-driven species taxonomic database, and community classification. Distributed VegBank, querying by plots. Query/insert/update/annotate across three diverse databases that are described using XML
- SRB – access to distributed data; querying based on syntactic, semantic, and user-defined (arbitrary relational) metadata. Annotations for data. Operations on data. Extraction of metadata. Ingest, bulk ingest, delete, and update of data/metadata

8 EcoGrid Services
- Query
  - Search metadata and data, return result sets with ID
- Read
  - Retrieve data objects by ID
- Authentication
  - Verify user identity
- Authorization
  - Record allowable interactions
- Write
  - Write data objects by ID
- Replication
  - Mirror objects for backup and efficiency
- Computation
  - Execute models and simulations from AMS on various nodes
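The Query, Read, Write, and Authorization services listed above can be illustrated with a minimal in-memory sketch. Everything here is hypothetical: the class name `InMemoryEcoGridNode`, its method signatures, and the substring-matching search are assumptions for illustration, not the EcoGrid WSDL interfaces.

```python
from dataclasses import dataclass, field


@dataclass
class InMemoryEcoGridNode:
    """Hypothetical toy node; not the real EcoGrid interface."""
    objects: dict = field(default_factory=dict)  # object ID -> (metadata text, data bytes)
    writers: set = field(default_factory=set)    # users allowed to write (authorization record)

    def write(self, user: str, object_id: str, metadata: str, data: bytes) -> None:
        # Authorization: only users recorded as writers may store objects.
        if user not in self.writers:
            raise PermissionError(f"{user} may not write {object_id}")
        # Write: store the data object under its ID.
        self.objects[object_id] = (metadata, data)

    def query(self, term: str) -> list:
        # Query: search metadata and return a result set of matching IDs.
        term = term.lower()
        return sorted(oid for oid, (meta, _) in self.objects.items()
                      if term in meta.lower())

    def read(self, object_id: str) -> bytes:
        # Read: retrieve a data object by ID.
        return self.objects[object_id][1]


node = InMemoryEcoGridNode(writers={"jones"})
node.write("jones", "test.1.1", "Soil data from West Valley, 1983", b"depth,ph\n10,6.5\n")
ids = node.query("soil")
print(ids, node.read(ids[0]))
```

The point of the sketch is the separation the slide describes: queries return IDs only, and data objects are fetched in a separate read step.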

9 EcoGrid Search Interactions
- Features
  - Well-defined interfaces (e.g., WSDL)
  - Standardized messaging formats
  - Automated discovery of implementing services
  - Aggregation/Indexing across nodes for efficiency
  - Support heterogeneous data objects via metadata descriptions
  - Lightweight to implement for various systems like DiGIR and Metacat
- Diagram: Client, Registry, QueryService
  - 1. Register
  - 2. Find Query Nodes
  - 3. Search (recursive)
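The three diagram steps (register, find query nodes, recursive search) might look roughly like the following sketch. The classes `Registry` and `QueryService`, the idea that a query node forwards a search to child nodes, and the node names are all assumptions made for illustration; the slides do not specify how recursion is implemented.

```python
class Registry:
    """Hypothetical service registry that remembers registered query nodes."""

    def __init__(self):
        self._query_nodes = []

    def register(self, node):            # step 1: Register
        self._query_nodes.append(node)

    def find_query_nodes(self):          # step 2: Find Query Nodes
        return list(self._query_nodes)


class QueryService:
    """Hypothetical query node that searches itself, then its child nodes."""

    def __init__(self, name, records, children=()):
        self.name = name
        self.records = records            # object ID -> metadata text
        self.children = list(children)

    def search(self, term, _seen=None):  # step 3: Search (recursive)
        seen = set() if _seen is None else _seen
        if self.name in seen:             # guard against cycles between nodes
            return []
        seen.add(self.name)
        hits = [(self.name, oid) for oid, meta in self.records.items()
                if term.lower() in meta.lower()]
        for child in self.children:
            hits.extend(child.search(term, seen))
        return hits


digir = QueryService("digir", {"d.1": "Species presence points, soil plots"})
metacat = QueryService("metacat", {"m.1": "Soil data from West Valley, 1983",
                                   "m.2": "Bird survey"}, children=[digir])
registry = Registry()
registry.register(metacat)

results = []
for node in registry.find_query_nodes():
    results.extend(node.search("soil"))
print(results)
```

Each hit carries the name of the node that holds the object, so a client can later direct a read to the right place.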

10 EcoGrid Index Interactions
- Diagram: Client, Registry, QueryService, IndexedQueryService
  - 1. Register
  - 2. Find Query Nodes
  - 3. Search (recursive)
  - 4. Read (recursive)
  - 5. Find Index Nodes
  - 6. Search
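One way to picture an index node is as a service that aggregates metadata from several nodes ahead of time, answers searches locally, and forwards reads to the node that owns the object. The sketch below assumes a simple word-level inverted index; `DataNode`, `IndexedQueryService`, and their methods are hypothetical names, not part of any EcoGrid specification.

```python
from collections import defaultdict


class DataNode:
    """Hypothetical node holding objects as ID -> (metadata text, data bytes)."""

    def __init__(self, name, objects):
        self.name = name
        self.objects = objects

    def read(self, object_id):
        return self.objects[object_id][1]


class IndexedQueryService:
    """Hypothetical index node: builds a word index over all nodes' metadata."""

    def __init__(self, nodes):
        self.nodes = {n.name: n for n in nodes}
        self.index = defaultdict(set)     # word -> {(node name, object ID)}
        for n in nodes:
            for oid, (meta, _) in n.objects.items():
                for word in meta.lower().split():
                    self.index[word].add((n.name, oid))

    def search(self, word):
        # Answered from the local index, without contacting the nodes.
        return sorted(self.index.get(word.lower(), set()))

    def read(self, node_name, object_id):
        # Forwarded to the node that actually stores the object.
        return self.nodes[node_name].read(object_id)


knb = DataNode("knb", {"k.1": ("Soil data from West Valley", b"soil-bytes")})
srb = DataNode("srb", {"s.1": ("Soil moisture layers", b"layer-bytes")})
index_node = IndexedQueryService([knb, srb])

hits = index_node.search("soil")
print(hits, [index_node.read(name, oid) for name, oid in hits])
```

The trade-off is the usual one for aggregation: searches are cheap, but the index must be refreshed when nodes change.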

11 Authentication and Authorization
- KNB uses a simple LDAP system with referrals
  - Leverages existing DBs (e.g., LTER personnel DB)
  - Not really scalable in terms of administration
- Grid Security Infrastructure (GSI)
  - Certificate-based authentication
  - Proxy certificates allow transfer of rights
  - Decentralized administration (i.e., multiple CAs)
- Can we easily transition to GSI?

12 Native Range Prediction Workflow (slide from D. Pennington)
- Diagram: workflow from EcoGrid queries to an archived prediction map
  - Inputs via EcoGrid Query: KNB abundance data (a1), DiGIR species presence & absence points (a2), SRB environmental layers (b)
  - Steps: Layer Integration, Sample, Data Calculation, Map, Validation, User
  - Intermediate and final products: integrated layers (native range) (c), training sample (d), test sample (d), GARP rule set (e), native range prediction map (f), model quality parameter (g)
  - Output stored in the EcoGrid Archive

13 Implementation
- Short-term
  - Define common WSDL services
  - Simple service registry
  - Wrappers for Metacat, DiGIR, SRB, Xanthoria, etc.
- Medium-term
  - Use OGSI-compliant interfaces (add methods to current WSDL)
  - Grid Registry service

14 Timing
- April 4
- April 11 -- Design Diagrams
- April 18 -- WSDL, Registry instance operational, query + read, RSIDS schema and examples
- April 25
- May 2
- May 9 -- Wrapper implementations + test client(s)
- May 16 (SEEK Technical WG meeting)
- May 23
- May 30 -- Hard deadline for implementation of EcoGrid alpha 1

15 Query Messages
<egq:query queryId="test.1.1" system="test" xmlns:egq="ecogrid://ecoinformatics.org/ecogrid-query-1.0.0alpha1">
Soils metadata query
%soil%
%dirt%
<condition operator="LIKE" concept="eml:surName">%Jones%
<condition operator="LIKE" concept="eml:surName">%Vieglais%
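A query message of this shape could be assembled programmatically, for example with Python's standard `xml.etree.ElementTree`. The sketch below uses only the element and attribute names that survive in the slide (`egq:query`, `condition`, `queryId`, `system`, `operator`, `concept`). Placing the `condition` elements directly under the root is an assumption, since any wrapper elements were lost in the transcript, and `build_query` is a hypothetical helper.

```python
import xml.etree.ElementTree as ET

# Namespace URI as shown on the slide.
EGQ_NS = "ecogrid://ecoinformatics.org/ecogrid-query-1.0.0alpha1"
ET.register_namespace("egq", EGQ_NS)


def build_query(query_id, system, surnames):
    """Hypothetical helper: one LIKE condition on eml:surName per surname."""
    root = ET.Element(f"{{{EGQ_NS}}}query",
                      {"queryId": query_id, "system": system})
    for name in surnames:
        # Assumption: conditions sit directly under the query element.
        cond = ET.SubElement(root, "condition",
                             {"operator": "LIKE", "concept": "eml:surName"})
        cond.text = f"%{name}%"           # SQL-style wildcard pattern
    return root


query = build_query("test.1.1", "test", ["Jones", "Vieglais"])
xml_text = ET.tostring(query, encoding="unicode")
print(xml_text)
```

Building the message through an XML library rather than string concatenation keeps the namespace declaration and escaping correct if a search term contains characters such as `&` or `<`.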

16 Result Responses
<rs:resultset resultsetId="foo.1.1" system="http://knb.ecoinformatics.org/knb/" xmlns:rs='ecogrid://ecoinformatics.org/ecogrid-resultset-1.0.0alpha1'>
2003-05-02T16:45:50-09:00
86
<records startRecord="1" endRecord="1" xmlns:eml='eml://ecoinformatics.org/eml-2.0.0'>
Soil data from West Valley, 1983
Jones
Smith
aves
ornithology
biodiversity

