GEONSearch: From Searching to Recommending GeoInformatics 2006 May 10-12, Reston, Virginia Ullas Nambiar, Bertram Ludaescher Dept. of Computer Science.

Slides:



Advertisements
Similar presentations
UCSD SAN DIEGO SUPERCOMPUTER CENTER Ilkay Altintas Scientific Workflow Automation Technologies Provenance Collection Support in the Kepler Scientific Workflow.
Advertisements

GIS in GEON Cyberinfrastructure Presented by Ashraf Memon Presented by Ashraf Memon.
The GEON LiDAR Workflow: An Internet-Based Tool for the Distribution and Processing of LiDAR Point Cloud Data Christopher J. Crosby, J Ramón Arrowsmith,
2. Point Cloud x, y, z, … Complete LiDAR Workflow 1. Survey 4. Analyze / “Do Science” 3. Interpolate / Grid USGS Coastal & Marine.
Planned Title: Review of Evaluation of Geospatial Search Allan Doyle.
1 CYBERINFRASTRUCTURE FOR THE GEOSCIENCES Global Earth Observation Grid Workshop, Bangkok, Thailand, March Integration Platform.
SAN DIEGO SUPERCOMPUTER CENTER Developing a CUAHSI HIS Data Node, as part of Cyberinfrastructure for the Hydrologic Sciences David Valentine Ilya Zaslavsky.
A Kepler-based Three Tier Architecture applied to LiDAR Interpolation and Analysis Efrat Frank, Ilkay Altintas San Diego Supercomputer Center, UCSD Configuration.
PHANEROZOIC EARTH AND LIFE: THE PALEOINTEGRATION PROJECT Allister Rees - Department of Geosciences, University of Arizona John Alroy - Paleobiology Database,
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES WMS Map Integration - Improved Ghulam Memon Ashraf Memon.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
TWC Knowledge Evolution in Distributed Geoscience Datasets and the Role of Semantic Technologies Xiaogang (Marshall) Ma Tetherless World Constellation.
Methods for Data Discovery – Portals Portal facilitates access to and also assimilation of data Portal is not simply a web site: it offers services such.
GEON: The User Perspective Choonhan Youn Dogan Seber, Chaitan Baru, Ashraf Memon San Diego Supercomputer Center, University of California at San Diego.
January, 23, 2006 Ilkay Altintas
GEON Science Application Demos
1 Distributed Database Concepts 8:30-10:00AM Thursday, July 21 st 2005 CSIG05 Chaitan Baru.
GEON-UTEP GEON-Knowledge Representation WG Update GEON-KR list (currently) Bertram Ludaescher (SDSC: Bertram Ludaescher (SDSC:
NSF Meeting on Cyberinfrastructure for Surficial Processes, Jan.18-19, 2006 Slide 1 GEON: The Geosciences Network Chaitan Baru San Diego Supercomputer.
Updates from EOSDIS -- as they relate to LANCE Kevin Murphy LANCE UWG, 23rd September
Data R&D Issues for GTL Data and Knowledge Systems San Diego Supercomputer Center University of California, San Diego Bertram Ludäscher
Welcome to the SWGeoNet System The SouthWest Geospatial Network (SWGeoNet) system is an integrated geospatial-data system for the Transition Zone between.
The Pragmatics of Geo-ontologies, and the Ontology of Geo-pragmatics Boyan Brodaric, Geological Survey of Canada, Ottawa.
Physical model Model results HPCC Data Modeling Environment Core Grid Services Authentication, monitoring, scheduling, catalog, data transfer, Replication,
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Investigators: Chaitan Baru, Randy Keller, Dogan Seber, Krishna Sinha, Ramon Arrowsmith, Boyan Brodaric, Karl Flessa, Eric Frost, Ann Gates, Mark Gahegan,
Science Environment for Ecological Knowledge: EcoGrid Matthew B. Jones National Center for.
Enabling Access to High-Resolution LiDAR Topography through Cyberinfrastructure-Based Data Distribution and Processing Christopher J. Crosby, J Ramón Arrowsmith.
Efrat Frank, Ashraf Memon, Vishu Nandigam, Chaitan Baru
1PeopleDocumentsData Catalog Generation Tools Analysis and Visualization Tools Data Services Discovery and Publication Tools Discovery and Publication.
CBEO:N Chesapeake Bay Environmental Observatory as a Network Node About CBEO The mission of the CBEO project is development of a Chesapeake Bay Environmental.
1 Ilkay ALTINTAS - July 24th, 2007 Ilkay ALTINTAS Director, Scientific Workflow Automation Technologies Laboratory San Diego Supercomputer Center, UCSD.
The Digital Library for Earth System Science: Contributing resources and collections Meeting with GLOBE 5/29/03 Holly Devaul.
Where to find LiDAR: Online Data Resources.
Fall AGU Meeting, December, 2005 GEON Developments for Searching, Accessing, Integrating, and Visualizing Distributed Data Charles Meertens UNAVCO Dogan.
Geosciences Network (GEON): Enabling Discoveries in the Earth Sciences Dogan Seber San Diego Supercomputer Center University of California,
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES GEON Systems Report Karan Bhatia San Diego Supercomputer Center Friday Aug
Cyberinfrastructure and EarthScope Science goals: A GEON perspective What is Cyberinfrastructure? What is GEON? How will GEON research facilitate discovery.
GEON PI Meeting, March h, 2004, Blacksburg, VA C YBERINFRASTRUCTURE FOR THE G EOSCIENCES GEON IT Update PI Meeting, Blacksburg, VA March 21-23, 2004.
Presented by Scientific Annotation Middleware Software infrastructure to support rich scientific records and the processes that produce them Jens Schwidder.
The Digital Library for Earth System Science: Contributing resources and collections GCCS Internship Orientation Holly Devaul 19 June 2003.
Kepler includes contributors from GEON, SEEK, SDM Center and Ptolemy II, supported by NSF ITRs (SEEK), EAR (GEON), DOE DE-FC02-01ER25486.
GEON2 and OpenEarth Framework (OEF) Bradley Wallet School of Geology and Geophysics, University of Oklahoma
The VIRTUAL SOLAR-TERRESTRIAL OBSERVATORY - Exploring paradigms for interdisciplinary data-driven science Peter Fox 1 Don Middleton 2,
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES GEON Project Management Dogan Seber (GEON PI and Project Manager) San Diego Supercomputer Center.
The Global Land Cover Facility is sponsored by NASA and the University of Maryland.The GLCF is a founding member of the Federation of Earth Science Information.
1 CYBERINFRASTRUCTURE FOR THE GEOSCIENCES IGEON 2007 at the University of Hyderabad, India, August Web Services – The Motivation.
GEOSCIENCE NEEDS & CHALLENGES Dogan Seber San Diego Supercomputer Center University of California, San Diego, USA.
Scientific Workflow systems: Summary and Opportunities for SEEK and e-Science.
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES GEON IT Advances: Overview Chaitan Baru San Diego Supercomputer Center.
AN ORGANISATION FOR A NATIONAL EARTH SCIENCE INFRASTRUCTURE PROGRAM Virtual Geophysics Laboratory (VGL): Scientific workflows Exploiting the Cloud Josh.
A Cyberinfrastructure Framework for Discovery, Integration, and Analysis of Earth Science Data A Prototype System A. K. Sinha, Z. Malik, A. Rezgui, A.
Cyberinfrastructure: Many Things to Many People Russ Hobby Program Manager Internet2.
NATIONAL PARTNERSHIP FOR ADVANCED COMPUTATIONAL INFRASTRUCTURE SAN DIEGO SUPERCOMPUTER CENTER Interlib Technology Integration Reagan.
Glossary WMS – OGC Web Mapping Services WFS – OGC Web Feature Services XML- Extensible Markup Language OGC – Open GIS Consortium ADN –
SCEC: An NSF + USGS Research Center Focus on Forecasts Motivation.
Origami: Scientific Distributed Workflow in McIDAS-V Maciek Smuga-Otto, Bruce Flynn (also Bob Knuteson, Ray Garcia) SSEC.
THE PALEOINTEGRATION PROJECT IN GEON Allister Rees - Department of Geosciences, University of Arizona.
A Science Collaboration Environment for the Network for Earthquake Engineering Simulation (NEES) Choonhan Youn Chaitan Baru, Ahmed Elgamal,
GEON IT Solutions: Products and Demos Chaitan Baru San Diego Supercomputer Center.
System Software Laboratory Databases and the Grid by Paul Watson University of Newcastle Grid Computing: Making the Global Infrastructure a Reality June.
EcoGrid in SEEK A Data Grid System for Ecology Bertram Ludaescher University of California, Davis Arcot Rajasekar San Diego Supercomputer Center, University.
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
1 Design and Implementation of EarthScope Data Portal Chaitan Baru, Kai Lin San Diego Supercomputer Center.
improve the efficiency, collaborative potential, and
Data R&D Issues for GTL Bertram Ludäscher Data and Knowledge Systems
Grid Portal Services IeSE (the Integrated e-Science Environment)
Digital library for Earth System Education Teaching Boxes
INSPIRE Geoportal Thematic Views Application
EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal
Presentation transcript:

GEONSearch: From Searching to Recommending GeoInformatics 2006 May 10-12, Reston, Virginia Ullas Nambiar, Bertram Ludaescher Dept. of Computer Science University of California, Davis Ghulam Memon, Dogan Seber, Chaitan Baru San Diego Supercomputer Center University of California, San Diego

 NSF Large ITR project – collaborative effort  GEON is creating an IT infrastructure to “enable” interdisciplinary geoscience research -- not a group of researchers, but the entire community will benefit  Support efficient Knowledge Discovery from GeoScientific Data – GEONPortal provided as a Web-based tool for knowledge discovery

Mapping Services ArcIMS WMS WFS Logging Services Usage Stats Collection & Analysis Data Services DB2, Postgres mySQL OpenDAP SRB Data Registration Services Indexing Services Spatial Temporal Conceptual Data Integration Services Ontology Enabled Integration Computational & Modeling Services Modeling, Analysis Tools Metadata Services GEON Catalog Others RegistrationGEONsearch GEONworkbench workflow, visualization, HPC Web/Grid Services Interfaces (WSDL) Physical Grid RedHat Linux, ROCKS, OGSI, Internet, I2, OptIPuter (planned) Other Core Services GridFTP OGSA-DAI CSF GEON: GEOsciences Network Slide adapted from Dr Dogan Seber, SDSC

Scientific Knowledge Discovery Source: Fayyad, U., Piatetsky-Shapiro, G., Smyth, P., Uthurusamy, R., Advances in Knowledge Discovery and Data Mining, GEON Network GEONSearch GEON Map/Query Integration & Tools

Prototypical query over GEON  Query: “Find Gravity data for regions near Rocky Mountains where geologic age is Jurassic”  What is needed ? – Definition of “regions near” Rocky Mountains Approximate “regions near” as “Rocky Mountain States”  Colorado, Idaho, Montana, Nevada, Utah and Wyoming – Geologic age data for above regions – Gravity measurements taken in those regions – Map of continental US or that of Rocky Mountain States  How to obtain it ? – From GEON and with the help of GEONSearch

Current GEONSearch Features  GEONSearch is the resource discovery tool available under GEON Portal  Allows users to retrieve datasets by querying its – Keywords, Title and Description – Metadata Search the Subject Taxonomy under which datasets are classified – Spatial Coverage A bounding box of region covered by a registered dataset – Temporal Coverage Age of objects evaluated or time when evaluations done – Concepts from Ontology Concepts to which dataset or items contained in dataset are mapped to during registration

Keyword Search in GEON Rockies gravity data or Colorado gravity or Montana gravity or gravity data …. Rocky Mountain Region Map or Colorado Map or state maps…. Rockies Jurassic or Colorado Jurassic or Jurassic …… Too few answers Too many answers

Advanced GEONSearch Enter the potential subject group of datasets ? Datasets containing a city in Colorado, Idaho, etc Data with age Jurassic Datasets mapped to concept mountains or gravity or geologic age

Improving GEONSearch  The challenge before GEONSearch – Reduce iterative querying effort from Scientists  The solution – Suggest “similar queries” Queries with more/less keywords Queries that likely to have highly similar answers – Suggest “related answers” for every result based on Spatial proximity Temporal proximity Usage patterns Common Ontology mappings  Caveat – Suggested queries and answers must be ranked using corresponding distance measures

Knowledge Discovery using GEON GEONSearch RD1V1 DD2V4 Query/Map Integration Tools DD6V1 DD7V1 Integration Cart (MyGEON) GEON Network Registered Datasets RD1V1 RD2V2 Derived Datasets DD2V2 DD2V4 MetadataMetadata O n t o l o g y Need Gravity data around Rocky Mountains …… How to share result with other scientists ! Register DD6V1 DD7V1

Supporting Versions in GEON  GEON network can be visualized as a virtual scientific database that stores results of scientific inquiry – Every dataset available reflects a scientific process E.g. Dataset D1 contains gravity measurements around Davis  Datasets may change over time – Parameters of a scientific process can be changed resulting in additional results – Updating a registered dataset could affect outcome of other scientific inquiries E.g. Updating D1 with new data may make results of processes using D1 irreproducible

GEON Versioning System  Focus – Revision management for locally-hosted data ASCII, Excel, ESRI Shapefiles, GeoTIFF, OWL files etc  Operations supported – Revisions – Branching  Assumptions – Dataset provider decides between Branch or Revision – No support for eventual merging

Provenance Management  Provenance Management becomes necessary in presence of versioning  The provenance of a piece of data is data about process involved in generating the data. – Who, What, Where, How, Assumptions transformation a1a1 a1a1 a2a2 a2a2  Useful for verifying quality of datasets recommended by GEONSearch a1a1 a2a2

Summary  GEONSearch is a necessary tool for knowledge discovery using GEON – Allows both simple Keyword Search and advanced Search  GEONSearch will soon be enhanced to provide “related content” and thus improve the “Search Process”  GEON Versioning System under development for supporting version management and provenance tracking

Thank You! Contact Info: Search for “Ullas Nambiar” in your favorite Search Engine Feedback is very welcome:  Questions  Suggestions  Specific Use Cases