GEON: The User Perspective Choonhan Youn Dogan Seber, Chaitan Baru, Ashraf Memon San Diego Supercomputer Center, University of California at San Diego.

Slides:



Advertisements
Similar presentations
An Operational Metadata Framework For Searching, Indexing, and Retrieving Distributed GIServices on the Internet By Ming-Hsiang.
Advertisements

SACNAS, Sept 29-Oct 1, 2005, Denver, CO What is Cyberinfrastructure? The Computer Science Perspective Dr. Chaitan Baru Project Director, The Geosciences.
SAN DIEGO SUPERCOMPUTER CENTER Choonhan Youn Viswanath Nandigam, Nancy Wilkins-Diehr, Chaitan Baru San Diego Supercomputer Center, University of California,
The Geosciences Network (GEON) An Example of Democratizing Science G. Randy Keller - University of Oklahoma (Cyberinfrastructure in Action)
Chad Berkley National Center for Ecological Analysis and Synthesis (NCEAS), University of California, Santa Barbara February.
GIS in GEON Cyberinfrastructure Presented by Ashraf Memon Presented by Ashraf Memon.
The GEON LiDAR Workflow: An Internet-Based Tool for the Distribution and Processing of LiDAR Point Cloud Data Christopher J. Crosby, J Ramón Arrowsmith,
2. Point Cloud x, y, z, … Complete LiDAR Workflow 1. Survey 4. Analyze / “Do Science” 3. Interpolate / Grid USGS Coastal & Marine.
Update on ASU GEON Activities J Ramon Arrowsmith GEON PI Meeting Reston, VA May 13, 2006.
1 CYBERINFRASTRUCTURE FOR THE GEOSCIENCES Global Earth Observation Grid Workshop, Bangkok, Thailand, March Integration Platform.
Center for Environmental Studies Arizona State University Digital Research Records at Center for Environmental Studies Peter McCartney.
GIS at SDSC Domains: –From geology, environmental science, hydrology, ocean biodiversity, regional development, Katrina response, archaeology, to neuroscience.
SAN DIEGO SUPERCOMPUTER CENTER Developing a CUAHSI HIS Data Node, as part of Cyberinfrastructure for the Hydrologic Sciences David Valentine Ilya Zaslavsky.
A Kepler-based Three Tier Architecture applied to LiDAR Interpolation and Analysis Efrat Frank, Ilkay Altintas San Diego Supercomputer Center, UCSD Configuration.
Dogan Seber, PhD San Diego Supercomputer Center University of California, San Diego I. DLESE Library II. DISCOVER OUR EARTH Earth Science Resources for.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Best Practices: Integration of OpenTopography DEM data with UIUC Viewshed tool SDSC OT team.
About CUAHSI The Consortium of Universities for the Advancement of Hydrologic Science, Inc. (CUAHSI) is an organization representing 120+ universities.
CSIG 10 Survey of Emerging IT Trends and Technologies Chaitan Baru SDSC 1.
GEON Workshop, Auckland, Nov 26-27, 2007 Introduction to GEON and iGEON Chaitan Baru.
GEON Science Application Demos
Web Services: a Mechanism for Across-the-Internet On Demand Computing and Communication DMS Workshop Stevenson, WA Wed 08 June 2005 What are Web Services,
CI Days: Planning Your Campus Cyberinfrastructure Strategy Russ Hobby, Internet2 Internet2 Member Meeting 9 October 2007.
NSF Meeting on Cyberinfrastructure for Surficial Processes, Jan.18-19, 2006 Slide 1 GEON: The Geosciences Network Chaitan Baru San Diego Supercomputer.
Data R&D Issues for GTL Data and Knowledge Systems San Diego Supercomputer Center University of California, San Diego Bertram Ludäscher
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES High Performance Computing applications in GEON: From Design to Production Dogan Seber.
Physical model Model results HPCC Data Modeling Environment Core Grid Services Authentication, monitoring, scheduling, catalog, data transfer, Replication,
“A Library outranks any other one thing a community can do to benefit its people.” Andrew Carnegie Mary R. Marlino, Ed.D. DLESE Program Center Presentation.
Investigators: Chaitan Baru, Randy Keller, Dogan Seber, Krishna Sinha, Ramon Arrowsmith, Boyan Brodaric, Karl Flessa, Eric Frost, Ann Gates, Mark Gahegan,
Enabling Access to High-Resolution LiDAR Topography through Cyberinfrastructure-Based Data Distribution and Processing Christopher J. Crosby, J Ramón Arrowsmith.
What is Cyberinfrastructure? Russ Hobby, Internet2 Clemson University CI Days 20 May 2008.
Efrat Frank, Ashraf Memon, Vishu Nandigam, Chaitan Baru
1PeopleDocumentsData Catalog Generation Tools Analysis and Visualization Tools Data Services Discovery and Publication Tools Discovery and Publication.
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES Data Replication Service Sandeep Chandra GEON Systems Group San Diego Supercomputer Center.
CBEO:N Chesapeake Bay Environmental Observatory as a Network Node About CBEO The mission of the CBEO project is development of a Chesapeake Bay Environmental.
1 Ilkay ALTINTAS - July 24th, 2007 Ilkay ALTINTAS Director, Scientific Workflow Automation Technologies Laboratory San Diego Supercomputer Center, UCSD.
Where to find LiDAR: Online Data Resources.
Fall AGU Meeting, December, 2005 GEON Developments for Searching, Accessing, Integrating, and Visualizing Distributed Data Charles Meertens UNAVCO Dogan.
Geosciences Network (GEON): Enabling Discoveries in the Earth Sciences Dogan Seber San Diego Supercomputer Center University of California,
GEON PI Meeting, March h, 2004, Blacksburg, VA C YBERINFRASTRUCTURE FOR THE G EOSCIENCES GEON IT Update PI Meeting, Blacksburg, VA March 21-23, 2004.
Holding slide prior to starting show. A Portlet Interface for Computational Electromagnetics on the Grid Maria Lin and David Walker Cardiff University.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
GO-ESSP Workshop, LLNL, Livermore, CA, Jun 19-21, 2006, Center for ATmosphere sciences and Earthquake Researches Construction of e-science Environment.
Kepler includes contributors from GEON, SEEK, SDM Center and Ptolemy II, supported by NSF ITRs (SEEK), EAR (GEON), DOE DE-FC02-01ER25486.
GEON2 and OpenEarth Framework (OEF) Bradley Wallet School of Geology and Geophysics, University of Oklahoma
Cyberinfrastructure What is it? Russ Hobby Internet2 Joint Techs, 18 July 2007.
SIG: Synthetic Seismogram Exchange Standards (formats & metadata) Is it time to establish exchange standards for synthetic seismograms? IRIS Annual Workshop.
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES GEON Project Management Dogan Seber (GEON PI and Project Manager) San Diego Supercomputer Center.
1 CYBERINFRASTRUCTURE FOR THE GEOSCIENCES IGEON 2007 at the University of Hyderabad, India, August Web Services – The Motivation.
GEOSCIENCE NEEDS & CHALLENGES Dogan Seber San Diego Supercomputer Center University of California, San Diego, USA.
GRID-ENABLED MEDIATION SERVICES FOR GEOSPATIAL INFORMATION Ilya Zaslavsky, Chaitan Baru San Diego Supercomputer Center University of California San Diego.
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES GEON IT Advances: Overview Chaitan Baru San Diego Supercomputer Center.
Enabling e-Research in Combustion Research Community T.V Pham 1, P.M. Dew 1, L.M.S. Lau 1 and M.J. Pilling 2 1 School of Computing 2 School of Chemistry.
06/22/041 Data-Gathering Systems IRIS Stanford/ USGS UNAVCO JPL/UCSD Data Management Organizations PI’s, Groups, Centers, etc. Publications, Presentations,
A Cyberinfrastructure Framework for Discovery, Integration, and Analysis of Earth Science Data A Prototype System A. K. Sinha, Z. Malik, A. Rezgui, A.
Cyberinfrastructure Overview Russ Hobby, Internet2 ECSU CI Days 4 January 2008.
Cyberinfrastructure: Many Things to Many People Russ Hobby Program Manager Internet2.
CUAHSI HIS: Science Challenges Linking small integrated research sites (
GEONSearch: From Searching to Recommending GeoInformatics 2006 May 10-12, Reston, Virginia Ullas Nambiar, Bertram Ludaescher Dept. of Computer Science.
Glossary WMS – OGC Web Mapping Services WFS – OGC Web Feature Services XML- Extensible Markup Language OGC – Open GIS Consortium ADN –
High Risk 1. Ensure productive use of GRID computing through participation of biologists to shape the development of the GRID. 2. Develop user-friendly.
SAN DIEGO SUPERCOMPUTER CENTER, UCSD NATIONAL PARTNERSHIP FOR ADVANCED COMPUTATIONAL INFRASTRUCTURE Introduction to SDSC Fran Berman Director, SDSC and.
A Science Collaboration Environment for the Network for Earthquake Engineering Simulation (NEES) Choonhan Youn Chaitan Baru, Ahmed Elgamal,
GEON IT Solutions: Products and Demos Chaitan Baru San Diego Supercomputer Center.
Shaowen Wang 1, 2, Yan Liu 1, 2, Nancy Wilkins-Diehr 3, Stuart Martin 4,5 1. CyberInfrastructure and Geospatial Information Laboratory (CIGI) Department.
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
Joslynn Lee – Data Science Educator
Shaowen Wang1, 2, Yan Liu1, 2, Nancy Wilkins-Diehr3, Stuart Martin4,5
Data R&D Issues for GTL Bertram Ludäscher Data and Knowledge Systems
Growing importance of metadata for synthetics: Calculating and Sharing Synthetic Seismic Data Dogan Seber University of California, San Diego San Diego.
Presentation transcript:

GEON: The User Perspective Choonhan Youn Dogan Seber, Chaitan Baru, Ashraf Memon San Diego Supercomputer Center, University of California at San Diego

GEON (GEOscience Network) A cyberinfrastructure project for geosciences funded by NSF ITR. creating an IT infrastructure to “enable” interdisciplinary geoscience research -- not a group of researchers, but the entire community will benefit Vision: Enable new discoveries in the geosciences by building an easy-to-use and “comprehensive” data, software, tools, and information network by utilizing state-of the-art information technology resources.

Current GEON member institutions Members Arizona State University Bryn Mawr College Penn State University Rice University San Diego State University San Diego Supercomputer Center / University of California, San Diego University of Arizona University of Idaho University of Missouri, Columbia University of Texas at El Paso University of Utah Virginia Tech UNAVCO, Inc. Digital Library for Earth System Education (DLESE) Partners California Institute for Telecommunications and Information Technology Cal-IT2 Chronos CUAHSI ESRI Geological Survey of Canada Georeference Online IBM Kansas Geological Survey Lawrence Livermore National Laboratory U.S. Geological Survey (USGS) Other Affiliates Southern California Earthquake Center (SCEC), EarthScope, IRIS, NASA

GEOSCIENCE CHALLENGES Exponential Increase in Data Volume – How to manage vast amounts of data can be used by all scientists in an easy-to-use environment Data Storage, Access and Preservation – How to build a framework to exchange data and help preserving collected data sets Data Integration (semantic and syntactic) – How to merge multiple geology maps to make a seamless (“integrated”) map Computational Challenges – How to build a system that helps scientists run advance software without having access to significant resources (computers and technical), focusing on the science problem Advance Visualization (3D/4D) – How to build a visualization system that helps scientists analyze large and complex data sets dynamically Archiving and publications of results with reusable components (reusability) – How to preserve scientific results and help others to repeat the analysis as efficiently as possible?

GEON Cyberinfrastructure (CI) Principles CI: Support the “day to day” conduct of science (e- science), in addition to “hero” computations An equal partnership – IT works in close conjunction with science Create shared “science infrastructure” – Integrated online databases, with advanced search and query engines – Online models, robust tools and applications Leverage from other intersecting projects – Much commonality in the technologies, regardless of science disciplines, e.g. BIRN, SEEK, and many others

Main e-Research facilities I A Resource Registration System for Data Providers – Register ontologies (domain knowledge) and ontology articulations – Register datasets with metadata including data access information – Optionally register datasets to ontologies (which is crucial for data integration and smart search): Ontology enabled semantic integration – Shapefile, ASCII, Excel, GMT Raster, Geo TIFF, Relational Database, PDF, tool, WMS service, Web service, etc. A Search Engine for Data Users – Metadata based search – Spatial coverage based search – Temporal coverage based search – Concept based search – Ontology based data discovering

Main e-Research facilities II The user workspace, called myGEON area. – Users are able to search and collect their data sets from the GEON search engine and integrate them. – For example, users can review and analyze "SYNSEIS“ ouputs that are generated by job running. Computational HPC – SYNSEIS (Synthetic Seismogram toolkit) Workflow – LiDAR: an end-to-end solution for the distribution, interpolation and analysis of LiDAR / ALSM point data. – Atype workflow: generates map for all plutonic bodies in Virginia from the VA Igneous rocks database based on the certain inputs.

Constraints for main e-Research facilities Dynamic workflow issues due to the web-based system on the GEON Large computational clusters for simulating GEON applications as needed – GEON has three small cluster nodes on partner sites

GEON Portal Usability Easy of use – GEON Search, SYNSEIS, many of them, etc. Make complex tasks easy to specify – LiDAR Highly interactive – SYNSEIS Integrated access to tools and resources – myGEON, Mapping Integration

Computational HPC for SYNSEIS

Lessons Learnt Its main strengths – Standard-compliant ways – Using open source libraries and tools for most of implementations Its main weaknesses – Highly user interactive, friendly interface issues within the portlet franework Would you consider alternatives to a portal solution? – Currently, No

Future Plan Will add and develop new functionalities based on the requests from GEON PIs and geoscience community. Will keep improving the portal usability. – For example, in case of SYNSEIS, add more user capabilities in the user interface for complex earthquake simulations. Will expand its use within geoscience community internationally – Center on GEON PIs first

GEON: The Developer Perspective Choonhan Youn Dogan Seber, Chaitan Baru, Ashraf Memon San Diego Supercomputer Center, University of California at San Diego

Methods of GEON’s Design Several workshops were held with participation from scientists from different disciplines like geochemistry, geophysics etc. Also Principal Investigators (PIs) visits SDSC for focused discussion on their requirements Prototypes are built using gathered requirements and then spiral model of software development is followed to enhance the prototype.

Service-Oriented Approach

Priority of Functional and non-functional requirements Start with functional requirement from the principal investigators or local geo-science PI Prototypes are built and functional requirements are tested Then focus on to non-functional requirements like usability

Technical Strategy The “two-tier” approach – Use best practices, including use of commercial tools and open standards, where applicable… start with development using the technology available now – …while developing advanced technology, and doing CS research push for open source and best practices as much as possible

GEONmiddleware GEONSearch, Registration, myGEON Portlet myOntology.owl myDataset.foo metadata User Access (via Portal) Gazetteer, DLESE, … Geologic Age, Chronos, … external services GEONsearch Search condition(s) spatial temporal concept Log myGEON GEON Workspace (user) User actions add delete manipulate GEON Catalog ResourceRegistration SRB Client Access (via web services) Other distributed apps Kepler, DLESE, …

Flash Application SYNSEIS toolkit SYNSEIS Portlet Data Model Service Job Submission/Monitoring and File Service Data Archives Service HPC Resources Data Repository Job Database SOAP JDBC CORBA(IIOP) Grid Services GEONGrid Portal User Access (via Web Browser) Cornell Map Server IRIS DMC HTTP SOAP Grid FTP Web Services myGEON Portlet SAC Service TeraGrid clusters

Development Issues Constraints – Interoperability issues due to use of existing tools Use of existing tools developed in Fortran and some machine dependent algorithms and code GRASS based GIS processing. Incompatible implementation of same standard (OGC’s WMS) – Usability requirements Portlets UI is designed by the software developers and so they are not very user friendly – Part of our tension in the project is that while this is an R&D project for the IT folks, the science folks want some of it to look like production software – lack of user input in some cases, because some users are still trying to get up to speed with the IT concepts so they haven’t really used the system.

Evaluation Usually success of our GEON services is determined by user satisfaction! Usability workshop was held recently with domain scientist involved and their feedback was taken. – Based on this report, we are working on it Another workshop will be held after the implementation of the suggested changes.

Lessons Learnt The most successful aspects – Integrating with other grid, such as TeraGrid – Data registration, search capabilities for geoscience community – Community involvement The least successful aspects – Community still is evaluating this system.

Future Plans Will provide a secure role-based authorization control (using SAML) to fully integrate into the GEON portal. Will add WSRP service. The definition of conventions for managing state may be handled through standard ways such as WSRF so that applications discover, bind, and communicate with stateful resources in standard and interoperable ways.

GEON Search Portlet

GEON Resource Registration Portlet

User Workspace

Mapping Integration Portlet Client Portlet Map Integration Portlet (Mediator) Map Integration Portlet (Mediator) Geon Dataset Ids Gridsphere GEON Metadata Catalogue Ontology Service SRB Query Tracking DB Query Service Mapping Services Webservices 1.Dataset Ids to Dataset Names 2.Dataset Ids to Ontology Ids 3.Ontology Ids to Ontology Names 4.Ontology Ids to Ontology Concepts Ontology Engine ArcIMS Knowledge Representation Redefine Query Mapping Execute Query Download Datasets Store Query Results Query Result Indexing Generate Map GET_EXTRACT GET_MAP

IBM DB2 GEON Portal NFS Mounted Disk Data Processing Algorithms Compute Cluster x,y,z and attribute raw data process output maps/data Client WWW GEON Search Portlet LiDAR Process Portlet Other Portlet LiDAR Processing Service Spatial Query Service GEON Search Service Software Tools DB2 Spatial Function GRASSARCINFOGMT GEON Catalog DATA PROCESSING(LiDAR Portlet) TeraGrid DataStar