© Geodise Project, University of Southampton, 2003. Data Management in Geodise Jasmin Wason, Zhuoan Jiao and Marc Molinari Engineering.

Slides:



Advertisements
Similar presentations
X-SIGMA (An XML based Simple data Integration system for Gathering, Managing and Accessing scientific experimental data in grid environments) Karpjoo
Advertisements

National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Data Grids for Collection Federation Reagan W. Moore University.
WP2: Data Management Gavin McCance University of Glasgow November 5, 2001.
© Geodise Project, University of Southampton, Applying the Semantic Web to Manage Knowledge on the Grid Feng Tao, Colin.
Visualisation in Grid Enabled Optimisation and Design Search for Engineering Marc Molinari, Sam Gould University of Southampton
Grid Enabled Optimisation and Design Search for Engineering (G EODISE ) Prof Simon Cox Southampton University 3 rd Annual Workshop on Linux Clusters for.
© Geodise Project, University of Southampton, Geodise: Taking the Grid to the Engineer Graeme Pound International Summer.
© Geodise Project, University of Southampton, Applications and Middleware Hakki Eres, Fenglian Xu & Graeme Pound.
Connect. Communicate. Collaborate Click to edit Master title style MODULE 1: perfSONAR TECHNICAL OVERVIEW.
Workshop on Cyber Infrastructure in Combustion Science April 19-20, 2006 Subrata Bhattacharjee and Christopher Paolini Mechanical.
Mike Jackson EPCC OGSA-DAI Today Release 2.2 Principles and Architectures for Structured Data Integration: OGSA-DAI.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Database Environment 1.  Purpose of three-level database architecture.  Contents of external, conceptual, and internal levels.  Purpose of external/conceptual.
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
© Geodise Project, University of Southampton, Geodise: A Grid-enabled PSE for design search and optimisation Graeme Pound.
Database System Concepts and Architecture Lecture # 3 22 June 2012 National University of Computer and Emerging Sciences.
1 Dr. Markus Hillenbrand, ICSY Lab, University of Kaiserslautern, Germany A Generic Database Web Service for the Venice Service Grid Michael Koch, Markus.
A Metadata Catalog Service for Data Intensive Applications Presented by Chin-Yi Tsai.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
1 School of Computer, National University of Defense Technology A Profile on the Grid Data Engine (GridDaEn) Xiao Nong
© Geodise Project, University of Southampton, Data Management in Geodise Zhuoan Jiao, Jasmin Wason and Marc Molinari
Introduction to Apache OODT Yang Li Mar 9, What is OODT Object Oriented Data Technology Science data management Archiving Systems that span scientific.
QCDGrid Progress James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
© Geodise Project, University of Southampton, GEODISE: Grid-enabled toolkits for the Engineer Andrew Price UK e-Science Programme,
1 All-Hands Meeting 2-4 th Sept 2003 e-Science Centre The Data Portal Glen Drinkwater.
DATABASE MANAGEMENT SYSTEMS IN DATA INTENSIVE ENVIRONMENNTS Leon Guzenda Chief Technology Officer.
Javascript Cog Kit By Zhenhua Guo. Grid Applications Currently, most grid related applications are written as separate software. –server side: Globus,
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Grid Architecture William E. Johnston Lawrence Berkeley National Lab and NASA Ames Research Center (These slides are available at grid.lbl.gov/~wej/Grids)
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Persistent Management of Distributed Data Reagan W. Moore.
Institute For Digital Research and Education Implementation of the UCLA Grid Using the Globus Toolkit Grid Center’s 2005 Community Workshop University.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Elmasri and Navathe, Fundamentals of Database Systems, Fourth Edition Copyright © 2004 Pearson Education, Inc. Slide 2-1 Data Models Data Model: A set.
INFSO-RI Enabling Grids for E-sciencE OGSA DAI Data Access and Integration Marek Ciglan Institute of Informatics, Slovac Academy.
13 1 Chapter 13 The Data Warehouse Database Systems: Design, Implementation, and Management, Seventh Edition, Rob and Coronel.
© Geodise Project, University of Southampton, Geodise Middleware & Optimisation Graeme Pound, Hakki Eres, Gang Xue & Matthew Fairman Summer 2003.
Uwe SchindlerGES 2007 – May 2-4, 2007 Data Information Service based on Open Archives Initiative Protocols and Apache Lucene Uwe Schindler 1, Benny Bräuer.
Presented by Scientific Annotation Middleware Software infrastructure to support rich scientific records and the processes that produce them Jens Schwidder.
© Geodise Project, University of Southampton, Knowledge Management in Geodise Geodise Knowledge Management Team Barry Tao, Colin Puleston, Liming.
Mike Jackson EPCC OGSA-DAI Architecture + Extensibility OGSA-DAI Tutorial GGF17, Tokyo.
Metadata Mòrag Burgon-Lyon University of Glasgow.
A radiologist analyzes an X-ray image, and writes his observations on papers  Image Tagging improves the quality, consistency.  Usefulness of the data.
Presented by Jens Schwidder Tara D. Gibson James D. Myers Computing & Computational Sciences Directorate Oak Ridge National Laboratory Scientific Annotation.
EGEE User Forum Data Management session Development of gLite Web Service Based Security Components for the ATLAS Metadata Interface Thomas Doherty GridPP.
© Geodise Project, University of Southampton, Data Management in Geodise Zhuoan Jiao, Jasmin Wason & Marc Molinari { z.jiao,
Hussein Suleman University of Cape Town Department of Computer Science Digital Libraries Laboratory February 2008 Data Curation Repositories:
MySQL and GRID status Gabriele Carcassi 9 September 2002.
© Geodise Project, University of Southampton, Grid middleware for engineering design search and optimisation Graeme Pound.
© Geodise Project, University of Southampton, Geodise Middleware Graeme Pound, Gang Xue & Matthew Fairman Summer 2003.
DSpace System Architecture 11 July 2002 DSpace System Architecture.
© Geodise Project, University of Southampton, Integrating Data Management into Engineering Applications Zhuoan Jiao, Jasmin.
ESG-CET Meeting, Boulder, CO, April 2008 Gateway Implementation 4/30/2008.
EbXML Registry and Repository Dept of Computer Engineering Khon Kaen University.
Data Manipulation with Globus Toolkit Ivan Ivanovski TU München,
AHM04: Sep 2004 Nottingham CCLRC e-Science Centre eMinerals: Environment from the Molecular Level Managing simulation data Lisa Blanshard e- Science Data.
© Geodise Project, University of Southampton, Geodise Compute Toolbox Functions CommandFunctionCommandFunction gd_certinfo.
1 Database Environment. 2 Objectives of Three-Level Architecture u All users should be able to access same data. u A user’s view is immune to changes.
© Geodise Project, Scenario: Design optimisation v Model device, discretize, solve, postprocess, optimise Scripting.
2) Database System Concepts and Architecture. Slide 2- 2 Outline Data Models and Their Categories Schemas, Instances, and States Three-Schema Architecture.
© Geodise Project, University of Southampton, Workflow Support for Advanced Grid-Enabled Computing Fenglian Xu *, M.
© Geodise Project, University of Southampton, Data Management in Geodise Jasmin Wason, Zhuoan Jiao and Marc Molinari 12 May.
The CUAHSI Hydrologic Information System Spatial Data Publication Platform David Tarboton, Jeff Horsburgh, David Maidment, Dan Ames, Jon Goodall, Richard.
FHIR and Relational Databases
GEODISE: Grid-enabled toolkits for the Engineer
The Client/Server Database Environment
Grid Enabled Optimisation and Design Search (GEODISE)
Knowledge Based Workflow Building Architecture
Database Environment Transparencies
Presentation transcript:

© Geodise Project, University of Southampton, Data Management in Geodise Jasmin Wason, Zhuoan Jiao and Marc Molinari Engineering design and optimisation is a computationally intensive process where data may be generated at different locations with different characteristics. Data is traditionally stored in flat files with little descriptive metadata provided by the file system. Our focus is on providing data management by leveraging existing database tools that are not commonly used in engineering and making them accessible to users of the system. The main objectives are to provide: A data management service Securely store and retrieve data files and data structures to/from a distributed repository. Technical and application specific metadata added so data is easier to search for, locate and share. Metadata management services Web services provide API access to metadata in relational and XML databases. Related data aggregated into groups. A familiar interface for engineers Work with functions and variables rather than underlying XML, SOAP, SQL, XPath, etc.

© Geodise Project, University of Southampton, Storage service Allows applications to archive files sent over GridFTP to file systems, and store data structures as XML in databases. A location service maps logical data identities with physical storage locations. Example: Archive data: >> fileID = gd_archive('C:\input.dat'); Retrieve data: >> gd_retrieve(fileID, 'E:\tmp' ) ans = E:\tmp\input.dat Metadata service The data can be stored with additional descriptive information detailing technical characteristics (e.g. format, size, date), ownership, and user-defined application domain specific metadata. Example: Define metadata and archive file: >> m.grids = 1; >> m.turb_model = 'sa'; >> fileID = gd_archive('C:\input.dat', m); Query service Querying over the metadata database can help to locate the needed data intuitively and efficiently, using the gd_query function or query GUI. Users only receive results for data they are authorised to access. Example: >> r = gd_query('standard.userID = me & grids < 2'); >> gd_display(r): standard.userID = me standard.ID = input_dat_8a ad2d-4055-aad9-a1 grids = 1 Authorisation service An authenticated user may grant other users access rights to their data and this information is stored in the authorisation database. Example: >> m.grids = 1; >> m.access.users = {'userA', 'userB'}; >> m.access.groups = {'groupC'}; >> fileID = gd_archive ('C:\input.dat', m); Geodise Database Toolbox

© Geodise Project, University of Southampton, Authorisation Service Location Service Java Metadata Archive & Query Services Data Management Implementation To increase the usability of file and metadata management services for engineers we have implemented a MATLAB Toolbox for archiving, querying and retrieval of data to and from a Geodise repository. Matlab Functions Java clients Globus Server Geodise Database Toolbox Metadata Database Client Grid Authorisation Database CoG Apache SOAP Location Database Refers to SOAP GridFTP

© Geodise Project, University of Southampton, XML Toolbox Enables the conversion of Matlab variables and structures from proprietary format to XML and vice versa in transparent, easy-to-use way. The XML can then be transferred, stored, and retrieved across the Grid. Four functions: xml_save(), xml_load(), xml_format(), xml_parse() The XML Toolbox developed by the GEM project. Application of the XML Toolbox in Geodise.

© Geodise Project, University of Southampton, Future Work Categorisation of metadata based on XML Schemas XML Schemas can be generated to describe user-defined metadata. Changes are made over time to metadata about a design. Use comparison and merge tool to detect changes and alter XML Schema. XML Schema work may include future integration of ontologies. Web query interface Auto generation of query interface based on XML Schemas. User certificate required for authentication. Computational steering with shared Matlab variables Temporary shared storage and update of Matlab variables. XML Toolbox Ability to read most XML files and convert these into Matlab struct variables and vice versa. Infrastructure Improved Web Service security and use of OGSA-DAI. Jython implementation of client toolbox.