© Geodise Project, University of Southampton, 2003. Data Management in Geodise Zhuoan Jiao, Jasmin Wason and Marc Molinari 30-31.

Slides:



Advertisements
Similar presentations
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Data Grids for Collection Federation Reagan W. Moore University.
Advertisements

WP2: Data Management Gavin McCance University of Glasgow November 5, 2001.
The Quantum Chromodynamics Grid James Perry, Andrew Jackson, Matthew Egbert, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
Abstraction Layers Why do we need them? –Protection against change Where in the hourglass do we put them? –Computer Scientist perspective Expose low-level.
E-Science Data Information and Knowledge Transformation Eldas Building Service Grids with Enterprise Level Data Access Services Alan Gray
© Geodise Project, University of Southampton, Applying the Semantic Web to Manage Knowledge on the Grid Feng Tao, Colin.
Using the Collaborative Tools in NEESgrid Charles Severance University of Michigan.
Grid Enabled Optimisation and Design Search for Engineering (G EODISE ) Prof Simon Cox Southampton University 3 rd Annual Workshop on Linux Clusters for.
© Geodise Project, University of Southampton, Geodise: Taking the Grid to the Engineer Graeme Pound International Summer.
© Geodise Project, University of Southampton, Applications and Middleware Hakki Eres, Fenglian Xu & Graeme Pound.
Connect. Communicate. Collaborate Click to edit Master title style MODULE 1: perfSONAR TECHNICAL OVERVIEW.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
© Geodise Project, University of Southampton, Geodise: A Grid-enabled PSE for design search and optimisation Graeme Pound.
Knowledge Management in Geodise Geodise Knowledge Management Team Liming Chen, Barry Tao, Colin Puleston, Paul Smart University of Southampton University.
© Geodise Project, University of Southampton, Data Management in Geodise Jasmin Wason, Zhuoan Jiao and Marc Molinari Engineering.
EXtensible Neuroimaging Archive Toolkit (XNAT) Washington University Neuroinformatics Group.
SITools Enhanced Use of Laboratory Services and Data Romain Conseil
M1G Introduction to Database Development 6. Building Applications.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
1 School of Computer, National University of Defense Technology A Profile on the Grid Data Engine (GridDaEn) Xiao Nong
QCDGrid Progress James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
© Geodise Project, University of Southampton, GEODISE: Grid-enabled toolkits for the Engineer Andrew Price UK e-Science Programme,
1 All-Hands Meeting 2-4 th Sept 2003 e-Science Centre The Data Portal Glen Drinkwater.
ILDG Middleware Status Chip Watson ILDG-6 Workshop May 12, 2005.
Design of a Search Engine for Metadata Search Based on Metalogy Ing-Xiang Chen, Che-Min Chen,and Cheng-Zen Yang Dept. of Computer Engineering and Science.
Application portlets within the PROGRESS HPC Portal Michał Kosiedowski
Shannon Hastings Multiscale Computing Laboratory Department of Biomedical Informatics.
1 1 EPCC 2 Curtin Business School & Edinburgh University Management School Michael J. Jackson 1 Ashley D. Lloyd 2 Terence M. Sloan 1 Enabling Access to.
The Globus Toolkit 4 (GT4) A brief introduction. Web Services, WSRF, OGSA and GT4.
Web: Minimal Metadata for Data Services Through DIALOGUE Neil Chue Hong AHM2007.
ICDL 2004 Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer Science Old Dominion University.
Implementing the XDS Infrastructure Bill Majurski IT Infrastructure National Institute of Standards and Technology.
Grid Architecture William E. Johnston Lawrence Berkeley National Lab and NASA Ames Research Center (These slides are available at grid.lbl.gov/~wej/Grids)
JISC Middleware Security Workshop 20/10/05© 2005 University of Kent.1 The PERMIS Authorisation Infrastructure David Chadwick
Institute For Digital Research and Education Implementation of the UCLA Grid Using the Globus Toolkit Grid Center’s 2005 Community Workshop University.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
INFSO-RI Enabling Grids for E-sciencE OGSA DAI Data Access and Integration Marek Ciglan Institute of Informatics, Slovac Academy.
© Geodise Project, University of Southampton, Geodise Middleware & Optimisation Graeme Pound, Hakki Eres, Gang Xue & Matthew Fairman Summer 2003.
Uwe SchindlerGES 2007 – May 2-4, 2007 Data Information Service based on Open Archives Initiative Protocols and Apache Lucene Uwe Schindler 1, Benny Bräuer.
© Geodise Project, University of Southampton, Knowledge Management in Geodise Geodise Knowledge Management Team Barry Tao, Colin Puleston, Liming.
The CERA2 Data Base Data input – Data output Hans Luthardt Model & Data/MPI-M, Hamburg Services and Facilities of DKRZ and Model & Data Hamburg,
Metadata Mòrag Burgon-Lyon University of Glasgow.
A radiologist analyzes an X-ray image, and writes his observations on papers  Image Tagging improves the quality, consistency.  Usefulness of the data.
Presented by Jens Schwidder Tara D. Gibson James D. Myers Computing & Computational Sciences Directorate Oak Ridge National Laboratory Scientific Annotation.
EGEE User Forum Data Management session Development of gLite Web Service Based Security Components for the ATLAS Metadata Interface Thomas Doherty GridPP.
© Geodise Project, University of Southampton, Data Management in Geodise Zhuoan Jiao, Jasmin Wason & Marc Molinari { z.jiao,
Lattice QCD Data Grid Middleware: Meta Data Catalog (MDC) -- CCS ( tsukuba) proposal -- M. Sato, for ILDG Middleware WG ILDG Workshop, May 2004.
INRIA - Progress report DBGlobe meeting - Athens November 29 th, 2002.
Replica Management Kelly Clynes. Agenda Grid Computing Globus Toolkit What is Replica Management Replica Management in Globus Replica Management Catalog.
© Geodise Project, University of Southampton, Grid middleware for engineering design search and optimisation Graeme Pound.
© Geodise Project, University of Southampton, Geodise Middleware Graeme Pound, Gang Xue & Matthew Fairman Summer 2003.
DSpace System Architecture 11 July 2002 DSpace System Architecture.
Feb 24-27, 2004ICDL 2004, New Dehli Improving Federated Service for Non-cooperating Digital Libraries R. Shi, K. Maly, M. Zubair Department of Computer.
© Geodise Project, University of Southampton, Integrating Data Management into Engineering Applications Zhuoan Jiao, Jasmin.
ESG-CET Meeting, Boulder, CO, April 2008 Gateway Implementation 4/30/2008.
1 AHM, 2–4 Sept 2003 e-Science Centre GRID Authorization Framework for CCLRC Data Portal Ananta Manandhar.
AHM04: Sep 2004 Nottingham CCLRC e-Science Centre eMinerals: Environment from the Molecular Level Managing simulation data Lisa Blanshard e- Science Data.
© Geodise Project, University of Southampton, Geodise Compute Toolbox Functions CommandFunctionCommandFunction gd_certinfo.
Brian Matthews, euroCRIS, 18/09/03 CRIS architecture to support an ERA Brian Matthews.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
© Geodise Project, Scenario: Design optimisation v Model device, discretize, solve, postprocess, optimise Scripting.
Data Breakout. OGSA Architecture – databases Eldas, OGSA-DAI and GridMiner implement a slightly old version of OGSA / DAIS –Architecture doc describes.
© Geodise Project, University of Southampton, Workflow Support for Advanced Grid-Enabled Computing Fenglian Xu *, M.
© Geodise Project, University of Southampton, Data Management in Geodise Jasmin Wason, Zhuoan Jiao and Marc Molinari 12 May.
The CUAHSI Hydrologic Information System Spatial Data Publication Platform David Tarboton, Jeff Horsburgh, David Maidment, Dan Ames, Jon Goodall, Richard.
Robert Worden Open Mapping Software Ltd
Viet Tran Institute of Informatics Slovakia
Knowledge Based Workflow Building Architecture
Presentation transcript:

© Geodise Project, University of Southampton, Data Management in Geodise Zhuoan Jiao, Jasmin Wason and Marc Molinari January 2003, Edinburgh Engineering design and optimisation is a computationally intensive process where data may be generated at different locations with different characteristics. Data is traditionally stored in flat files with little descriptive metadata provided by the file system. Our focus is on providing data management by leveraging existing database tools that are not commonly used in engineering and making them accessible to users of the system. The main objectives are to provide: A data management service Store and retrieve data files securely from a repository using GridFTP. Technical and application specific metadata added so data is easier to search for and locate. Metadata management services Web services provide API access to metadata in databases. Use both relational and XML databases. A familiar interface for engineers Work with functions and variables rather than underlying XML, SOAP, SQL, XPath, etc.

© Geodise Project, University of Southampton, Storage service Allows applications to archive data sent over GridFTP in file systems curated by Geodise for benefits of: accessibility by a larger community (via authorisation), storage capacity, and a uniform query interface. Example: Archive data: >> fileID = gd_archive('C:\input.dat'); Retrieve data: >> gd_retrieve(fileID, 'E:\tmp' ) ans = E:\tmp\input.dat Metadata service The data can be stored with additional descriptive information detailing technical characteristics (e.g. location, format), ownership, and application domain specific metadata. Example: Define metadata and archive file: >> m.grids = 1; >> m.turb_model = 'sa'; >> fileID = gd_archive('C:\input.dat', m); Query service Query over the metadata database can help to locate the needed data intuitively and efficiently. Example: >> r = gd_query('standard.userID = me & grids < 2'); To access a value from first result in cell array r: >> r{1}.turb_model ans = sa Authorisation service Access rights to data can be granted to an authenticated user based on information stored in the authorisation database. Example: >> m.grids = 1; >> m.access.users = {'userA', 'userB'}; >> m.access.groups = {'groupC'}; >> fileID = gd_archive ('C:\input.dat', m); Geodise Database Toolkit

© Geodise Project, University of Southampton, Database Application of XML Toolbox for MATLAB Name-based XML (easy for query) X = <file_metadata type="struct" idx="0" fields="grids turb_model"> 1 sa Type-based XML (easy for converting back to Matlab) X = <struct xmlns=" idx="0“ fields="grids turb_model"> 1 sa Define Matlab variables: >> meta.grids = 1 >> meta.turb_model = ‘sa’ X = xml_format(meta) meta = xml_parse(X) XSLT: type2name XSLT: name2type The following functions are responsible for converting a Matlab variable to and from an XML string: xml_format() - Convert a Matlab variable into an XML string. xml_parse() - Convert an XML string into a Matlab variable.

© Geodise Project, University of Southampton, Authorisation Service Location Service Java Metadata Archive & Query Services Data Management Implementation To increase the usability of file and metadata management services for Engineers we have implemented a MATLAB Toolkit for archiving, querying and retrieval of data to and from a Geodise repository. Matlab Functions Java clients Globus Server Geodise Database Toolkit Metadata Database Browser Client Grid Authorisation Database CoG Apache SOAP Location Database Refers to SOAP GridFTP

© Geodise Project, University of Southampton, Query Example MATLAB commands to retrieve files. Copy and paste

© Geodise Project, University of Southampton, Future Work Grid Data Management Replace and enhance some of our functionality with that provided by OGSA DAI for Grid Database Services. E.g. Name mapping interface for authenticating Grid credentials to local ids. Automatic collection of data and metadata from a higher level engineering problem setup GUI. Manage Matlab Structures Some data may take the form of Matlab structures rather than files. These can be archived as XML in the repository and then queried. The structures can also be retrieved back into the Matlab workspace. Categorisation of metadata based on XML Schemas Group metadata with same XML Schema in the database. Users are not expected to write XML Schemas. Generate simple XML Schema from metadata structure if one does not already exist to describe it. Could help future integration of ontologies.