QCDgrid User Interfaces James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.

Slides:



Advertisements
Similar presentations
The Quantum Chromodynamics Grid James Perry, Andrew Jackson, Matthew Egbert, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
Advertisements

UKQCD GridPP NeSCAC Irving, 4/2/041 9 th GridPP Collaboration Meeting QCDgrid: Status and Future Alan Irving University of Liverpool.
CS0004: Introduction to Programming Visual Studio 2010 and Controls.
VAMDC Registry Portal Proof of Concept. Registry VAMDC Registry is available at – ex.jsp
Chapter 18 - Data sources and datasets 1 Outline How to create a data source How to use a data source How to use Query Builder to build a simple query.
Browsers and Servers CGI Processing Model ( Common Gateway Interface ) © Norman White, 2013.
Guide to Oracle10G1 Introduction To Forms Builder Chapter 5.
A Guide to Oracle9i1 Introduction To Forms Builder Chapter 5.
NextGRID & OGSA Data Architectures: Example Scenarios Stephen Davey, NeSC, UK ISSGC06 Summer School, Ischia, Italy 12 th July 2006.
11 3 / 12 CHAPTER Databases MIS105 Lec14 Irfan Ahmed Ilyas.
Chapter 7 Managing Data Sources. ASP.NET 2.0, Third Edition2.
SQL Reporting Services Overview SSRS includes all the development and management pieces necessary to publish end user reports in  HTML  PDF 
Data Grid Web Services Chip Watson Jie Chen, Ying Chen, Bryan Hess, Walt Akers.
Slide 1 of 9 Presenting 24x7 Scheduler The art of computer automation Press PageDown key or click to advance.
The Project AH Computing. Functional Requirements  What the product must do!  Examples attractive welcome screen all options available as clickable.
TIBCO Designer TIBCO BusinessWorks is a scalable, extensible, and easy to use integration platform that allows you to develop, deploy, and run integration.
Microsoft Access Database software. What is a database? … a database is an organized collection of data. A collection of data of similar information compiled.
MS Access Advanced Instructor: Vicki Weidler Assistant:
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
Linux Operations and Administration
The ATLAS Production System. The Architecture ATLAS Production Database Eowyn Lexor Lexor-CondorG Oracle SQL queries Dulcinea NorduGrid Panda OSGLCG The.
BaBar WEB job submission with Globus authentication and AFS access T. Adye, R. Barlow, A. Forti, A. McNab, S. Salih, D. H. Smith on behalf of the BaBar.
Copyright © 2003 by Prentice Hall Module 4 Database Management Systems 1.What is a database? Data hierarchy and data organization Field, record, file,
Copyright © 2003 by Prentice Hall Computers: Tools for an Information Age Chapter 13 Database Management Systems: Getting Data Together.
4-1 INTERNET DATABASE CONNECTOR Colorado Technical University IT420 Tim Peterson.
Tutorial 10 Adding Spry Elements and Database Functionality Dreamweaver CS3 Tutorial 101.
What is Sure BDCs? BDC stands for Batch Data Communication and is also known as Batch Input. It is a technique for mass input of data into SAP by simulating.
ILDG5QCDgrid1 QCDgrid status report UKQCD data grid Chris Maynard.
QCDgrid UKQCD Achievements and Future Priorities Who and what Achievements QCDgrid middleware Future priorities Demo of meta-data catalogue browser Alan.
1 PHP and MySQL. 2 Topics  Querying Data with PHP  User-Driven Querying  Writing Data with PHP and MySQL PHP and MySQL.
OracleAS Reports Services. Problem Statement To simplify the process of managing, creating and execution of Oracle Reports.
Copyright © 2007, Oracle. All rights reserved. Managing Concurrent Requests.
9 Chapter Nine Compiled Web Server Programs. 9 Chapter Objectives Learn about Common Gateway Interface (CGI) Create CGI programs that generate dynamic.
Chapter 6 SAS ® OLAP Cube Studio. Section 6.1 SAS OLAP Cube Studio Architecture.
Management Information Systems MS Access MS Access is an application software that facilitates us to create Database Management Systems (DBMS)
QCDGrid Progress James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
Setting up a Pan-European Datagrid using QCDgrid technology Chris Johnson, James Perry, Lorna Smith and Jean-Christophe Desplat EPCC, The University Of.
The Network Performance Advisor J. W. Ferguson NLANR/DAST & NCSA.
1 Overview of the Application Hosting Environment Stefan Zasada University College London.
Putting it all together Dynamic Data Base Access Norman White Stern School of Business.
UKQCD QCDgrid Richard Kenway. UKQCD Nov 2001QCDgrid2 why build a QCD grid? the computational problem is too big for current computers –configuration generation.
Bookkeeping Tutorial. Bookkeeping & Monitoring Tutorial2 Bookkeeping content  Contains records of all “jobs” and all “files” that are created by production.
ILDG Middleware Status Chip Watson ILDG-6 Workshop May 12, 2005.
MS Access 2007 Management Information Systems 1. Overview 2  What is MS Access?  Access Terminology  Access Window  Database Window  Create New Database.
Digital curation activities enhance access and retrieval, maintain quality, add value, and facilitate use and re-use over time. This poster demonstrates.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
The NGS Grid Portal David Meredith NGS + Grid Technology Group, e-Science Centre, Daresbury Laboratory, UK
Getting started DIRAC Project. Outline  DIRAC information system  Documentation sources  DIRAC users and groups  Registration with DIRAC  Getting.
1 Chapter 20 – Data sources and datasets Outline How to create a data source How to use a data source How to use Query Builder to build a simple query.
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
FRANEC and BaSTI grid integration Massimo Sponza INAF - Osservatorio Astronomico di Trieste.
Metadata Input Tool for CADIS Scientists and Data Managers by D. Stott August 8, 2007.
Replica Management Kelly Clynes. Agenda Grid Computing Globus Toolkit What is Replica Management Replica Management in Globus Replica Management Catalog.
August 2003 At A Glance The IRC is a platform independent, extensible, and adaptive framework that provides robust, interactive, and distributed control.
Bookkeeping Tutorial. 2 Bookkeeping content  Contains records of all “jobs” and all “files” that are produced by production jobs  Job:  In fact technically.
AHM04: Sep 2004 Nottingham CCLRC e-Science Centre eMinerals: Environment from the Molecular Level Managing simulation data Lisa Blanshard e- Science Data.
ATLAS-specific functionality in Ganga - Requirements for distributed analysis - ATLAS considerations - DIAL submission from Ganga - Graphical interfaces.
May 2005 PPARC e-Science PG School1 QCDgrid Chris Maynard A Grid for UKQCD National collaboration for lattice QCD.
1 Egrid portal Stefano Cozzini and Angelo Leto. 2 Egrid portal Based on P-GRADE Portal 2.3 –LCG-2 middleware support: broker, CEs, SEs, BDII –MyProxy.
INFSO-RI Enabling Grids for E-sciencE Using of GANGA interface for Athena applications A. Zalite / PNPI.
The NGS Grid Portal David Meredith NGS + Grid Technology Group, e-Science Centre, Daresbury Laboratory, UK
Application Web Service Toolkit Allow users to quickly add new applications GGF5 Edinburgh Geoffrey Fox, Marlon Pierce, Ozgur Balsoy Indiana University.
Manchester Computing Supercomputing, Visualization & eScience Seamless Access to Multiple Datasets Mike AS Jones ● Demo Run-through.
Holding slide prior to starting show. Lessons Learned from the GECEM Portal David Walker Cardiff University
STAR Scheduler Gabriele Carcassi STAR Collaboration.
: Information Retrieval อาจารย์ ธีภากรณ์ นฤมาณนลิณี
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Product Training Program
Business rules.
Stephen Burke, PPARC/RAL Jeff Templon, NIKHEF
Presentation transcript:

QCDgrid User Interfaces James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh

QCDgrid Summary QCDgrid project is developing a data and compute grid for scientists in the UKQCD collaboration –data storage grid has been up and running for some months now –job submission system is in early stages of development –software developed is released as open source –builds on Globus 2.0, eXist XML database and various other technologies For more information on the project in general, see Lornas talk this afternoon!

User Requirements User interface is naturally driven by users requirements –most QCDgrid users have a good understanding of computers –for them, advanced scripting capabilities are more important than user-friendly GUIs –powerful command line interface is top priority for QCDgrid software GUIs also useful for some operations –for example, searching and browsing the metadata catalogue C and Java APIs facilitate integration with software in many different programming languages

Datagrid Interfaces Data management aspect of grid consists of two distinct parts Low level data replication deals with files themselves –files at this level are just blocks of binary data – they could contain anything –Globus replica catalogue maps logical filenames to actual physical locations Metadata catalogue associates some meaningful, structured information with each file –allows users to search for data more easily –maps interesting characteristics of data (structured as XML) to logical filenames

Low Level Data Grid Interface Low level operations provided by command line tools and C API –Java interface using JNI also available –SRM-compliant interface to some functionality Fairly small set of basic operations –put a file or directory on the grid –get a file or directory from the grid –delete a file or directory –list files on grid –register interest in a file or directory User must have a valid Globus proxy initialised

Data Grid Example Commands Some example data grid commands: put-file-on-qcdgrid /home/username/myfile gridfile - puts the local file myfile onto the grid under logical name gridfile - replication software will take care of deciding where to store the file, adding replica catalogue entries, etc. get-file-from-qcdgrid -R griddir /tmp/mydir - gets directory griddir from the grid, storing it in local directory /tmp/mydir - -R switch means recursive, works with most QCDgrid commands

More Example Commands More example commands... qcdgrid-list - lists all files on grid by logical name i-like-this-file interestingdata - registers interest in the file with logical name interestingdata - replication system takes this into account, tries to store files close to where they are most often wanted qcdgrid-delete olddata - removes all copies of the file olddata from grid

Data Grid APIs APIs provide similar functionality –Example: QCDgridClient grid = QCDgridClient.getClient(true); String logicalFile = gridfile; File physicalFile = new File(localfile); grid.getFile(logicalFile, physicalFile);

Metadata Problem: logical file names may not be meaningful –users may have trouble finding data Solution: metadata catalogue –associate some meaningful information with each file on the grid –including date produced, machine used, code used, actual physical parameters –users can then search on these fields –metadata is XML, stored in eXist XML database –queried using XPath query language Command line, GUI and Java interfaces (via standard XMLDB API) available

Metadata Interface: Commands Command line functionality currently limited to 3 operations –submit –remove –update schema Examples: java QCDgridMetadataClient localhost:8080/exist \ updateSchema newschema.xsd java QCDgridMetadataClient localhost:8080/exist \ submit newfile.xml newdocumentid

Metadata Interface: GUI Metadata browser GUI allows users to easily search for the data they want - XPath queries can be built using simple graphical input methods - GUI generated automatically from current schema - when schema is updated, GUI updates itself - matching data can be easily retrieved from the grid

Searching MDC, Step 1 Main browser window gives a list of saved queries –these are stored in the users profile –support for libraries of queries is planned

Searching MDC, Step 2 Creating a new query –first a node in the XML document structure must be selected from the tree –tree is automatically generated from schema when browser starts up –e.g. to find all the data produced on a certain date, user should select the date node

Searching MDC, Step 3 Once node has been selected, predicate must be specified –this is just an XPath term for criteria for matching node data –predicate can be entered as raw XPath if desired –most users will want to make use of form to simplify process

Searching MDC, Step 3 cont... More complex queries can be created relatively easily –in this example, the query is extended to search for data from 2 years –for most queries, knowledge of XPath is not required

Searching MDC, Step 4 New query now appears on list –from here, queries can be managed –queries can be combined together –or can be submitted to the database backend

Searching MDC, Step 5 Matching metadata documents are displayed –XML is parsed into easy-to-read, expandable tree format –corresponding data files can be fetched from grid at the press of a button

Job Submission QCDgrid job submission still in very early stages As with data management, users require command interface that can be used from scripts –integration with data grid will simplify user interface –unlike plain Globus, job input, output and error streams can be redirected to and from the users console –this allows for interactive jobs on the grid – useful for debugging etc. GUI or web portal interface may be added later

Example Commands Early prototype of job submission software is up and running Syntax quite similar to globus-job-run –Example commands: qcdgrid-job-submit qcdtest.epcc.ed.ac.uk \ /bin/date qcdgrid-job-submit doorstopper.epcc.ed.ac.uk \ /usr/bin/program arg1 arg2 arg3 \ --fetch-from-qcdgrid gridName localName

Administrator Interface Previous slides have focussed on normal end users experience QCDgrid software also provides tools to aid in administration –commands to add and remove grid nodes, and change the state of existing nodes –commands for building and maintaining the Globus replica catalogue –commands for maintaining directory of grid users Admin GUI to integrate many of these functions is a possibility

Some Admin Commands Administrators are identified by their certificate subjects –Must have a valid proxy with subject listed in the config file before executing these commands add-qcdgrid-node newnode.ed.ac.uk Edinburgh \ /home/qcdgrid disable-qcdgrid-node notworking.ed.ac.uk verify-qcdgrid-rc setup-security.sh adduser James Perry \ /O=certificate/O=subject/CN=jtp \

Interface Summary Low level data grid has command line interface and APIs Metadata catalogue mainly accessed through browser GUI –this also integrates with low level data grid Job submission currently usable from command line only –possible GUI/web portal in future Various admin tools exist or are in development Better integration of the different parts of the project is planned