Project tracking system for the structure solution software pipeline

Slides:



Advertisements
Similar presentations
Software Quality Assurance Plan
Advertisements

1 Information Systems Development (ISD) Systems Development Life Cycle Overview of Analysis Phase Overview of Design Phase CP2236: Information Systems.
26-28 th April 2004BioXHIT Kick-off Meeting: WP 5.2Slide 1 WorkPackage 5.2: Implementation of Data management and Project Tracking in Structure Solution.
©Ian Sommerville 2000 Software Engineering, 6th edition. Chapter 7 Slide 1 System models l Abstract descriptions of systems whose requirements are being.
Peter J. Briggs, Liz Potterton *, Pryank Patel, Alun Ashton, Charles Ballard, Martyn Winn CLRC Daresbury Laboratory, Warrington, Cheshire WA4 4AD, UK *
Presented by Abirami Poonkundran.  Introduction  Current Work  Current Tools  Solution  Tesseract  Tesseract Usage Scenarios  Information Flow.
Authors Project Database Handler The project database handler dbCCP4i is a small server program that handles interactions between the job database and.
Databases and Database Management Systems
28 th March 2007CCP4 Developers Meeting BIOXHIT/CCP4(i) Database Wanjuan Yang & Peter Briggs.
CCP4(i) Database Development Wanjuan (Wendy) Yang CCP4 Annual developer’s meeting March 28, 2006 York.
Chapter 9 Moving to Design
1 st -4 th December st BioXHIT Annual Meeting WorkPackage 5.2: Implementation of Data management and Project Tracking in Structure Solution Peter.
Section 05Concepts Of DBMS1 HSQ - DATABASES & SQL And Franchise Colleges 05 Concepts of DBMS By MANSHA NAWAZ.
28-29 th March 2006CCP4 Automation STAB MeetingCCP4i and Automation 1 CCP4i and Automation : Opportunities and Limitations Peter Briggs, CCP4.
 Three-Schema Architecture Three-Schema Architecture  Internal Level Internal Level  Conceptual Level Conceptual Level  External Level External Level.
17 th October 2005CCP4 Database Meeting (York) CCP4(i)/BIOXHIT Database Project: Scope, Aims, Plans, Status and all that jazz Peter Briggs, Wanjuan Yang.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Budapest, September 5th, 2002 DataGrid Accounting System DGAS Current status & plans Stefano Barale INFN Budapest, September.
Developments with CCP4i & the Database Handler Peter Briggs.
Project Database Handler The Project Database Handler dbCCP4i is a brokering application that mediates interactions between the project database and an.
26-28 th April 2004BioXHIT Kick-off Meeting1 Task 4.7.1: Automation in Computation Peter Briggs, CCP4.
Data Harvesting: automatic extraction of information necessary for the deposition of structures from protein crystallography Martyn Winn CCP4, Daresbury.
Project Database Handler The Project Database Handler is a brokering application that mediates interactions between the project database and the external.
17 th October 2005CCP4 Database Meeting York University Database Requirements for CCP4 Projects Monday 17 th October 2005 Abstract Gather information on.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Project Database Handler The Project Database Handler is a brokering application, which will mediate interactions between the project database and other.
28 th May 2005CCP4 Workshop ACA 2005 Orlando FL 1 WK04: A Protein Crystallographic Toolbox: The CCP4 Software Suite ACA 2005 Orlando May 28th 2005.
Software automation – What STAB sees as key aims? 1.Brief review of activities and recommendations (so far) 2.Reality checks 3. Things to do…
Irwin/McGraw-Hill Copyright © 2000 The McGraw-Hill Companies. All Rights reserved Whitten Bentley DittmanSYSTEMS ANALYSIS AND DESIGN METHODS5th Edition.
Future plans & strategy for CCP4 (for discussion) Tadeusz Skarzynski, 30 March 2006.
Peter J. Briggs, Alun Ashton, Charles Ballard, Martyn Winn and Pryank Patel CCLRC Daresbury Laboratory, Warrington, Cheshire WA4 4AD, UK The CCP4 project.
Project Database Handler The Project Database Handler is a brokering application which will mediate interactions between the project database and other.
Engineering, 7th edition. Chapter 8 Slide 1 System models.
Information System Applications
Architecture Review 10/11/2004
Linked Open Data Approaches within the ARIADNE project
Introduction To DBMS.
REV 00 Chapter 2 Database Environment DDC DATABASE SYSTEM.
The Components of Information Systems
REV 00 Chapter 2 Database Environment DDC DATABASE SYSTEM.
Component and Deployment Diagrams
Computer Aided Software Engineering (CASE)
Introduction This seminar is intended to give people an overall view of the capabilities of the Scientific Programme Management System (SPMS), to explain.
System Design.
CCP4 6.1 and beyond: Tools for Macromolecular Crystallography
Database Requirements for CCP4 17th October 2005
Abstract descriptions of systems whose requirements are being analysed
CH#3 Software Designing (Object Oriented Design)
Modernizing web service standards: The next version of WFS
Chapter 2 Database Environment Pearson Education © 2009.
Chapter 2: Database System Concepts and Architecture
The Components of Information Systems
Unified Modeling Language
Systems Analysis and Design in a Changing World, 6th Edition
CCP4-PDB Workshop ACA 2004 Chicago
Analysis models and design models
An Introduction to Software Architecture
Chapter 7 –Implementation Issues
Background: Currently CCP4i puts each structure determination into a separate project directory, and automatically keeps a “Project History Database” recording.
Chapter 2 Database Environment Pearson Education © 2014.
Requirements Document
Chapter 5 Architectural Design.
Database System Concepts and Architecture
Information Systems Development (ISD) Systems Development Life Cycle
Information System Building Blocks
From STAB teleconference minutes
Chapter 2: Building a System
UML Design for an Automated Registration System
N6: A Protein Crystallographic Toolbox:
SDMX IT Tools SDMX Registry
Presentation transcript:

Project tracking system for the structure solution software pipeline Progress in WP 5.2: BioXHIT Data Management for PX Structure Determination Software Pipelines Peter Briggs and Wanjuan (Wendy) Yang, CCLRC Daresbury Laboratory, Warrington WA4 4AD, UK Introduction A key part of the integrated technology platform being delivered by the BIOXHIT project is the development of automated structure determination software pipelines that cover the post-data collection stages of structure solution by protein X-ray crystallography (PX). These pipelines need to accurately record and track the data that they produce, both for their own operation and for final deposition of the determined structures. BIOXHIT Workpackage 5.2 is principally concerned with developing a system for performing this data management in order to address the needs of software pipelines. This poster reports on the progress that has been made by CCP4 (BIOXHIT Partner 10) towards this end in the last year. Project Contributors Workpackage 5.2 is co-ordinated by the Collaborative Computational Project No4 (CCP4), which provides a software suite for macromolecular structure determination by X-ray crystallography. Funding for the work described here has been provided by the European Commission as part of the BIOXHIT project. The handler and database components are being developed with contributions from the developers of the CCP4 Automation (HAPPy), XIA/e-HTPX and CRANK Projects; discussions have also taken place with the PIMS project and the MX beamlines at DIAMOND. The scope of collaboration will be broadened as part of the development of the version 1 database schema described below. Project tracking system for the structure solution software pipeline Components of the system Project database handler Database for Project Data & Tracking Visualisation tools These components and their relationships are shown schematically in the figure (right), and are described in more detail in the sections below. Key considerations Implement a system for both manual and automated structure determination Allow multiple database back-ends Gather as much information from client programs as possible automatically Open architecture accommodating heterogenous software components Project Database Handler CCP4 applications Non-CCP4 applications Project database CCP4i user interface Project Data Visualiser Other databases (PIMS, beamlines) Project Database Handler The Project Database Handler is a brokering application that mediates interactions between the project database and the external applications and databases (local or remote). It acts as a single point of access to the data for external applications and hides the implementation of the database from them. Communications between the handler and the API are encoded in XML. Version 1 of the handler was developed in Python and acted as a testbed used with the XIA and HAPPy pipelines. The code can be obtained from ftp://ftp.ccp4.ac.uk/pjx/bioxhit/ Work is also ongoing to integrate the handler into CCP4i (the CCP4 graphical user interface). Database for Project Data & Tracking A database is being designed and implemented which will be capable of storing both project data (the information used by each step in a pipeline) and project history (the steps taken and the provenance and evolution of information as the project progresses). As a result of this work a more detailed database architecture is envisaged, with three components: Knowledge base: consisting of the common crystallographic data items used in the software pipeline that are shared between different applications. This will link to external databases (e.g. PIMS and beamlines) as well as providing data for deposition, and will be consistent with the standards in WP5.1. Operational database: containing application-specific data and representations (for example parameter files or Python objects) that are not intended to be shared between applications. Visualisation Tools These tools will provide interfaces to the database, to display the project data in selective views and thus focus on particular aspects of the data-flow or logical flow – for example, as work flow diagrams. Prototype tools (below) have been developed based on the existing CCP4i project database, and a “project data explorer” based on the prototype minimal database schema. Version 1 of the handler was developed using a “minimal database” (see right) implemented in MySQL, to explore the requirements of the full system. Applications talk to the handler via a “client API” which will be implemented in different programming languages (see left). Next steps Over the next 12 months the aims are to: Improve the Python handler’s functionality and robustness and integrate it into CCP4i Specify and implement the full knowledge base and project tracking databases with input from other BIOXHIT Partners Develop and release the version 1 visualiser application. Tracking database: storing the history of the data generation in the knowledge and operational databases. Work is now ongoing to develop a MySQL schema using DBDesigner 4 for the knowledge base and tracking databases (see left). See http://www.ccp4.ac.uk/projects/bioxhit.html for more information