Worldwide Protein Data Bank www.wwpdb.org wwPDB Common D&A Project November 24, 2009 November 24, 2009 Steering Committee Project Update.

Slides:



Advertisements
Similar presentations
© Copyright 2007 Exempler Telecom Test Automation System Exempler - We pride ourselves with providing lightweight robust engineering solutions.
Advertisements

A Workflow Engine with Multi-Level Parallelism Supports Qifeng Huang and Yan Huang School of Computer Science Cardiff University
New Release Announcements and Product Roadmap Chris DiPierro, Director of Software Development April 9-11, 2014
Business logic for annotation workflow Tom Oldfield July 21, 2010.
Beta Testing: The Contractor’s Perspective Trns·port User Group Meeting October 2005.
Software Quality Assurance Plan
HP Quality Center Overview.
Systems Analysis and Design in a Changing World
Software Modeling SWE5441 Lecture 3 Eng. Mohammed Timraz
The MEMOPS Programming Framework Wayne Boucher, Cambridge
APPLICATION DEVELOPMENT BY SYED ADNAN ALI.
Ch 12 Distributed Systems Architectures
8 Systems Analysis and Design in a Changing World, Fifth Edition.
The project plan. December 16, Agenda The project plan –Risks –Language decision –Schedule –Quality plan –Testing –Documentation Program architecture.
Understanding and Managing WebSphere V5
Professional Informatics & Quality Assurance Software Lifecycle Manager „Tools that are more a help than a hindrance”
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
1.Database plan 2.Information systems plan 3.Technology plan 4.Business strategy plan 5.Enterprise analysis Which of the following serves as a road map.
 A project is “a unique endeavor to produce a set of deliverables within clearly specified time, cost and quality constraints”
PowerPoint Presentation for Dennis, Wixom, & Tegarden Systems Analysis and Design with UML, 3rd Edition Copyright © 2009 John Wiley & Sons, Inc. All rights.
JWST Integrated Modeling Environment James Webb Space Telescope.
GRID job tracking and monitoring Dmitry Rogozin Laboratory of Particle Physics, JINR 07/08/ /09/2006.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
Worldwide Protein Data Bank wwPDB Common D&A Project January 28, 2010 Steering Committee Project Update.
CEN th Lecture CEN 4021 Software Engineering II Instructor: Masoud Sadjadi Software Project Planning.
1.  Project: temporary endeavor to achieve some specific objectives in a defined time  Project management ◦ Dynamic process ◦ Controlled and structured.
 Chapter 6 Architecture 1. What is Architecture?  Overall Structure of system  First Stage in Design process 2.
Webster Visualize Webster Financial Team Visual Scrumware Joe Andrusyszyn Mark Bryant Brian Hannan Robert Songer.
OOI CI LCA REVIEW August 2010 Ocean Observatories Initiative OOI Cyberinfrastructure Architecture Overview Michael Meisinger Life Cycle Architecture Review.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
10/25/20151 Single Sign-On Web Service Supervisors: Viktor Kulikov Alexander Sherman Liana Lipstov Pavel Bilenko.
Introduction to Making Multimedia
INFO 424 Team Project Practicum Week 2 - Launch report, Project tracking, Review report Glenn Booker Notes largely from Prof. Hislop.
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
NMI End-to-End Diagnostic Advisory Group BoF Fall 2003 Internet2 Member Meeting.
17 th October 2005CCP4 Database Meeting (York) CCP4(i)/BIOXHIT Database Project: Scope, Aims, Plans, Status and all that jazz Peter Briggs, Wanjuan Yang.
9 Systems Analysis and Design in a Changing World, Fourth Edition.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Data Integration and Management A PDB Perspective.
Project Database Handler The Project Database Handler dbCCP4i is a brokering application that mediates interactions between the project database and an.
A university for the world real R © 2009, Chapter 9 The Runtime Environment Michael Adams.
11 CORE Architecture Mauro Bruno, Monica Scannapieco, Carlo Vaccari, Giulia Vaste Antonino Virgillito, Diego Zardetto (Istat)
Project Database Handler The Project Database Handler is a brokering application that mediates interactions between the project database and the external.
Migrating Desktop Bartek Palak Bartek Palak Poznan Supercomputing and Networking Center The Graphical Framework.
DGC Paris WP2 Summary of Discussions and Plans Peter Z. Kunszt And the WP2 team.
1 Registry Services Overview J. Steven Hughes (Deputy Chair) Principal Computer Scientist NASA/JPL 17 December 2015.
Workforce Scheduling Release 5.0 for Windows Implementation Overview OWS Development Team.
Architecture View Models A model is a complete, simplified description of a system from a particular perspective or viewpoint. There is no single view.
Mantid Stakeholder Review Nick Draper 01/11/2007.
Worldwide Protein Data Bank Common D&A Project Sequence Processing Modular Demo May 6, 2010 Project Deliverable.
23/2/2000Status of GAUDI 1 P. Mato / CERN Computing meeting, LHCb Week 23 February 2000.
Condor Technology Solutions, Inc. Grace Performance Chemicals HRIS Intranet Project.
Stages of design  High level design  High level data structure  Architecture  Low level design-code design  Algorithms  Low level data structures.
Worldwide Protein Data Bank wwPDB Common D&A Project Full Project Team Meeting Rutgers March 16-19, 2010.
1 ILE Project Integrated Logistics Environment Kickoff Meeting NPDI Project & SCIM Summary & Status Presented by: Rick Lobsitz (NGTS)
Tool Integration with Data and Computation Grid “Grid Wizard 2”
Oman College of Management and Technology Course – MM Topic 7 Production and Distribution of Multimedia Titles CS/MIS Department.
Software Development Process CS 360 Lecture 3. Software Process The software process is a structured set of activities required to develop a software.
T Project Review RoadMappers I2 Iteration
Your Interactive Guide to the Digital World Discovering Computers 2012 Chapter 12 Exploring Information System Development.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
March 2004 At A Glance The AutoFDS provides a web- based interface to acquire, generate, and distribute products, using the GMSEC Reference Architecture.
PDS4 Project Report PDS MC F2F University of Maryland Dan Crichton March 27,
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Operations Portal Development Update on Requirements Cyril L'Orphelin IN2P3/CNRS.
/16 Final Project Report By Facializer Team Final Project Report Eagle, Leo, Bessie, Five, Evan Dan, Kyle, Ben, Caleb.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks GOCDB4 Gilles Mathieu, RAL-STFC, UK An introduction.
Chapter 2- Software Development Process  Product Components  Software Project Staff  Software Development Lifecycle Models.
CS 501: Software Engineering Fall 1999
Chapter 10 Development of Multimedia Project
Overview Activities from additional UP disciplines are needed to bring a system into being Implementation Testing Deployment Configuration and change management.
Presentation transcript:

Worldwide Protein Data Bank wwPDB Common D&A Project November 24, 2009 November 24, 2009 Steering Committee Project Update

Worldwide Protein Data Bank Common D&A Project January 2010 Deliverables Update report D&A Team Charge for end of January 2010: Deliver production functionality that will provide a significant impact on the annotation workflow. Agenda: 1.Deliverables 2.Accomplishments 3.What’s keeping us/you up at night 4.Timeline overview

Worldwide Protein Data Bank Common D&A Project January 2010 Deliverables Functional Deliverables  Implement the chemical and model coordinate sequences issue resolution and integration using the Master Format.  Provide an annotator graphical interface to resolve discrepancies.  Implement the capability to repeat an incremental process step (GO BACK) under conditions such as –Replacement coordinates packaged in mmCIF or PDB formats –Replacement coordinates with updated sequence –Replacement chemical sequence  Integration of these new functionalities into the existing workflows.

Worldwide Protein Data Bank Common D&A Project January 2010 Deliverables Deliverable Details  Finalization of Physical Data Exchange  Annotator graphical interface for sequence functionality  Master Format.  Extended API  Tracking DB support  Extended Work Flow Engine (WFE)  Work Flow Manager (WFM)  Work Flow Manager User Interface (WFM UI)  Integration of this “module” of new functionalities into the existing workflows.

Worldwide Protein Data Bank Common D&A Project January 2010 Deliverables Physical Data Exchange  All sites have acquired NetApp hardware for this project  The version of NetApp software compatible with all sites has been determined.  A simplified secure protocol for NetApp communication has been found which avoids the need for extra networking hardware.  When the release candidate for the NetApp operating system is finalized as general release, in December, all sites will be on the same page for data exchange.

Worldwide Protein Data Bank Common D&A Project January 2010 Deliverables Process Overview With GO BACK functionality

Worldwide Protein Data Bank Common D&A Project January 2010 Deliverables Deliverable: Annotator Interface A graphical interface for resolution of structural features Requirements for display and editing by Annotation staff, including 3D visualization Resource allocation: RCSB Technical design: JavaScript/AJAX + CSS User prototype review Stress tested prototype with very large sequences  User testing functional prototype (begins Dec 15)  Integration with current systems using Master Format (Jan15)  In Use by annotators by Jan 28.  Integrate with new system (WFE, WFM, API) March

Worldwide Protein Data Bank Common D&A Project January 2010 Deliverables Design Convergence – Master Format, API, WFM, WFE, UI  Distributed development on a complex project is challenging, but we are managing  Reached consensus on critical project technologies – –Master format & workflow schema –Project identifiers –Python implementation –Division of effort among programming layers –Passing communication and control of between computational and interactive workflows –Requirements and technology platform for sequence editor + 3D viewer

Worldwide Protein Data Bank Common D&A Project January 2010 Deliverables Deliverable update: MASTER FORMAT  A single data dictionary for the project based on the PDB Exchange Dictionary (PDBx). (John) PDBx extended for Common D&A project (deposition data set identifier, WF class ID, WF instance ID, Site ID, Version ID)  PDBx (mmCIF syntax) data file format will be used as a working format for PDB annotation. (Zukang) Translation between RCSB and PDBx tested with Maxit Conversion tool for PDB to PDBx completed PDBx mapping CIF to PDB within Maxit – ready for testing

Worldwide Protein Data Bank Common D&A Project January 2010 Deliverables Data and Application API Design  Unified Python language implementation  Provides all access to data and applications for the workflow manager and workflow engine  Subcomponents of the API provide access to: –Data objects and data values –Applications and tools –Tracking and status information –Site level configuration information

Worldwide Protein Data Bank Common D&A Project January 2010 Deliverables Deliverable update: Extended API  Site Configuration API Configuration: Division of processing responsibility between the workflow engine and the API decided.  Workflow Engine/Manager (12/15, Luana)  Add sequence data methods (11/25, Vladimir, John)  Solution for identifying and finding things Archival data files Transient files required by workflows for data processing Versioning of data files and key data values within files Progress and tracking workflows  MySQL support of tracking (12/4, Li)  Application integration with API and WFE (12/4, Vladimir)

Worldwide Protein Data Bank Common D&A Project January 2010 Deliverables Deliverable update: WFE  Final design – core API communication protocols Internal object representation Final design – XML schema created (description of WF) WFE can process revised WF definitions  Test suites  Engine development (12/23, Tom)  Integration with API, data model, WFM (12/23, Tom)

Worldwide Protein Data Bank Common D&A Project January 2010 Deliverables Deliverable update: WFM Design Functional Architectural design  Will present progress and tracking information  Will start/stop and restart the workflow engine in executing data processing tasks  Will work in a fully distributed web-based mode  Will provide a launch point for tasks requiring interactive or graphical interactions. Two modes defined – Immediate mode – all processing occurs in a single session (simple case). Deferred mode – requests for input are registered with the workflow manager for later processing by annotator

Worldwide Protein Data Bank Common D&A Project January 2010 Deliverables Deliverable update: WFM UI  WFM – Annotator UI (Luana)  Requirements (12/3) annotator team)  Design (12/10)  Development (1/15)  WFM Development (1/21, Luana)  Integration with WFE, API (2/4, Vladimir, Luana)  User Testing (2/28, all)

Worldwide Protein Data Bank Common D&A Project January 2010 Deliverables Deliverable: GO BACK FUNCTIONALITY Master Format  Workflow execution environment (WFE, WFM)  Session management and tracking infrastructure

Worldwide Protein Data Bank Common D&A Project January 2010 Deliverables Things that have kept us up at night  These are cornerstone deliverables requiring intense study and design consideration – beyond the proof of concept. –Organization of data, communication protocols, etc. –Clear consensus of design features has required an evolution of understanding – requiring wetting of hands  Ramp up of skill sets: Python, mmCIF (PDBe),  EBI External services: web-service set up  Site specific integration challenges  Resource issues

Worldwide Protein Data Bank Common D&A Project January 2010 Deliverables Good News (from your local PM)  Team is VERY FUNCTIONAL –A lot has been accomplished despite distributed team members and multi-tasking resources  Consensus on difficult issues – starting at considerable philosophical distances has been achieved! –No bloodshed to date – all limbs in tact  Team is still highly motivated to succeed with this project!

Worldwide Protein Data Bank Common D&A Project January 2010 Deliverables Timeline Summary  Functional Interface –integrated in existing systems January 15, 2010 –In use by annotators by January 28, 2010  Full Integration of WFM UI with WFE, WFM and API February 4, 2010  Testing completed by February 28, 2010

Worldwide Protein Data Bank Common D&A Project January 2010 Deliverables PDBe integration  There are significant changes to the PDBe annotation –PDBe data model -> D & A data model – import –Load D & A data model with status and domain data –Start web services/connect to web resources  External services at EBI –Run workflows  Implement programs at PDBe –Export data from D & A data model to PDBe data model –Requires Glen who will be away for December to integrate path