Towards a Structural Biology Work Bench Chris Morris, STFC.

Slides:



Advertisements
Similar presentations
GRADD: Scientific Workflows. Scientific Workflow E. Science laboris Workflows are the new rock and roll of eScience Machinery for coordinating the execution.
Advertisements

CASDA Project Management A presentation to the CASDA Preliminary Design Review IM&T / CASS Dan Miller | CASDA Project Manager 11 March 2014.
© 2007 IBM Corporation Enterprise Content Management Integrating Content, Process, and Connectivity for Competitive Advantage Malcolm Holden October 2007.
Slide: 1 Welcome to the workshop ESRFUP-WP7 User Single Entry Point.
ASCR Data Science Centers Infrastructure Demonstration S. Canon, N. Desai, M. Ernst, K. Kleese-Van Dam, G. Shipman, B. Tierney.
1 Technical Developments Related to Quality Issues Brian Kelly UK Web Focus UKOLN University of Bath Bath, BA2 7AY
Aug. 20, JPL, SoCalBSI '091 The power of bioinformatics tools in cancer research Early Detection Research Network, JPL Mentors: Dr. Chris Mattmann,
© , Michael Aivazis DANSE Software Issues Michael Aivazis California Institute of Technology DANSE Software Workshop September 3-8, 2003.
Chapter 9 e-Commerce Systems.
Anthony Atkins Digital Library and Archives VirginiaTech ETD Technology for Implementers Presented March 22, 2001 at the 4th International.
A National Resource Working in the Public Interest © 2006 The MITRE Corporation. All rights reserved. KM at MITRE Jean Tatalias KM TEM, December 2007.
Page 1 MODEL TEST in the small GENERALIZE PROGRAM PROCESS allocated maintenance changes management documents initial requirement project infrastructure.
Using the WS-PGRADE Portal in the ProSim Project Protein Molecule Simulation on the Grid Tamas Kiss, Gabor Testyanszky, Noam.
Recordkeeping for Good Governance Toolkit Digital Recordkeeping Guidance Funafuti, Tuvalu – June 2013.
Integrated e-Infrastructure for Scientific Facilities Kerstin Kleese van Dam STFC- e-Science Centre Daresbury Laboratory
The role of Parthenos for CLARIN ERIC Steven Krauwer CLARIN ERIC Executive Director 1.
1 INFRA : INFRA : Scientific Information Repository supporting FP7 “The views expressed in this presentation are those of the author.
Partners: *Funded by the EC as a SSA (specific support action) FESP Forum for European Structural Proteomics Lucia Banci CERM,Italy.
Authors Project Database Handler The project database handler dbCCP4i is a small server program that handles interactions between the job database and.
U.S. Department of the Interior U.S. Geological Survey CDI Webinar Sept. 5, 2012 Kevin T. Gallagher and Linda C. Gundersen September 5, 2012 CDI Science.
Configuration Management (CM)
Caring and Sharing Collaboration in Digital Curation outside North America Ross Harvey Simmons College, Boston Curation Matters: 17 June 2010.
EBI is an Outstation of the European Molecular Biology Laboratory. A web service for the analysis of macromolecular interactions and complexes PDBe Protein.
Colour of Ocean Data, Brussels, November 2002 Colour of Ocean Data: Discussion Panel Lesley Rickards British Oceanographic Data Centre.
Context and Linking in the Research Lifecycle CERIF and other standards Catherine Jones Scientific Information Group Scientific Computing Department STFC.
Protein Molecule Simulation on the Grid G-USE in ProSim Project Tamas Kiss Joint EGGE and EDGeS Summer School.
Joint agINFRA & SCI-BUS workshop, 30/05/2013, Budapest, Hungary FP 7-INFRASTRUCTURES programme agINFRA Joint agINFRA & SCI-BUS workshop agINFRA.
Distributed Aircraft Maintenance Environment - DAME DAME Workflow Advisor Max Ong University of Sheffield.
A portal interface to my Grid workflow technology Stefan Rennick Egglestone University of Nottingham
EBI is an Outstation of the European Molecular Biology Laboratory. A web service for the analysis of macromolecular interactions and complexes PDBe Protein.
The Swiss Grid Initiative Context and Initiation Work by CSCS Peter Kunszt, CSCS.
Data Integration and Management A PDB Perspective.
Recent Developments in CLARIN-NL Jan Odijk P11 LREC, Istanbul, May 23,
© 2006 STEP Consortium ICT Infrastructure Strand.
E-HTPX: A User Perspective Robert Esnouf, University of Oxford.
The Astronomy challenge: How can workflow preservation help? Susana Sánchez, Jose Enrique Ruíz, Lourdes Verdes-Montenegro, Julian Garrido, Juan de Dios.
ESFRI & e-Infrastructure Collaborations, EGEE’09 Krzysztof Wrona September 21 st, 2009 European XFEL.
Electronic labnotes Mari Wigham COMMIT/. Information WUR  Organising, sharing, finding and reusing data  Expertise in: ● Modelling data.
Project Database Handler The Project Database Handler is a brokering application that mediates interactions between the project database and the external.
Digital Design of Materials: Some Thoughts for a Closing Group Discussion Andrei Ruckenstein Boston University.
1 COMPUTER SCIENCE DEPARTMENT COLORADO STATE UNIVERSITY 1/9/2008 SAXS Software.
Project Database Handler The Project Database Handler is a brokering application, which will mediate interactions between the project database and other.
Simplified Experiment Submit Proposal Results Excited Users Do Expt Data Analysis Feedback.
High throughput biology data management and data intensive computing drivers George Michaels.
Thomas Gutberlet HZB User Coordination NMI3-II Neutron scattering and Muon spectroscopy Integrated Initiative WP5 Integrated User Access.
© 2013 IBM Corporation Accelerating Product and Service Innovation Service Virtualization Testing in Managed Environments Michael Elder, IBM Senior Technical.
Resource Optimization for Publisher/Subscriber-based Avionics Systems Institute for Software Integrated Systems Vanderbilt University Nashville, Tennessee.
West-Life: A VRE for Structural Biology Alexandre Bonvin, Utrecht University Chris Morris STFC EGI Community Forum Bari, November
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No West-Life.
A worldwide e-Infrastructure and Virtual Research Community for NMR and structural biology Alexandre M.J.J. Bonvin Project coordinator Bijvoet Center for.
E-infrastructure requirements from the ESFRI Physics, Astronomy and Analytical Facilities cluster Provisional material based on outcome of workshop held.
Afternoon session: The archival problem and infrastructure for solutions Prof John R Helliwell Interactive Publications.
Project Database Handler The Project Database Handler is a brokering application which will mediate interactions between the project database and other.
A Competence Center to Serve Translational Research from Molecule to Brain Alexandre M.J.J. Bonvin MoBrain CC coordinator Bijvoet Center for Biomolecular.
Page 1 Protein Crystallization and Structure Determination ---by Creative BiostructureCreative Biostructure.
PDBe Protein Interfaces, Surfaces and Assemblies
MIRACLE Cloud-based reproducible data analysis and visualization for outputs of agent-based models Xiongbing Jin, Kirsten Robinson, Allen Lee, Gary Polhill,
Scientific Computing Department
The Life Cycle of Structural Biology Data: A discipline with a culture of sharing Chris Morris, STFC DI4R Sept 2016.
POW MND section.
Research Data Context Preservation in SCAPE
Joseph JaJa, Mike Smorul, and Sangchul Song
CCP4 6.1 and beyond: Tools for Macromolecular Crystallography
A portal interface to myGrid workflow technology
Project tracking system for the structure solution software pipeline
Computing & Data Resources Application interface
Common Solutions to Common Problems
West-Life Chris Morris, STFC, UK
Brian Matthews STFC EOSCpilot Brian Matthews STFC
New Platform to Support Digital Humanities in the Czech Republic
Presentation transcript:

Towards a Structural Biology Work Bench Chris Morris, STFC

Structural Biologists are mature computer users First use of digital computers in 1940s Combined data rate for European structural experiments > LHC Rate will double with XFEL Protein Data Bank New entries by year (log)

New scientific goals My targets include: (313 responses) New PDB entries, %

New experimental methods Mean 3.9 techniques / respondent –Biologists, not technique experts Small samples Data noisy and incomplete 324 respondentsOther techniques used in past year: X-ray diffraction NMR Spectroscopy Electron Microscopy SAXSModellingCDSPRCalorimetryTotal Preferred technique: X-ray diffraction NMR spectroscopy Electron microscopy SAXS Modelling Total

New data challenges Improve archiving of data and metadata Improve automated pipelines for MX … create pipelines for other techniques Reproducibility –Keywords, version numbers Combined algorithms Deliver results to other life scientists –Quality indications

Data management should be combined with data processing If a web portal offers integrated access to data archives and to processing software, I will use it Last year I discarded some samples or files because their provenance was not recorded well enough. Last year I repeated some work because I could not find the sample or file produced. I would use combined techniques if the software was easier to get and use. If a web portal offers integrated access to data archives and to processing software, I will use it

Crowdsourcing from the middle tier Community includes: –Life scientists who use computers –End user programmers –Algorithm developers Must be easier to compose existing services to make a new web page Google widgets Semantic web BioJS

Structural Biology Work Bench Seamless data transfer between stages Accumulate metadata without user intervention No installation effort Extensible

Reinvent nothing Existing best practise includes: weNMR PaNData Diamond: pipelines and archives Scipion Data Life Cycle Lab Integration, not competition

Developing infrastructures 1.Understand context of use 2.Detailed requirements 3.User experience design 4.Technical architecture 5.Develop 6.Seek feedback users need to become much more directly involved in strategy, coordination and innovation in each of the e-Infrastructure components. This implies that users also need to be empowered to drive the direction of e-Infrastructure service. To this end, the funding for service delivery should be channelled through the users, rather than directly to the service delivery organisations. e-IRG White Paper 2013

Work packages a distributed file system a rigid body docking service that can use a variety of experimental evidence an atomistic structure solution service that can use a variety of experimental evidence a toolkit for making new active web pages that address new scientific questions a construct design service scientific collaborations, which validate the work in progress by putting it to use

References Biasini et al. (2013). Acta Cryst. D69, Gutmanas et al. (2013). Acta Cryst. D69, Karaca, E. & Bonvin, A. M. J. J. (2013). Acta Cryst. D69, Marabini, et al. (2013). Acta Cryst. D69, Morris, C. & Segal, J. (2012). IEEE Software, 29, Perrakis et al. J. Struct. Biol. 175, DiMaio et al., Nature Methods, Improved protein crystal structures at low resolution by integrated refinement with Phenix and Rosetta, in press

Part of life of mmCIF file PDB Local Store PiMS CCP4GUI2 Xia2 MrBump BioInf

Pilot survey at Instruct AGM 73% working on eukaryotic rather than prokaryotic systems 84% working on complexes rather than single gene products Each research team routinely uses three-four different techniques 83% would use combined SB techniques more often if it was easier to get access to experimental facilities 73% of the cases found it hard to combine software tools for different techniques in integrated workflows