PSI-Proteome Informatics update Andy Jones PSI 2013 Liverpool.

Slides:



Advertisements
Similar presentations
2007 ISSSS Conference Bethesda, MD Joe Jurczyk University of Akron
Advertisements

My AmeriCorps State and National Reports as an Information Tool
1 jNIK IT tool for electronic audit papers 17th meeting of the INTOSAI Working Group on IT Audit (WGITA) SAI POLAND (the Supreme Chamber of Control)
Status on the Mapping of Metadata Standards
13 September 2012 SDMX Technical Working Group1 Report of the SDMX Technical Standards Working Group SDMX Expert Group Meeting, Paris, September 2012.
Technical Highlights 25th August 2011 Sebastian Peters German National Library of Science and Technology.
© 2008 EBSCO Information Services SUSHI, COUNTER and ERM Systems An Update on Usage Standards Ressources électroniques dans les bibliothèques électroniques.
IRRA DSpace April 2006 Claire Knowles University of Edinburgh.
PSI Mass Spectrometry Standards Working Group Summary HUPO PSI MS Standards Working Group.
ICIS-NPDES Plugin Design Preview Webinar ICIS-NPDES Full Batch OpenNode2 Plugin Project Presented by Bill Rensmith Windsor Solutions, Inc. 3/15/2012.
DPM ARCHITECT FOR XBRL XBRL taxonomy editor aimed at BUSINESS USERS Based on the DPM approach and DPM XBRL Architecture Currently on its last stage of.
Impact of OASIS UIMA Standard on Apache UIMA OASIS Unstructured Information Management Architecture (UIMA) TC
Sandra Orchard EMBL-EBI Molecular Interactions
ETIS+: European Transport Policy Information System - Development and Implementation of Data Collection Methodology for EU Transport Modelling Funded by.
Gemini Re-platforming Project Update April This presentation is to provide an overview of the Gemini Re-platforming Project (GRP) and provide early.
Pensoft Writing Tool (PWT) Lyubomir Penev ViBRANT Tools for DNA taxonomists, 11 June 2013, Brussles ViBRANT.
New tools for MIAPE Generation Emilio Salazar Doñate Bioinformatics Group CNB – CSIC.
Big Data & the CPTAC Data Portal Nathan Edwards, Peter McGarvey Mauricio Oberti, Ratna Thangudu Shuang Cai, Karen Ketchum Georgetown University & ESAC.
Gemini Systems, Inc. 61 Broadway New York NY Tel: Fax:: An Introduction to VRXML December,
The COUNTER Code of Practice for Books and Reference Works Peter Shepherd Project Director COUNTER UKSG E-Books Seminar, 9 November 2005.
MS-Viewer – A Web Based Spectral Viewer For Database Search Results Peter R. Baker 1, Alma L. Burlingame 1 and Robert J. Chalkley 1 1 Mass Spectrometry.
Regents Scoring and Reporting EduTech Regents Converter Migration October, 2010.
EBI is an Outstation of the European Molecular Biology Laboratory. PRIDE associated tools: Practical exercise 1 PRIDE team, Proteomics Services Group PANDA.
MIAPE Extractor Tutorial SHPP meeting, 28 Aug 2012 La Cristalera, Miraflores de la Sierra, Madrid Salvador Martínez de Bartolomé Izquierdo CNB-CSIC / ProteoRed.
US GPO AIP Independence Test CS 496A – Senior Design Team members: Antonio Castillo, Johnny Ng, Aram Weintraub, Tin-Shuk Wong Faculty advisor: Dr. Russ.
Scaffold Download free viewer:
OMICS Group Contact us at: OMICS Group International through its Open Access Initiative is committed to make genuine and.
Update on INSPIRE: INSPIRE maintenance and implementation and INSPIRE related EEA activities on biodiversity CDDA/European protected areas technical meeting.
Project “European CDDA and INSPIRE”: scope, transformation workflow and mapping rules INSPIRE Conference 2014 Workshop: Implementing Existing European.
Conclusions What’s next? * Implementation of additional input formats * Additional vendor support: As vendors become more open with their APIs for accessing.
Data standards from the Proteomics Standards Initiative Andy Jones University of Liverpool.
EuPA- EC Education Committee (EC) working plan.
The Functional Genomics Experiment Model (FuGE) Andy Jones School of Computer Science and Faculty of Life Sciences, University of Manchester.
Max Planck Institute for Psycholinguistics Tool development report H. Brugman MPI Nijmegen.
Miscellaneous Excel Combining Excel and Access. – Importing, exporting and linking Parsing and manipulating data. 1.
The Metadata Object Description Schema (MODS) NISO Metadata Workshop May 20, 2004 Rebecca Guenther Network Development and MARC Standards Office Library.
How to assure MIAPE compliance of the data using the ProteoRed MIAPE Extractor tool HUPO-PSI meeting - Liverpool (15th April 2013) Salvador Martínez-Bartolomé.
1 Quant CVs (visualization of differences to psi-ms.obo version 3.7.3, released ) Martin Eisenacher, August 12th, 2011.
COUNTER and the development of standards for usage reports Marthyn Borghuis, Elsevier COUNTER Executive Committee For: CALISE-Taiwan.
26 June 2008 DG REGIO Evaluation Network Meeting Ex-post Evaluation of Cohesion Policy Programmes co-financed by the European Fund for Regional.
La Cristalera, Miraflores de la Sierra, December 2012 HPPHPP Use of SEQUEST search results with ProteoRed.org MIAPE Extractor.
Data Standards Submission 1 st CHr-16 Workshop. Miraflores de la Sierra August, 28 th -29 th 2012 Alberto Medina.
Analysis of Complex Proteomic Datasets Using Scaffold Free Scaffold Viewer can be downloaded at:
MIAMExpress development October 2002 Mohammad shojatalab
The Functional Genomics Experiment Object Model (FuGE) Andrew Jones, School of Computer Science, University of Manchester MGED Society.
FuGE: A framework for developing standards for functional genomics Angel Pizarro Univesrity of Pennsylvania Andrew Jones University of Manchester.
Software Project MassAnalyst Roeland Luitwieler Marnix Kammer April 24, 2006.
Standards for proteomics: The HUPO Proteomics Standards Initiative (HUPO PSI) Public Repository for Mass spectrometry spectral.
FuGE: A framework for developing standards for functional genomics Andrew Jones School of Computer Science, University of Manchester Metabomeeting 2.0.
Extending FuGE into other domains Andrew Jones School of Computer Science, University of Manchester
Johannes Griss PSI Meeting Heidelberg, April 2011 EBI is an Outstation of the European Molecular Biology Laboratory. mzTab Proposal for.
September 25, 2006 NASA Feasibility Study Status Update.
Jason Platts Lead Technical Developer The Open University An overview of how the Open University has incorporated bibliographic.
Query Health Distributed Population Queries Implementation Group Meeting March 6, 2012.
Water Framework Directive WISE Items on which to report progress oFinalising schemas for 2005 reporting oDevelop WISE Web site for 2005 oDevelop.
Proteomics Informatics (BMSC-GA 4437) Instructor David Fenyö Contact information
DEX Publication Project OASIS PLCS TC Face to Face meeting 10 March 2008 Trine Hansen.
3rd Training Workshop June 2008, Ostende Management of CSR Anne Che-Bohnenstengel, BSH  Metadata Formats  Defined Vocabularies  Content Management.
CoLIMS progress Computational Omics and Systems Biology (CompOmics) Group Niels Hulstaert
Using Scaffold OHRI Proteomics Core Facility. This presentation is intended for Core Facility internal training purposes only.
Democratization of ‘Omics Data Availability and Review Robert Chalkley UCSF Data Management Editor - MCP.
Agenda Welcome from the Skyline team!
Open source tools for data analysis
Research Organisation Subgroup June 1, 2017
AIXM 5.1 – Interoperability issues
Overview PSI-PI activities
Review plan of the nature reporting – update 7
WP 4 - Revision of Natura 2000 dataflow
Education and Training Statistics Working Group – 2-3 June 2016
QoS Metadata Status 106th OGC Technical Committee Orléans, France
Presentation transcript:

PSI-Proteome Informatics update Andy Jones PSI 2013 Liverpool

PSI-PI outputs Formats and guidelines for proteome informatics Standard formats: – mzIdentML – mzQuantML – mzTab Reporting guidelines – MIAPE MSI – MIAPE Quant

MIAPE documents Originally one MIAPE document: – MIAPE Mass spectrometry information (MSI) containing both identification guidelines and quant guidelines – MIAPE MSI (ident only) and MIAPE Quant MIAPE MSI status – MIAPE MSI 1.1 published back in 2008 – Working group minor updates to requirements and removal of quant parts – MIAPE MSI 1.2 still needs to be re-submitted to PSI process Plan for meeting: – Final issues with MIAPE MSI and alignment with mzIdentML?

MIAPE Quant timeline Work started on Dec 2010 by ProteoRed groups of experts Shared with PSI working groups in March 2011 Revision at PSI meeting (Heidelberg) April 2011 PSI review: – Public and external review ended on August 2012 – Major revision accepted on October 2012 Journal of Proteomics: – Submitted on 15th February – Accepted on 27th February after minor revision Martínez-Bartolomé, S., Deutsch, E. W., Binz, P.-A., Jones, A. R., Eisenacher, M., Mayer, G., Campos, A., Canals, F., Bech-Serra, J.-J., Carrascal, M., Gay, M., Paradela, A., Navajas, R., Marcilla, M., Hernáez, M. L., Gutiérrez-Blázquez, M. D., Velarde, L. F. C., Aloria, K., Beaskoetxea, J., Medina-Aunon, J. A., and Albar, J. P. Guidelines for reporting quantitative mass spectrometry based experiments in proteomics. Journal of Proteomics, 2013 in press. No planned work for meeting

mzIdentML Timeline: – Original 1.0 version in Aug 2009 – Version 1.1 stable (Aug 2011) – Manuscript published in MCP in 2012 PSI 2013 To do list: – Updates to protein grouping – PTM localisation / ambiguity scoring – General discussion of data compression issues Jones, A. R., Eisenacher, M., Mayer, G., Kohlbacher, O., Siepen, J., Hubbard, S., Selley, J., Searle, B., Shofstahl, J., Seymour, S., Julian, R., Binz, P.-A., Deutsch, E. W., Hermjakob, H., Reisinger, F., Griss, J., Vizcaino, J. A., Chambers, M., Pizarro, A., and Creasy, D. (2012) The mzIdentML data standard for mass spectrometry-based proteomics results. Molecular & Cellular Proteomics 11, M

ToolStatusIMPORT/EXPORT Mascot mzIdentML Version 1.0 available in Mascot version 2.3, mzid stable version 1.1 available in Mascot version 2.4 EXPORT OMSSAhttp://code.google.com/p/mzidentml-lib/ from the U.Liverpool groupEXPORT MSGF+Full support for results into mzid 1.1EXPORT PeaksNative export of mzIdentML version 1.1EXPORT PhenyxExporter to mzIdentML v1.1 now available - contact GeneBio for details.EXPORT PLGS ProCon Conversion of SEQUEST *.out into mzIdentML (SpectrumIdentificationResults only); Conversion of ProteomeDiscoverer (Thermo) *.msf and *.prot.xml files into mzIdentML EXPORT ProteinPilotContact is Sean Seymour ProteinScapeWork in progress SEQUEST - NativeProCon (see above) ProteoWizard pepXML converter available now - impl. of C++ library for reading/writing MzIdentML / interface for importing other formats IMPORT AND EXPORT SEQUEST - BioWorksWork in progress SEQUEST - Proteome DiscovererWork in progress (exporters available from ProCon project) SpectraSTProteowizard conversion from pepXML EXPORT Spectrum Mill X!Tandemhttp://code.google.com/p/mzidentml-lib/ EXPORT OpenMSFully supported in release 1.9IMPORT AND EXPORT ScaffoldAvailable now in Scaffold version 3.0EXPORT Scaffold PTMScaffold PTM tool imports identifications in mzIdentMLIMPORT TPPpepXML to mzid from ProteoWizardProteoWizardIMPORT AND EXPORT MIAPE MSI ExtractorTool available from the ProteoRed teamProteoRed IMPORT CSV exporter IMPORT PAAnalyzerImports and exports mzIdentML (only v1.0)IMPORT AND EXPORT MyrimatchIdentifications exported in mzIdentMLEXPORT TagReconIdentifications exported in mzIdentMLEXPORT PepitomeIdentifications exported in mzIdentMLEXPORT IDPickerVersion 3.x implements mzIdentML importIMPORT jmzIdentMLJava API for reading and writing mzIdentMLIMPORT AND EXPORT Tooling for mzIdentML

Formats mzQuantML – Output of quantitative software – Quantitative values about proteins, protein groups, peptides and features (quantified regions on mass spec) also small molecules... Relative or absolute values for single samples (Assays) or groups of replicates (StudyVariables)

mzQuantML status Version 1.0 rc-1 submitted to the PSI process October 2011 Version 1.0 rc-2 June 2012 Re-submitted to PSI process in October 2012 & manuscript submitted to MCP, minor correction received Completed PSI process in Feb 2013 – version 1.0 release – Supports label-free (intensity), label-free (spectral counting), MS 2 tag techniques (e.g. iTRAQ) and MS 1 label techniques e.g. SILAC – Schema is fixed with each technique defined by separate semantic rules, implemented in validator software – Manuscript re-submitted to MCP, awaiting outcome Implementations Java API for creating example files (version 1.0 release): Java-based validator (version 1.0 release): Software for converting output files from MaxQuant and Progenesis: – Qi, D et al. OMICS 16(9): ; Implementation in OpenMS for some techniques Beta Java library of routines inc. mzTab exporter: Beta Excel to mzQuantML converter for spectral count data: mzquantml-converter/ mzquantml-converter/ Mzq To do list: Need to add SRM support – Local testing of SRM encoding and conversion from Skyline – Need wider input on our mapping and writing semantic rules for software Need to check whether protein grouping and mod scoring map onto format okay

mzTab To provide a simple and efficient way of exchanging results from MS approaches. – Simple summary “final” report of the experimental results; Peptides and proteins identified and quantified – Small molecules included (metabolomics) – Technical and biological metadata – Spectra can be referenced in optional columns. – Set of mandatory and optional attributes (very flexible). Four sections: – (Optional) Metadata section – (Optional) Protein section – (Optional) Peptide section – (Optional) Small Molecule section (metabolomics) Can report MS derived data at different levels: – Single experiments – Multiple (possibly linked) experiments (merged files) – Data generated as a result of a query to a bioinformatics resource – Possible to add a reliability score for each identification Easy to parse and use by the research community, systems biologists as well as providers of knowledge bases. It can be used by non-experts in bioinformatics and/or proteomics.

mzTab status Submitted to the PSI document process on May TO DO: Addressing now the remaining (minor) comments after the second round of review. So, we hope that version 1.0 will soon be formalised. Publication (revised version) under review in MCP. Current implementations: – jmzTab (Java API): 2 versions have been developed. Version 2.0 (Q.W. Xu, about to be finished, with more functionality) is going to be the maintained version. Version 1.0 (J. Griss) will not be further maintained. – mzTab Validator, PRIDE XML to mzTab converter and mzTab merger in beta status. – PRIDE Converter 2. – OpenMS (version 1.10) – R/Bioconductor package Msnbase (L. Gatto, Cambridge University) – LipidDataAnalyzer (University of Graz) – Metabolights (EBI) and COSMOS EU project: A slightly modified version is being used right now. Working in contact with them.

End-meeting summary...

PSI-PI work done mzIdentML – Minor schema issues: Optional attribute (Dbsequence_Ref) on ProteinDetectionHypothesis (would be better if mandatory) – update spec doc encouraging best practice Pre-fractionation: – update spec doc encouraging best practice – one SpectrumIdentificationList where possible Retention time reporting: – Update spec doc encouraging best practice; align with mzML CVs – Support for Crosslinking results Sketched a possible reporting format that looks to cover most simple cases Needs (considerable) further testing in local implementations and follow up by calls – Mod localisation Sketched some possible encodings Needs follow up calls and implementation in software Keen to build this support into mzid 1.1 but model is going to be a work-around. – Protein grouping Reported back current progress of working group (key members not present here) New members will join the working group

PSI-PI work done mzQuantML – Sketched SRM example files for label-based encoding – Need sketched example for label-free but seems straightforward – Plan to build export software very soon from Skyline (prototype already done) and mProphet – Write up semantic encoding rules – Submit to PSI doc process as a Community Practice document

PSI-PI work done mzTab Finalised minor issues from second round of PSI doc process review Deadline 1 st May for re-submitting final “release 1.0” document – Needs minor updates to document and example files