Software Project MassAnalyst Roeland Luitwieler Marnix Kammer April 24, 2006.

Slides:



Advertisements
Similar presentations
Genomes and Proteomes genome: complete set of genetic information in organism gene sequence contains recipe for making proteins (genotype) proteome: complete.
Advertisements

Kaizhong Zhang Department of Computer Science University of Western Ontario London, Ontario, Canada Joint work with Bin Ma, Gilles Lajoie, Amanda Doherty-Kirby,
MN-B-C 2 Analysis of High Dimensional (-omics) Data Kay Hofmann – Protein Evolution Group Week 5: Proteomics.
Protein Sequencing and Identification by Mass Spectrometry.
PEAKS: De Novo Sequencing using MS/MS spectra Bin Ma, U. Western Ontario, Canada Kaizhong Zhang,U. Western Ontario, Canada Chengzhi Liang, Bioinformatics.
Proteomics The proteome is larger than the genome due to alternative splicing and protein modification. As we have said before we need to know All protein-protein.
ProReP - Protein Results Parser v3.0©
Proteomics Informatics – Protein identification II: search engines and protein sequence databases (Week 5)
Previous Lecture: Regression and Correlation
Mass Spectrometry. What are mass spectrometers? They are analytical tools used to measure the molecular weight of a sample. Accuracy – 0.01 % of the total.
FIGURE 5. Plot of peptide charge state ratios. Quality Control Concept Figure 6 shows a concept for the implementation of quality control as system suitability.
Mass Spectroscopy 1Dr. Nikhat Siddiqi. Mass spectrometry is a powerful analytical technique that is used to identify unknown compounds, to quantify known.
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
My contact details and information about submitting samples for MS
A combination of the words Proteomics and Genomics. Proteogenomics commonly refer to studies that use proteomic information, often derived from mass spectrometry,
Proteomics Josh Leung Biology 1220 April 13 th, 2010.
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
Daehee Hwang Leroy Hood Institute for Systems Biology.
Gas Chromatography And Mass Spectrometry
Tryptic digestion Proteomics Workflow for Gel-based and LC-coupled Mass Spectrometry Protein or peptide pre-fractionation is a prerequisite for the reduction.
The CCE 5 th Annual Retreat Global Proteomics & Determination of Vitamin D Metabolites Update Jiri Adamec.
Secondary ion mass spectrometry (SIMS)
PROTEIN STRUCTURE NAME: ANUSHA. INTRODUCTION Frederick Sanger was awarded his first Nobel Prize for determining the amino acid sequence of insulin, the.
Introduction The GPM project (The Global Proteome Machine Organization) Salvador Martínez de Bartolomé Bioinformatics support –
PROTEIN QUANTIFICATION AND PTM JUN SIN HSS.I. PROJECT 1.
INF380 - Proteomics-91 INF380 – Proteomics Chapter 9 – Identification and characterization by MS/MS The MS/MS identification problem can be formulated.
Common parameters At the beginning one need to set up the parameters.
1 Chemical Analysis by Mass Spectrometry. 2 All chemical substances are combinations of atoms. Atoms of different elements have different masses (H =
William Kou Jonathan Mertz. Introduction The field of Mass Spectrometry using Ambient Ionization techniques has grown exponentially since The direct.
A Comprehensive Comparison of the de novo Sequencing Accuracies of PEAKS, BioAnalyst and PLGS Bin Ma 1 ; Amanda Doherty-Kirby 1 ; Aaron Booy 2 ; Bob Olafson.
Laxman Yetukuri T : Modeling of Proteomics Data
CS 461b/661b: Bioinformatics Tools and Applications Software Algorithm Mathematical Models Biology Experiments and Data.
Temple University MASS SPECTROMETRY FURTHER INVESTIGATIONS Ilyana Mushaeva and Amber Moscato Department of Electrical and Computer Engineering Temple University.
Temple University MASS SPECTROMETRY INTRODUCTION Ilyana Mushaeva and Amber Moscato Department of Electrical and Computer Engineering Temple University.
Lecture 9. Functional Genomics at the Protein Level: Proteomics.
Patricia HernandezGeneva, 28 th September 2006 Swiss Bio Grid: Proteomics Project (PP)
PEAKS: De Novo Sequencing using Tandem Mass Spectrometry Bin Ma Dept. of Computer Science University of Western Ontario.
Proteomics What is it? How is it done? Are there different kinds? Why would you want to do it (what can it tell you)?
Proteomics databases for comparative studies: Transactional and Data Warehouse approaches Patricia Rodriguez-Tomé, Nicolas Pinaud, Thomas Kowall GeneProt,
C OMPUTING E SSENTIALS Timothy J. O’Leary Linda I. O’Leary Presentations by: Fred Bounds.
Dr. Kristoffer Rem Labing-isa Massachusetts Institute of Technology Dean, College of Science.
Mass Analyst Analysis of MS(MS) data. Short function overview: Load mzXML data (ms-ms data) Load pepXML and/or mascot data (found proteins/peptides after.
Overview of Mass Spectrometry
A New Strategy of Protein Identification in Proteomics Xinmin Yin CS Dept. Ball State Univ.
EBI is an Outstation of the European Molecular Biology Laboratory. In silico analysis of accurate proteomics, complemented by selective isolation of peptides.
Proteomics Informatics (BMSC-GA 4437) Instructor David Fenyö Contact information
Mass Spectrometry Relative atomic masses and the mass of individual isotopes can be determined using a mass spectrometer. The principle behind mass spectrometry.
Mass Spectrograph Physics Chapter 27 B. Mass spectrograph Adapted CRT/Thomson tube used to form and accelerate charged particles and measure their deflection.
Proteomics Informatics (BMSC-GA 4437) Course Directors David Fenyö Kelly Ruggles Beatrix Ueberheide Contact information
Atomic Structure Topic The mass spectrometer Describe and explain the operation of a mass spectrometer Describe how the mass spectrometer.
5. QUANTIFYING CHEMISTRY Chapter 5.  Atoms are extremely tiny and have a very very tiny mass. How do we measure atoms?  We have a convenient way to.
Peptide de novo sequencing Peptide de novo sequencing is the analytical process that derives a peptide’s amino acid sequence from its tandem mass spectrum.
Post translational modification n- acetylation Peptide Mass Fingerprinting (PMF) is an analytical technique for identifying unknown protein. Proteins to.
Mass Spectrometry makes it possible to measure protein/peptide masses (actually mass/charge ratio) with great accuracy Major uses Protein and peptide identification.
Review of Basic Atomic Structure, Mass Spectrometry
National High Magnetic Field Laboratory- ICR Group
A Database of Peak Annotations of Empirically Derived Mass Spectra
MassMatrix Search Results Explained
Bioinformatics Solutions Inc.
Mass Spectrometry.
Structure Determination: Mass Spectrometry and Infrared Spectroscopy
Proteomics Informatics David Fenyő
A perspective on proteomics in cell biology
Proteomics Informatics –
Mass Spectrometry THE MAIN USE OF MS IN ORG CHEM IS:
Shotgun Proteomics in Neuroscience
Applying principles of computer science in a biological context
Proteomics Informatics David Fenyő
Presentation transcript:

Software Project MassAnalyst Roeland Luitwieler Marnix Kammer April 24, 2006

2 Overview Introduction System requirements Our solution: Spectre Progress so far Conclusion

3 Introduction Project initiator Scientific background The need for software tools

4 Project initiator Dr. ir. Bas van Breukelen Department of Biomolecular Mass Spectrometry Utrecht University! WENT building Expert in: Bioinformatics Proteomics

5 Scientific background: Proteomics Our body consists of cells Cell functionality and structure is offered by proteins Proteomics Main research areas: Identification of proteins Interaction of proteins Comparison of protein levels

6 Protein identification How to identify proteins? Identity defined by their structure Protein structure Protein: sequence of peptides Peptide: sequence of amino acids 20 common types Consist of different atoms – have different masses Too small to see… but not to weigh Mass Spectrometry!

7 Mass Spectrometry (MS) Technique using a mass spectrometer Input: sample of peptides Proteins have been split chemically Provides a.o. more accuracy, efficiency Most head / tail subsequences are present Output: mass spectrum Frequencies of particles of certain masses Full peptide sequence can be derived

8

9 Mass Spectrometry (MS) How does it work? Ionize particles Now particles have an electrical charge Accelerate them in an electric field Deflect them in a magnetic field Deflection depends on mass (F = m a) Measure how far they have been deflected

10 Mass Spectrometry (MS) Improvements for better analysis (1) Use chromatography Spreads input over time: more details Output: a sequence of MS spectra

11 Mass Spectrometry (MS) Improvements for better analysis (2) Use “recursive” mass spectrometry Called MS/MS (or MS 2 or tandem MS) Take part of the sample that produces a peak Usually concerns one certain peptide Output: MS spectra with related MS/MS spectra

12 Mass Spectrometry (MS) Improvements for better analysis (3) Use bioinformatics All output is translated to mzXML A database is searched on MS/MS spectra Input: raw MS data Output: pepXML : peptide information Tools are used to e.g. display the data Lots of redundant / boring work is taken care of!

13 Bioinformatics: what can be done? Remember the Proteomics research areas: Identification of proteins Interaction of proteins Comparison of protein levels Most research: differ one aspect at a time Requires interactive display of data Zooming, “stacking”, cross sections, etc. But not just display of data Filtering, “warping”, peak detection, etc.

14 Bioinformatics: existing tools Tools exist, but… Lots of different tools to do different things Functionality not always as desired They also lack functionality Not easily extendable Example: Pep3D Nice visualization, but Only one sample at a time, only a single view Solution: develop new software

15 System requirements Load raw spectrometry data Visualize the data Manipulate and analyze the data interactively Export data Extendibility Use in open community Open source

16 Loading data mzXML : raw spectrometry data MS spectra Embedded MS/MS spectra pepXML : database of matches with peptides

17 Visualizing the data List of loaded samples MS spectrum Cross sections of the MS spectrum MS/MS spectra Peptide information

18 Manipulating and analyzing the data Stacking: toggle samples on/off Warping Zooming Peak detection More analysis, like ratio calculation

19 Export data Lists of peak pairs Modified PepXML (i.e. with ratios) Images of spectra Modified samples

20 Our solution: Spectre

21 Opening a workspace

22 Loading a workspace

23 After the workspace is loaded

24 Working in normal mode

25 Zoom on selection mode

26 After zooming in

27 Zoom on click mode

28 After zooming in

29 The structure of Spectre Graph : MS spectra, cross sections, MS/MS spectra Workspace : a collection of samples and settings Sample : internal data structure for one sample GUI : the user interface Processor : the main link between parts of the program

30 The structure of Spectre Processor GUI Workspace Sample Graph * * 4 1 1

31 Systematic approach to the problem Phased development Three versions Lots of diagrams Application of courses MSO, PM HCI team and data layer team Later on: data visualization team Extreme Programming

32 Progress so far First version will be due in week 18 Functionality: Loading raw data Visualization and user interface Basic interaction with zooming etc. Complete internal data structures Export of images Missing link between mzXML and pepXML!

33 Further planning Version 2 – week 23 Warping Peak detection / analysis Export of calculated data Version 3 – week 27 Ratio calculation Modification of samples

34 After completion of the project Web site Open source further maintaining extendable

35 Conclusion Spectre: a modular and extendable program A combination of many different requirements Phased addition of features Any questions?

36 The data structure MzTable Sample MzNode * 1 SampleParser SampleWriter MzParserPepParser ……