Presentation is loading. Please wait.

Presentation is loading. Please wait.

“Proteomics & Bioinformatics” MBI, Master's Degree Program in Helsinki, Finland 8 May, 2007 Sophia Kossida, BRF, Academy of Athens, Greece Esa Pitkänen,

Similar presentations


Presentation on theme: "“Proteomics & Bioinformatics” MBI, Master's Degree Program in Helsinki, Finland 8 May, 2007 Sophia Kossida, BRF, Academy of Athens, Greece Esa Pitkänen,"— Presentation transcript:

1 “Proteomics & Bioinformatics” MBI, Master's Degree Program in Helsinki, Finland 8 May, 2007 Sophia Kossida, BRF, Academy of Athens, Greece Esa Pitkänen, Univeristy of Helsinki, Finland Juho Rousu, University of Helsinki, Finland Lecture 2

2 Gel Image Analysis Image analysis is extracting perceptible data out of the 2DE image, and storing it in a database. It involves detecting spots and warping separate images to align like-spots of the same proteins. Spot data comes from the levels of spot darkness which is proportional to the level of proteins staining or dye labeling of particular amino acids. staining Image acquisitionImage analysis /quantification Reiner Westermeier, GE Healthcare LifeSciences, Munich, Germany

3 Image analysis software Some commercially available softwares: ImageMaster2D/ Melanie PDQuestPDQuest (Bio-Rad, USA) Progenesis Progenesis (Nonlinear, UK) Delta2D Delta2D (Decodon, Germany)

4 Melanie http://www.2d-gel-analysis.com/

5 Melanie http://au.expasy.org/melanie/

6 PDQuest http://www.bio-rad.com/

7 http://www.nonlinear.com/products/progenesis/ Progenesis

8 Delta 2D http://www.decodon.com/Solutions/Delta2D/

9 Staining Detection Pre-labeling - radioisotopes, stable isotopes, fluorescence Intermediate labeling - Fluorescence during equilibration of IPG-strip Staining of gel background – Imidazol Zinc staining Staining of proteins- Organic dyes, silver, fluorescent dyes Blotting – Immuno / affinity detection

10 Scanning Use grayscale (complete range) instead of color images. Scan all gel images using the same orientation, placing each gel at the same position on the scanner plate. Avoid scanning too much of the area around the gel. Limit post-processing to crop, mirror and rotation by 90, 180, 270 degrees Avoid producing TIFF files if you can process calibrated image file formats such as *.IMG/INF and *.GEL. usually the TIFF files are produces without grayscale calibration. This means you loose precision or grayscales are distorted nonlinearly making quantitation questionable. Avoid using JPEGfiles for quantitative analysis. Avoiding background, artifacts and noise is essential insufficient destaining, contamination by fingerprints, fluorescent sprinkle, bubbles) gel breaking, gel pieces

11 Image analysis Manipulation of image/ normalization -separation of overlapping spots, removing lines and speckles Spot detection/ quantification -background subtraction, spot segmentation, land-marking,spot matching Gel comparison -matching of gels (e.g. normal, diseased, treated),alignment Data analysis -changes in expression Data representation -annotation of spots, linking of data: spots -intensity - MS data

12 Organizing experiments Organizing the experiment: Creating projects, folders and subfolders. Importing gel images Melanie/ImageMaster 2D Platinum 6.0

13 Import gels Tool box to easy manipulate gels Melanie/ImageMaster 2D Platinum 6.0

14 Viewing and manipulating images Adjusting contrast Intensity variations in x- and y-direction 3D-view Melanie/ImageMaster 2D Platinum 6.0 Automatically subtracted background

15 Spot detection Adjust the separation between spots Split overlap Eliminate art affects/noise Stain saturation Incomplete resolution Melanie/ImageMaster 2D Platinum 6.0

16 Spots report A spot report summarizes the information about the selected spots Melanie/ImageMaster 2D Platinum 6.0

17 Modified from: mouse cardiac; 250  g loading; pH 3-10 IEF strips; 12.5% SDS-PAGE; file ID: sc5bcon vs. sc15iso PTM? Downregulation? Detection/matching Spot detectionSpot matchingNormalization of spot intensities

18 Matching Melanie/ImageMaster 2D Platinum 6.0 Reference gel Combining 2D gel images -creating a master gel, a “typical profile”.

19 Master gels Combine several images, creating the master image all the spots on a single image –even those that will never be expressed at the same time, a summary of groups of replicate gels (average gel) Delta 2D Any point on a gel can be labeled, and automatically transferred from one gel to another.

20 Gel image warping Compensates for running differences between gels After warping, corresponding spots will have the same position on every image. Variations in migration, protein separation, stain artifacts and stain saturation complicate gel matching and quantitation.

21 Expression Comparison of individual experimental gels to master gels. Identification of variant spots

22 Miscellaneous Automatic retrieval of web information. Send out a “Scout” to the web and bring back corresponding data like pI, MW, sequence, function Create a PowerPoint slide from a gel image Delta 2D

23 2D Gel Databases Swiss-2DPAGESwiss-2DPAGE www.expasy.ch www.expasy.ch GelBank GelBank http://www.gelscape.ualberta.ca:8080/htm/gdbIndex.html http://www.gelscape.ualberta.ca:8080/htm/gdbIndex.html Cornea 2D-PAGE Cornea 2D-PAGE http://www.cornea-proteomics.com/ http://www.cornea-proteomics.com/ World 2DPAGEWorld 2DPAGE, Index of 2D gel databases http://ca.expasy.org/ch2d/2d-index.html

24 Swiss 2D PAGE viewer

25 Gel bank

26 cornea

27 World-2DPAGE http://ca.expasy.org/ch2d/2d-index.html

28 It runs on most UNIX-based operating systems (Linux, Solaris/SunOS, IRIX). Being continuously developed, the tool is evolving in concert with the current Proteomics Standards Initiative of the Human Proteome Organization (HUPO).Proteomics Standards Initiative Data can be marked to be public, as well as fully or partially private. An administration Web interface, highly secured, makes external data integration, data export, data privacy control, database publication and versions' control a very easy task to perform. A software package to create, convert, publish, interconnect and keep up to date 2DE-databases. Provided by ExPASY The database is queryable via description, accession or spot clicking. Cross-references are provided to other federated 2D PAGE database entries, Medline and SWISS-PROT Entries are linked to images showing the experimentally determined and theoretical protein locations. Search via –clickable images, -keywords Make 2D database

29 Federated databases Limitations of current databases: Do not contain strict/detailed descriptions of protocol (buffers, sample volume, staining techniques all important information for gel comparisons). Designed as 2D (and not proteomics) databases and therefore not readily expandable to incorporate other proteomics data e.g. MS, MDLC. Designed for reference gels, not on-going projects. Robustness Consistency Maintenance of the database Data quality A collection of databases that are treated as one entity and viewed through a single user interface (pc.mag.com) (pc.mag.com)

30 Guidelines for building a federated 2-DE database http://ca.expasy.org/ch2d/fed-rules.html Individual entries in the database must be accessible by a keyword search. Other methods are possible but not required. The database must be linked to other databases by active hypertext cross- references, linking together all related databases. Database entries must be at least linked to the main index. A main index has to be supplied that provides a means of querying all databases through one unique query point. Individual protein entries must be available through clickable images. 2DE analysis software designed for use with federated databases, must be able to access individual entries in any federated 2DE databases. for a complete reference, see Appel et al., Electrophoresis 17, 1996, 540-546, 1996):Appel et al., Electrophoresis 17, 1996, 540-546, 1996

31 SWISS 2D PAGE http://au.expasy.org/ch2d/

32 Swiss 2D PAGE viewer Which gel you want to look at

33 Swiss 2D PAGE

34 Swiss-2D PAGE

35 Estimated position Estimated position in human liver sample

36 Vimentin_human (P08670)

37 Peptide Mass Fingerprinting A protein identification technique, that correlates experimental data with theoretical data. Theoretical MS Computer search Protein sequence from database In silico digestion “Experimental” MSProteolytic digestionProtein

38 Peptide Mass Fingerprinting Protein digestion with protease (trypsin) Determination of the mass by MS - Calibration Database searching - Generation of the peptide map Comparison with theoretical peptide maps of known proteins - In silico digestion Identification of the protein based on a probabilistic basis - percent coverage, similarity etc

39 Protein digestion with protease (trypsin) from tutorial written by: Dr J. R. Jefferies, Parasitology Group, Institute of Biological Sciences, University of Wales, UK Cleaves at lysine and arginine, unless either is followed by proline in C-terminal direction trypsin The molecule is cleaved at all the possible sites, which will produce a set of peptides, of varying masses, that are characteristic of that protein. The mass of each peptide will be the sum of the amino acids present including any modifications that those amino acids might have undergone.

40 Determination of mass Every peak corresponds to the exact mass (m/z) of a peptide ion MALDI - MS is used to measure the masses of the proteolytic peptide fragments. Select: Monoisotopic peaks [M+H]+ i.e. singly charged Peak list 1051.54 1086.52 1094.56 1111.59 1244.64 1421.7 1476.67 1542.84 1613.88 1664.97 1763.79 1777.82

41 Isotopes Isotopes are different forms of an element, each having different atomic mass. They have a nuclei with the same number of protons (same atomic number) but different numbers of neutrons. Isotope (A) massAbundance, % Isotope (A+1) massAbundance, % Isotope (A+2) mass 12C1298,9313C13.00335483781.07C1414.003241988 1H1.007825032199.98852H2.01410177800.01153H3.0160492675 14N14.003074005299.63215N15.00010889840.368 16O15.994914622199.75717O16.999131500.03818O17.9991604 Naturally occurring isotopes modified from: http://www.ionsource.com/Card/Mass/mass.htm

42 Monoisotopic- /Average mass Monoisotopic mass is expressed in atomic mass unites (amu), or in daltons (Da). A simulated isotopic distribution of the [M+H]+ ion of a compound (poly- ananine) 1156.3 average mass The monoisotopic mass is the mass of the isotopic peak whose elemental composition is composed of the most abundant isotope s of those elements

43 Accuracy Accurate measurements of peptide masses Accurate databases Relies on the ability to search data already present in various databases The higher the accuracy, the better and more specific the protein hit.

44 Effect of Mass Accuracy and Mass Tolerance search m/zmass tolerance (Da)# hits 15291478 1529,70,1164 1529,730,0125 1529,7340,0014 1529,73480,00012 Tryptic digestion of human hemoglobin alpha chain yields 14 tryptic peptides, of which the peptide VGAHAGEYHAEALER has an exact monoisotopic mass of 1528,7348 Da. The singly charged ion of this peptide has an m/z value of 1529,7348. The result of searching SWISS PROT database against all human and mouse proteins. Lieber, Introduction to Proteomics

45 Database search Peptide mass fingerprinting provides evidence for the most probable identity of a protein. The quality of the Protein identification will depend upon: Quality of the mass spectrometry data The accuracy of the database The power of the search algorithms and software used The genome should be verified for the organism that you are working on. If not, then the next most ideal situation is that there is good cDNA data available. If neither of these are the case then it is worth checking if there are any expressed sequence tags (EST) that can be used.

46 Tools for fingerprinting Mascot Mascot (Matrix Science) AldenteAldente (ExPasy) Profound MS-Fit MS-Fit (Prospector; UCSF) Several of the available peptide mass fingerprinting programs use more sophisticated scoring algorithms. Correct for scoring bias due to protein size, in which larger proteins give rise to greater number of peptides, for tendendy of smaller peptides I databases to have a greater number of matches with search m/z values. Some of these algorithms also apply probability based statistics to better define the significance of protein identifications.

47 Mascot PMF

48 The significance of the result depends on the size of the database being searched > 5% probability that the match is a random events, of no significance Mascot PMF score

49 Mascot PMF results Entry name Coverage similarity Probability to be random coverage % of protein length covered by the experimental peptides

50 Mascot protein view

51 ALDENTE (http://www.expasy.org/tools/aldente/): Aldente is a tool to identify proteins from peptide mass fingerprinting data

52 Aldente, protein window

53 Aldente, peptide view

54 Aldente, results S = S1 * S2 S = final score for this entry S1 = sum of each peptide score S2 = protein level score The scoring is tunable: the weights of each parameter in the score, can be defined independently

55 newt

56 Swiss prot entry

57 Aldente results

58 Profound http://prowl.rockefeller.edu/prowl-cgi/profound.exe

59 Profound results

60 Graphic display of results

61 MS-FIT University of California, San Fransisco UCSF Mass Spectrometry FacilityUCSF Mass Spectrometry Facility http://prospector.ucsf.edu/prospector/4.0.8/html/msfit.htm

62 MS Fit

63 Ms fit detailed report


Download ppt "“Proteomics & Bioinformatics” MBI, Master's Degree Program in Helsinki, Finland 8 May, 2007 Sophia Kossida, BRF, Academy of Athens, Greece Esa Pitkänen,"

Similar presentations


Ads by Google