Proteomics technologies and protein-protein interaction Lars Kiemer Center for Biological Sequence Analysis The Technical University of Denmark Advanced.

Slides:



Advertisements
Similar presentations
Genomes and Proteomes genome: complete set of genetic information in organism gene sequence contains recipe for making proteins (genotype) proteome: complete.
Advertisements

CELLULAR SIGNAL TRANSDUCTION 2) From the membrane to the nucleus TEIL F TRANSCRIPTION FACTORS AND THE REGULATION OF TRANSCRIPTION Rainer deMartin / Erhard.
CSE Fall. Summary Goal: infer models of transcriptional regulation with annotated molecular interaction graphs The attributes in the model.
Global Mapping of the Yeast Genetic Interaction Network Tong et. al, Science, Feb 2004 Presented by Bowen Cui.
Biological networks: Types and sources Protein-protein interactions, Protein complexes, and network properties.
Computational analysis of protein-protein interactions for bench biologists 2-8 September, Berlin Protein Interaction Databases Francesca Diella.
Biological networks: Types and sources Protein-protein interactions, Protein complexes, and network properties.
Research Methodology of Biotechnology: Protein-Protein Interactions Yao-Te Huang Aug 16, 2011.
Genome-wide prediction and characterization of interactions between transcription factors in S. cerevisiae Speaker: Chunhui Cai.
Gene expression analysis summary Where are we now?
Chip arrays and gene expression data. With the chip array technology, one can measure the expression of 10,000 (~all) genes at once. Can answer questions.
Biological networks: Types and origin Protein-protein interactions, complexes, and network properties Thomas Skøt Jensen Center for Biological Sequence.
Introduction to biological networks. protein-gene interactions protein-protein interactions PROTEOME GENOME Citrate Cycle METABOLISM Bio-chemical reactions.
1 Protein-Protein Interaction Networks MSC Seminar in Computational Biology
Predicting protein functions from redundancies in large-scale protein interaction networks Speaker: Chun-hui CAI
Evidence for dynamically organized modularity in the yeast protein- protein interaction network Han, et al
Fa 05CSE182 CSE182-L8 Mass Spectrometry. Fa 05CSE182 Bio. quiz What is a gene? What is a transcript? What is translation? What are microarrays? What is.
Biological networks: Types and origin
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Announcements: Proposal resubmissions are due 4/23. It is recommended that students set up a meeting to discuss modifications for the final step of the.
A combination of the words Proteomics and Genomics. Proteogenomics commonly refer to studies that use proteomic information, often derived from mass spectrometry,
Proteomics Informatics (BMSC-GA 4437) Course Director David Fenyö Contact information
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
A highly abbreviated introduction to proteomics
Ch10. Intermolecular Interactions and Biological Pathways
Review of Ondex Bernice Rogowitz G2P Visualization and Visual Analytics Team March 18, 2010.
MicroRNA Targets Prediction and Analysis. Small RNAs play important roles The Nobel Prize in Physiology or Medicine for 2006 Andrew Z. Fire and Craig.
9/14/20151 EECS 730 Introduction to Bioinformatics Introduction to Proteomics Luke Huan Electrical Engineering and Computer Science
Protein analysis and proteomics (Part 2 of 2). Many of the images in this powerpoint presentation are from Bioinformatics and Functional Genomics by Jonathan.
Presentation for Shamir group meeting Interactome under construction: protein-protein interaction and pathway databases 5/1/2011 Based on the papers: Protein-protein.
Interactions and more interactions
Transient Protein-Protein Interactions (TPPI)
The dynamic nature of the proteome
Finish up array applications Move on to proteomics Protein microarrays.
HUMAN-MOUSE CONSERVED COEXPRESSION NETWORKS PREDICT CANDIDATE DISEASE GENES Ala U., Piro R., Grassi E., Damasco C., Silengo L., Brunner H., Provero P.
Network & Systems Modeling 29 June 2009 NCSU GO Workshop.
Proteome and interactome Bioinformatics.
Unraveling condition specific gene transcriptional regulatory networks in Saccharomyces cerevisiae Speaker: Chunhui Cai.
Cytokines, Growth Factors and Hormones SIGMA-ALDRICH.
1 Having genome data allows collection of other ‘omic’ datasets Systems biology takes a different perspective on the entire dataset, often from a Network.
Lecture 9. Functional Genomics at the Protein Level: Proteomics.
Introduction to IntAct Pablo Porras Millán, IntAct
TAP(Tandem Affinity Purification) Billy Baader Genetics 677.
Genomics II: The Proteome Using high-throughput methods to identify proteins and to understand their function.
Central dogma: the story of life RNA DNA Protein.
Overview of Mass Spectrometry
Proteomics, the next step What does each protein do? Where is each protein located? What does each protein interact with, if anything? What role does it.
Biol 729 – Proteome Bioinformatics Dr M. J. Fisher - Protein: Protein Interactions.
Chapter 3: Non-Associative Learning and Memory From Mechanisms of Memory, second edition By J. David Sweatt, Ph.D.
How many interactions are there? ~6,200 genes ~6,200 proteins x 2-10 interactions/protein ~12, ,000 interactions Yeast.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Extracellular Environment. Extracellular Environment.
1 Genomics Advances in 1990 ’ s Gene –Expressed sequence tag (EST) –Sequence database Information –Public accessible –Browser-based, user-friendly bioinformatics.
1 Having genome data allows collection of other ‘omic’ datasets Systems biology takes a different perspective on the entire dataset, often from a Network.
Network Analysis Goal: to turn a list of genes/proteins/metabolites into a network to capture insights about the biological system 1.Types of high-throughput.
PROTEIN INTERACTION NETWORK – INFERENCE TOOL DIVYA RAO CANDIDATE FOR MASTER OF SCIENCE IN BIOINFORMATICS ADVISOR: Dr. FILIPPO MENCZER CAPSTONE PROJECT.
What is proteomics? Richard Mbasu and Ben Richards.
Protein-protein Interactions
Functional organization of the yeast proteome by systematic analysis of protein complexes Presented by Nathalie Kirshman and Xinyi Ma.
General idea and concepts of cell-cell signaling
Biological networks CS 5263 Bioinformatics.
Proteomics technologies and protein-protein interaction
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Presented by Meeyoung Park
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Proteomics Informatics David Fenyő
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Proteomics Informatics David Fenyő
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Presentation transcript:

Proteomics technologies and protein-protein interaction Lars Kiemer Center for Biological Sequence Analysis The Technical University of Denmark Advanced bioinformatics – November 2005

November 2005 Outlining the problem Around 30% of the human proteins still have no annotated function. Even if the function is known, we often don’t know anything about the big picture (regulation?, multiple functions?, pathogenesis?, mutations?, splice variants?). In fact, the individual proteins are as interesting as bricks in a wall – what we want to know about is the system.

November 2005 Ras Raf MEK MAPK NUCLEUS CYTOPLASM EXTRACELLULAR Rap1 bRaf NCAM DAGL Ca 2+ Fyn FGFR CB1 NCAM Frs2 PLC  Shc Fak PKC PKA Grb2 Sos GAP43 CaMKII CREB C-Fos Example: signal transduction cascade

November 2005 Ras Raf MEK MAPK Transcription MAPK NUCLEUS CYTOPLASM EXTRACELLULAR cAMP Rap1 bRaf NCAM DAGL DAGPIP 2 2-AG Ca 2+ Fyn FGFR CB1 NCAM Frs2 PLC  Shc Fak PKC PKA IP 3 Grb2 Sos Grb2 Sos GAP43 CaMKIICREB C-Fos Example: signal transduction cascade

November 2005 Obtaining data High-throughput data can provide information about interactions with other proteins, protein abundance in different tissues, transcriptional regulation, etc. High-throughput experimental techniques provide large data sets – thus no manual curation is possible.  These data sets often contain false positives.  But combining several such data sets increases confidence.

November 2005 Protein interactions reveal a lot! Hints of the function of a protein are revealed when its interaction partners are known. Guilt by association! Complexes in which none of the interaction partners have known functions are even more interesting.

November 2005 Yeast-two-hybrid screening Has been widely used Only binary interactions High false postive rate Proteins must be able to enter the nucleus

November 2005 Affinity purification Large-scale Can be done on any preparation of cells Often complexes are purified and the order of binding is not obtained An extra step is needed to identify purified proteins

November 2005 Q1 TOF q2 Mass Analyzer(s) Separates gas-phase Ions by m/z Ion Source Converts the analyte into gasphase ions 3 principal components + Detector Ions are detected as they disharge on the detector Mass spectrometer

November 2005 Mass spectrometry in short Extremely sensitive Weight precision of one atom In principle, detection of one, relatively short peptide allows for unambiguous identification. Some proteins are difficult to chop up with proteases. Some peptides are very difficult to ionize. Due to the high sensitivity of the method, contaminations are difficult to avoid.

November 2005 Affinity pulldown BaitPrey SpokeMatrixTruth? Protein interaction databases: Spoke/Matrix

November 2005 Protein interaction data: A total of articles represented in the databases ( June 2005 ). DatabaseUnique article references # interaction pairs in unique references. DIP (binary?) MINT (spoke) Intact (spoke) GRID (binary?) BIND* (protein part) (spoke/matrix) HPRD (matrix) *Approx. 10% of pp interactions in BIND are db’ imports Protein interaction databases: Overlap

November 2005 Species bias in available data A few select organisms are very well-studied, while others are not. The BIND database, species distribution (Alfarano et al., NAR, 2005):

November 2005 Orthologs? Orthologous genes are direct descendants of a gene in a common ancestor: (O'Brien K, Remm et al. 2005) S. cerevisiae D. melanogaster H. sapiens Trans-organism protein interaction network

November 2005 D. melanogaster Experim. C. elegans Experim. S. cerevisiae Experim. H. sapiens MOSAIC Trans-organism protein interaction network

November 2005 Repetition of experiments adds credibility Light blue connection – 1 experiment. Darker blue connection – >1 experiment, 1 organism. Purple connection - >1 experiment, >1 organisms. Light blue connection – 1 experiment. Darker blue connection – >1 experiment, 1 organism. Purple connection - >1 experiment, >1 organisms.

November 2005 Adding co-expression data Red connector – co- expression in 80 different tissues with a correlation coefficient above 0.7. Grey nodes – no expression data available. Red connector – co- expression in 80 different tissues with a correlation coefficient above 0.7. Grey nodes – no expression data available.

November 2005 Nucleolus dynamics Nodes are coloured according to level of protein in the nucleolus following transcriptional inhibition (Andersen et al., Nature, 2005). decreased unchanged Relativelevelofprotein inthenucleolus afterinhibitionoftranscription increased decreased unchanged Relativelevelofprotein inthenucleolus afterinhibitionoftranscription increased

November 2005 Adding up to make high quality associations Integration of various data sources builds up confidence

November 2005 Upon integration comes enlightenment

November 2005 Upon integration comes enlightenment

November 2005 Identifying functional complexes Ribosome (predominantly 60S) DNA repair SMARCA complex TFIID Arp2/3

November 2005 Summary Protein-protein interactions can reveal hints about the function of a protein (guilt by association). Information about protein interactions is obtained with different technologies each with its own advantages and weaknesses. Due to the high degree of systemic conservation, interactions can be inferred from observed interactions in other species. Data are always error-prone. Repeated observations build up confidence. Integrating different types of data can futher build up confidence.