Bioinformatics master course DNA/Protein structure-function analysis and prediction Lecture 13: Protein Function Centre for Integrative Bioinformatics.

Slides:



Advertisements
Similar presentations
C E N T R F O R I N T E G R A T I V E B I O I N F O R M A T I C S V U E Bioinformatics Master Course: DNA/Protein Structure-Function Analysis and Prediction.
Advertisements

Applications of GO. Goals of Gene Ontology Project.
25th June 2007 Jane Lomax Using the Gene Ontology (GO) for analysis of expression data Jane Lomax EMBL-EBI.
Biological pathway and systems analysis An introduction.
Bioinformatics for biomedicine Summary and conclusions. Further analysis of a favorite gene Lecture 8, Per Kraulis
Gene function analysis Stem Cell Network Microarray Course, Unit 5 May 2007.
CACAO - Remote training Gene Function and Gene Ontology Fall 2011
CACAO - Remote training Gene Function and Gene Ontology Fall 2011
Sequence-Structure-Function Sequence Structure Function Threading Ab initio BLAST Folding: impossible but for the smallest structures Function prediction.
Internet tools for genomic analysis: part 2
Ontologies for Informatics. Infrastructure for Systems Biology. Oxford October
CACAO - Penn State Gene Function and Gene Ontology January 2011
EVOLUTIONARY AND COMPUTATIONAL GENOMICS Shin-Han Shiu Plant Biology / CMB / EEBB / Genetics / QBMI.
Bioinformatics for biomedicine More annotation, Gene Ontology and pathways Lecture 6, Per Kraulis
Genome projects and model organisms Level 3 Molecular Evolution and Bioinformatics Jim Provan.
Daniel Rico, PhD. Daniel Rico, PhD. ::: Introduction to Functional Analysis Course on Functional Analysis Bioinformatics Unit.
Advanced Algorithms and Models for Computational Biology Class Overview Eric Xing & Ziv Bar-Joseph Lecture 1, January 18, 2005 Reading: Chap. 1, DTM book.
Using The Gene Ontology: Gene Product Annotation.
Gene Ontology (GO) Project
Genomes School B&I TCD Bioinformatics May Genome sizes Completed eukaryotic nuclear genomes Type of organismSpeciesGenome size (10 6 base pairs)
GO and OBO: an introduction. Jane Lomax EMBL-EBI What is the Gene Ontology? What is OBO? OBO-Edit demo & practical What is the Gene Ontology? What is.
The aims of the Gene Ontology project are threefold: - to compile vocabularies to describe components, functions and processes - to produce tools to query.
Ontologies, data standards and controlled vocabularies.
March 24, Integrating genomic knowledge sources through an anatomy ontology Gennari JH, Silberfein A, and Wiley JC Pac Symp Biocomputing 2005:
Gene Ontology Consortium
1 Bio-Trac 40 (Protein Bioinformatics) October 8, 2009 Zhang-Zhi Hu, M.D. Associate Professor Department of Oncology Department of Biochemistry and Molecular.
COURSE OF BIOINFORMATICS Exam_31/01/2014 A.
IGEM 101: Session 7 4/2/15Jarrod Shilts 4/5/15Ophir Ospovat.
Computational biology of cancer cell pathways Modelling of cancer cell function and response to therapy.
The Gene Ontology: a real-life ontology, progress and future. Jane Lomax EMBL-EBI.
The Gene Ontology project Jane Lomax. Ontology (for our purposes) “an explicit specification of some topic” – Stanford Knowledge Systems Lab Includes:
Gene Ontology Project
Gene Ontology TM (GO) Consortium Jennifer I Clark EMBL Outstation - European Bioinformatics Institute (EBI), Hinxton, Cambridge CB10 1SD, UK Objectives:
Lecture Four: GO: The Gene Ontology ----Infrastructure for Systems Biology.
Cell Signaling Ontology Takako Takai-Igarashi and Toshihisa Takagi Human Genome Center, Institute of Medical Science, University of Tokyo.
BIOINFORMATIK I UEBUNG 2 mRNA processing.
24th Feb 2006 Jane Lomax GO Further. 24th Feb 2006 Jane Lomax GO annotations Where do the links between genes and GO terms come from?
Part II GO-Vocabulary of Genome. S. cerevisiae D. melanogaster.
The Gene Ontology and its insertion into UMLS Jane Lomax.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Other biological databases and ontologies. Biological systems Taxonomic data Literature Protein folding and 3D structure Small molecules Pathways and.
The Mammalian Protein – Protein Interaction Database and Its Viewing System That Is Linked to the Main FANTOM2 Viewer Genome Research (2003) Speaker: 蔡欣吟.
To Boldly GO… Amelia Ireland GO Curator EBI, Hinxton, UK.
Proteomics Session 1 Introduction. Some basic concepts in biology and biochemistry.
Gene Ontology Project
Biological Networks & Systems Anne R. Haake Rhys Price Jones.
Gene Ontology Consortium
Central dogma: the story of life RNA DNA Protein.
Bioinformatics and Computational Biology
Chapter 1 Introduction.
Domains, their prediction and domain databases Lecture 16: Introduction to Bioinformatics C E N T R F O R I N T E G R A T I V E B I O I N F O R M A T I.
Scope of the Gene Ontology Vocabularies. Compile structured vocabularies describing aspects of molecular biology Describe gene products using vocabulary.
EBI is an Outstation of the European Molecular Biology Laboratory. UniProtKB Sandra Orchard.
The Membrane Plays a Key Role in a Cell’s Response to Environmental Signals Cells can respond to many signals if they have a specific receptor.
Tools in Bioinformatics Ontologies and pathways. Why are ontologies needed? A free text is the best way to describe what a protein does to a human reader.
PROTEIN INTERACTION NETWORK – INFERENCE TOOL DIVYA RAO CANDIDATE FOR MASTER OF SCIENCE IN BIOINFORMATICS ADVISOR: Dr. FILIPPO MENCZER CAPSTONE PROJECT.
Gene Ontology TM (GO) Consortium
Joined up ontologies: incorporating the Gene Ontology into the UMLS.
 What is MSA (Multiple Sequence Alignment)? What is it good for? How do I use it?  Software and algorithms The programs How they work? Which to use?
Canadian Bioinformatics Workshops
Module 1: Gene List/Network Intro Canadian Bioinformatics Workshops
Module 1: Gene Lists 1 Canadian Bioinformatics Workshops
Gene Annotation & Gene Ontology May 24, Gene lists from RNAseq analysis What do you do with a list of 100s of genes that contain only the following.
Canadian Bioinformatics Workshops
Sequence-Structure-Function Sequence Structure Function Threading Ab initio BLAST Folding: impossible but for the smallest structures Function prediction.
What’s new in GO?. Priorities Annotation outreach Reference genomes User advocacy Ontology development Software.
Annotating with GO: an overview
Department of Genetics • Stanford University School of Medicine
Using the Gene Ontology (GO) for analysis of expression data Jane Lomax EMBL-EBI 25th June 2007 Jane Lomax.
Cell Signaling (Lecture 1)
Presentation transcript:

Bioinformatics master course DNA/Protein structure-function analysis and prediction Lecture 13: Protein Function Centre for Integrative Bioinformatics VU (IBIVU) Faculty of Sciences / Faculty of Earth & Life Sciences

Sequence-Structure-Function Sequence Structure Function Threading Ab initio BLAST Folding: impossible but for the smallest structures Function prediction from structure – very difficult

Experimental Structural genomics Functional genomics Protein-protein interaction Metabolic pathways Expression data

Protein function categories Catalysis (enzymes) Binding – transport (active/passive) –Protein-DNA/RNA binding (e.g. histones, transcription factors) –Protein-protein interactions (e.g. antibody-lysozyme) (experimentally determined by yeast two-hybrid (Y2H) or bacterial two-hybrid (B2H) screening ) –Protein-fatty acid binding (e.g. apolipoproteins) –Protein – small molecules (drug interaction, structure decoding) Structural component (e.g.  -crystallin) Regulation Signalling Transcription regulation Immune system Motor proteins (actin/myosin)

Catalytic properties of enzymes K m k cat E + S ES E + P E = enzyme S = substrate ES = enzyme-substrate complex (transition state) P = product K m = Michaelis constant k cat = catalytic rate constant (turnover number) K cat /K m = specificity constant (useful for comparison) [S] Moles/s V max V max /2 KmKm V max × [S] V = Michaelis-Menten equation K m + [S]

Protein interaction domains Protein Interaction Domains

Energy difference upon binding Examples of protein interactions (and functional importance) include: Protein – protein (pathway analysis); Protein – small molecules (drug interaction, structure decoding); Protein – peptides, DNA/RNA (function analysis) The change in Gibb’s Free Energy of the protein-ligand binding interaction can be monitored and expressed by the following;  G =  H – T  S (H=Enthalpy, S=Entropy and T=Temperature)

Protein function Many proteins combine functions Some immunoglobulin structures are thought to have more than 100 different functions (and active/binding sites) Alternative splicing can generate (partially) alternative structures

Protein function Active site / binding cleft Protein-protein interaction Shape complementarity

Protein function evolution Chymotrypsin

How to infer function Experiment Deduction from sequence –Multiple sequence alignment – conservation patterns –Homology searching Deduction from structure –Threading –Structure-structure comparison –Homology modelling

Mevalonate plays a role in epithelial cancers: it can inhibit EGFR Cholesterol biosynthesis primarily occurs in eukaryotic cells. It is necessary for membrane synthesis, and is a precursor for steroid hormone production as well as for vitamin D. While the pathway had previously been assumed to be localized in the cytosol and ER, more recent evidence suggests that a good deal of the enzymes in the pathway exist largely, if not exclusively, in the peroxisome (the enzymes listed in blue in the pathway to the left are thought to be at least partly peroxisomal). Patients with peroxisome biogenesis disorders (PBDs) have a variable deficiency in cholesterol biosynthesis

Epidermal Growth Factor as a Clinical Target in Cancer Introduction: A malignant tumour is the product of uncontrolled cell proliferation. Cell growth is controlled by a delicate balance between growth-promoting and growth-inhibiting factors. In normal tissue the production and activity of these factors results in differentiated cells growing in a controlled and regulated manner that maintains the normal integrity and functioning of the organ. The malignant cell has evaded this control; the natural balance is disturbed (via a variety of mechanisms) and unregulated, aberrant cell growth occurs. A key driver for growth is the epidermal growth factor (EGF) and the receptor for EGF (the EGFR) has been implicated in the development and progression of a number of human solid tumours including those of the lung, breast, prostate, colon, ovary, head and neck.

Energy housekeeping: Adenosine diphosphate (ADP) – Adenosine triphosphate (ATP)

Metabolic networks Glycolysis and Gluconeogenesis Kegg database (Japan)

Gene Ontology (GO) Not a genome sequence database Developing three structured, controlled vocabularies (ontologies) to describe gene products in terms of: –biological process –cellular component –molecular function in a species-independent manner

The GO ontology

Gene Ontology Members FlyBase - database for the fruitfly Drosophila melanogaster Berkeley Drosophila Genome Project (BDGP) - Drosophila informatics; GO database & software, Sequence Ontology development Saccharomyces Genome Database (SGD) - database for the budding yeast Saccharomyces cerevisiae Mouse Genome Database (MGD) & Gene Expression Database (GXD) - databases for the mouse Mus musculus The Arabidopsis Information Resource (TAIR) - database for the brassica family plant Arabidopsis thaliana WormBase - database for the nematode Caenorhabditis elegans EBI GOA project : annotation of UniProt (Swiss-Prot/TrEMBL/PIR) and InterPro databases Rat Genome Database (RGD) - database for the rat Rattus norvegicus DictyBase - informatics resource for the slime mold Dictyostelium discoideum GeneDB S. pombe - database for the fission yeast Schizosaccharomyces pombe (part of the Pathogen Sequencing Unit at the Wellcome Trust Sanger Institute) GeneDB for protozoa - databases for Plasmodium falciparum, Leishmania major, Trypanosoma brucei, and several other protozoan parasites (part of the Pathogen Sequencing Unit at the Wellcome Trust Sanger Institute) Genome Knowledge Base (GK) - a collaboration between Cold Spring Harbor Laboratory and EBI) TIGR - The Institute for Genomic Research Gramene - A Comparative Mapping Resource for Monocots Compugen (with its Internet Research Engine) The Zebrafish Information Network (ZFIN) - reference datasets and information on Danio rerio