1-month Practical Course Genome Analysis Protein Structure-Function Relationships Centre for Integrative Bioinformatics VU (IBIVU) Vrije Universiteit Amsterdam.

Presentation transcript:

1-month Practical Course Genome Analysis: Protein Structure-Function Relationships. Centre for Integrative Bioinformatics VU (IBIVU), Vrije Universiteit Amsterdam, The Netherlands

Genome/DNA, Transcriptome/mRNA, Proteome, Metabolome, Physiome
Protein function: transcription factors, ribosomal proteins, chaperonins, enzymes

Not all proteins are enzymes: crystallin is an eye lens protein that needs to stay stable and transparent for a lifetime (there is very little turnover in the eye lens).

Protein function groups
Catalysis (enzymes)
Binding – transport (active/passive)
– Protein-DNA/RNA binding (e.g. histones, transcription factors)
– Protein-protein interactions (e.g. antibody-lysozyme)
– Protein-fatty acid binding (e.g. apolipoproteins)
– Protein-small molecule binding (drug interaction, structure decoding)
Structural component (e.g. crystallin)
Regulation
Transcription regulation
Signalling
Immune system
Motor proteins (actin/myosin)

What can happen to protein function through evolution
Proteins can have multiple functions (sometimes many, e.g. immunoglobulins).
Enzyme function is defined by specificity and activity.
Through evolution:
– Function and specificity can stay the same
– Function stays the same but specificity changes
– Change to some similar function (e.g. somewhere else in the metabolic system)
– Change to a completely new function

How to arrive at a given function
Divergent evolution – homologous proteins: proteins have the same structure and “same-ish” function
Convergent evolution – analogous proteins: different structure but same function
Question: can homologous proteins change structure (and function)?

Protein function evolution Chymotrypsin ‘Modern’ 2-barrel structure Putative ancestral barrel structure Active site (combination of ancestral active site residues) Activity ,000 times enhanced

How to evolve
Important distinction:
Orthologues: homologous proteins in different species (all deriving from the same ancestor)
Paralogues: homologous proteins in the same species (internal gene duplication)
In practice: to recognise orthology, the bi-directional best hit from a database search program is used (this is called an operational definition).
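The bi-directional best hit criterion can be sketched in a few lines. This is a minimal illustration, not the course's own procedure: the protein names and similarity scores are hypothetical, and a real analysis would take its scores from a database search program run in both directions between two proteomes.

```python
from typing import Dict, Set, Tuple

# Hypothetical similarity scores: (query, subject) -> score.
Hits = Dict[Tuple[str, str], float]


def best_hits(scores: Hits) -> Dict[str, str]:
    """Map each query protein to its highest-scoring subject."""
    best: Dict[str, Tuple[str, float]] = {}
    for (query, subject), score in scores.items():
        if query not in best or score > best[query][1]:
            best[query] = (subject, score)
    return {query: subject for query, (subject, _) in best.items()}


def bidirectional_best_hits(a_vs_b: Hits, b_vs_a: Hits) -> Set[Tuple[str, str]]:
    """Return pairs (a, b) where b is a's best hit and a is b's best hit."""
    forward = best_hits(a_vs_b)
    backward = best_hits(b_vs_a)
    return {(a, b) for a, b in forward.items() if backward.get(b) == a}


# Made-up scores for species A searched against species B, and the reverse.
a_vs_b = {("A1", "B1"): 250, ("A1", "B2"): 90,
          ("A2", "B2"): 180, ("A2", "B1"): 60,
          ("A3", "B1"): 120}
b_vs_a = {("B1", "A1"): 245, ("B1", "A3"): 118,
          ("B2", "A2"): 175}

putative_orthologues = bidirectional_best_hits(a_vs_b, b_vs_a)
print(sorted(putative_orthologues))
```

In this toy data A3 also hits B1 best, but B1 prefers A1, so A3 is not paired. That is the situation the operational definition is meant to filter out (A3 may be a paralogue of A1).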

How to evolve
By addition of domains (at either end of the protein sequence or at loop sites [see next slides])
Often through gene duplication followed by divergence
Multi-domain proteins are a result of gene fusion (multiple genes ending up in a single ORF)
Repetitions of the same domain in a single protein occur frequently (gene duplication followed by gene fusion)

Protein structure evolution
Insertion/deletion of secondary structural elements can ‘easily’ be done at loop sites. These sites are normally at the surface of a protein.

Example: flavodoxin fold, a 5(βα) fold

Flavodoxin family: TOPS diagrams (Flores et al., 1994)
A TOPS diagram is a schematic representation of a protein fold (circle = alpha-helix, triangle = beta-strand).
These are four variations of the same basic topology (bottom). Do you see what is inserted as compared to the basic topology?

Protein structure evolution
Insertion/deletion of structural domains can ‘easily’ be done at loop sites.

The basic functional unit of a protein is the domain
A domain is a:
– Compact, semi-independent unit (Richardson, 1981)
– Stable unit of a protein structure that can fold autonomously (Wetlaufer, 1973)
– Recurring functional and evolutionary module (Bork, 1992)
“Nature is a ‘tinkerer’ and not an inventor” (Jacob, 1977).

Delineating domains is essential for:
– Obtaining high resolution structures (x-ray, NMR)
– Sequence analysis
– Multiple sequence alignment methods
– Prediction algorithms (SS, Class, secondary/tertiary structure)
– Fold recognition and threading
– Elucidating the evolution, structure and function of a protein family (e.g. ‘Rosetta Stone’ method – next lecture)
– Structural/functional genomics
– Cross genome comparative analysis

Pyruvate kinase (phosphotransferase)
– β barrel regulatory domain
– α/β barrel catalytic substrate binding domain
– α/β nucleotide binding domain
1 continuous + 2 discontinuous domains
Structural domain organisation can be nasty…

Complex protein functions are a result of multiple domains
An example is the so-called swivelling domain in pyruvate phosphate dikinase (Herzberg et al., 1996), which carries an intermediate enzymatic product over about 45 Å from the active site of one domain to that of another. This enhances the enzymatic activity: the intermediate product is delivered not by diffusion but by active transport.

The DEATH Domain
Present in a variety of eukaryotic proteins involved in cell death. Six helices enclose a tightly packed hydrophobic core. Some DEATH domains form homotypic and heterotypic dimers.

Globin fold: α protein, myoglobin, PDB: 1MBN

β sandwich: β protein, immunoglobulin, PDB: 7FAB

TIM barrel: α/β protein, Triose phosphate IsoMerase, PDB: 1TIM

A fold in an α+β protein: ribonuclease A, PDB: 7RSA. The red balls represent waters that are ‘bound’ to the protein based on polar contacts.

434 Cro protein complex (phage) PDB: 3CRO

Zinc finger DNA recognition (Drosophila) PDB: 2DRP
..YRCKVCSRVY THISNFCRHY VTSH...

Zinc-finger DNA binding protein family
Characteristics of the family:
Function: The DNA-binding motif is found as part of transcription regulatory proteins.
Structure: One of the most abundant DNA-binding motifs. Proteins may contain more than one finger in a single chain. For example, transcription factor TF3A, the first zinc-finger protein discovered, contains 9 C2H2 zinc-finger motifs (tandem repeats). Each motif consists of 2 antiparallel beta-strands followed by an alpha-helix. A single zinc ion is tetrahedrally coordinated by conserved histidine and cysteine residues, stabilising the motif.
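The conserved cysteine and histidine residues make the C2H2 motif easy to spot in a sequence. The sketch below scans for it with a regular expression. The spacing pattern C-x(2,4)-C-x(12)-H-x(3,5)-H is an assumption (a commonly quoted simplified consensus, not taken from these slides), and the test sequence is the Drosophila fragment shown for PDB 2DRP with the spaces removed.

```python
import re

# Assumed simplified C2H2 consensus: C-x(2,4)-C-x(12)-H-x(3,5)-H.
# Real motif databases use stricter patterns or profiles.
C2H2_PATTERN = re.compile(r"C.{2,4}C.{12}H.{3,5}H")


def find_c2h2(sequence: str):
    """Return (0-based start, matched text) for each candidate C2H2 finger."""
    seq = sequence.replace(" ", "").upper()
    return [(m.start(), m.group()) for m in C2H2_PATTERN.finditer(seq)]


# Fragment as shown on the 2DRP slide (flanking residues are elided there).
fragment = "YRCKVCSRVY THISNFCRHY VTSH"
hits = find_c2h2(fragment)
print(hits)
```

A match only says that four potential zinc ligands occur at plausible spacing. It does not show that the segment folds into a finger or binds DNA.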

Zinc-finger DNA binding protein family
Characteristics of the family:
Binding: Fingers bind to 3 base-pair subsites, and specific contacts are mediated by amino acids in positions -1, 2, 3 and 6 relative to the start of the alpha-helix. Contacts mainly involve one strand of the DNA. Where proteins contain multiple fingers, each finger binds to adjacent subsites within a larger DNA recognition site, thus allowing a relatively simple motif to bind specifically to a wide range of DNA sequences. This means that the number and the type of zinc fingers dictate the specificity of binding to DNA.
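The arithmetic behind this modular recognition is simple: n fingers cover 3n base pairs, one triplet per finger. The sketch below splits a recognition site into its 3-bp subsites. The 9-bp site is purely illustrative, and the function names are hypothetical. Which finger contacts which subsite depends on binding orientation, which is not modelled here.

```python
from typing import List


def recognition_site_length(n_fingers: int, subsite_len: int = 3) -> int:
    """Length in base pairs of the site covered by n adjacent fingers."""
    return n_fingers * subsite_len


def finger_subsites(site: str, subsite_len: int = 3) -> List[str]:
    """Split a DNA recognition site into consecutive finger subsites."""
    if len(site) % subsite_len != 0:
        raise ValueError("site length must be a multiple of the subsite length")
    return [site[i:i + subsite_len] for i in range(0, len(site), subsite_len)]


# Illustrative 9-bp site for a hypothetical three-finger protein.
example_site = "GCGTGGGCG"
subsites = finger_subsites(example_site)
print(subsites)
print(recognition_site_length(9))  # nine tandem fingers, as in TF3A
```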

Leucine zipper (yeast) PDB: 1YSA
..RA RKLQRMKQLE DKVEE LLSKN YHLENEVARL...