polyQ and other homorepeats


polyQ and other homorepeats. Miguel Andrade, Faculty of Biology, Johannes Gutenberg University; Institute of Molecular Biology, Mainz, Germany. andrade@uni-mainz.de

Function of polyQ (Martin Schaefer). polyQ in Huntingtin across species: Human, Dog, Mouse, Opossum, Chicken, Frog, Zebrafish, Trout, Fugu, Stickleback, Lancelet, Capitella, Limpet, Nematostella, Trichoplax, Ciona intestinalis, Ciona savignyi, D. melanogaster, D. mojavensis, D. sechellia, D. erecta, D. yakuba, D. grimshawi, D. pseudoobscura, D. persimilis, D. ananassae, D. willistoni, D. virilis. Schaefer et al. (2012) Nucleic Acids Res.

[Figure: same species list as on the previous slide, Human to D. virilis.] 14 families (trios): human polyQ, fly polyQ, fish no polyQ; P-value < 0.05. Counts shown on the slide: 75/4759, 4293/4759, 354/4759.

[Figure: number of partners (log axis, 1 to 1000) for human proteins in the categories polyQ, TFs, long, and non polyQ.]

[Figure: number of partners (log axis, 1 to 1000) for polyQ, TFs, long, and non polyQ proteins in yeast and in human. Panel labeled "Systems Biology": partners (log axis, 1 to 500) for no polyQ, polyQ 4-14, and polyQ >14.]

86 human polyQ proteins. polyP: 13 polyQ proteins with near polyP (run of P >= 3, max dist 3); 12 C-term. Coiled coil: 109 polyQ regions; 54 overlap/near; N-terminal.

86 human polyQ proteins. polyP: 13 polyQ proteins with near polyP (run of P >= 3, max dist 3); 12 C-term. Coiled coil: 40 human polyQ/coiled-coil proteins (no polyP); 36 N-term.
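
The polyP criterion quoted on these slides (run of P >= 3, max dist 3) can be sketched in a few lines of Python. This is only an illustration, not the pipeline used for the slides. It makes three assumptions: a polyQ is taken as an uninterrupted run of at least 4 Q (the lower bin edge "polyQ 4-14" shown earlier), "dist" is read as the number of residues between the two runs, and the function names are hypothetical.

```python
import re


def find_runs(seq, residue, min_len):
    """Return (start, end) half-open spans of runs of `residue` of length >= min_len."""
    pattern = re.compile(f"{re.escape(residue)}{{{min_len},}}")
    return [m.span() for m in pattern.finditer(seq)]


def polyq_near_polyp(seq, min_q=4, min_p=3, max_dist=3):
    """Pair each polyQ run with polyP runs at most `max_dist` residues away.

    min_q=4 is an assumption; min_p=3 and max_dist=3 follow the slide text.
    """
    pairs = []
    for qs, qe in find_runs(seq, "Q", min_q):
        for ps, pe in find_runs(seq, "P", min_p):
            # Runs of Q and P cannot overlap, so polyP lies fully on one side.
            if ps >= qe:
                gap, side = ps - qe, "C-terminal"
            else:
                gap, side = qs - pe, "N-terminal"
            if gap <= max_dist:
                pairs.append(((qs, qe), (ps, pe), side))
    return pairs


# Made-up sequence: five Q, one A, then four P.
print(polyq_near_polyp("MAQQQQQAPPPPK"))
# prints [((2, 7), (8, 12), 'C-terminal')]
```

Real homorepeat definitions often tolerate interruptions inside the run; the exact-run regular expression here is the simplest possible stand-in.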

86 human polyQ proteins and their interacting proteins (labels: polyP, coiled coil). 49 interactions with another polyQ protein (p-value = 0.0023).

86 human polyQ proteins and their non-polyQ interacting proteins (labels: polyP, coiled coil). Enrichment (p-value < 2.2e-16).

[Schematic: polyQ protein drawn from N-terminal to C-terminal, with coiled coil, polyQ, and polyP regions. Unbound: polyQ disordered.]

[Schematic: the same polyQ protein together with protein X (coiled coil), shown first unbound (polyQ disordered) and then bound.]

ATXN1Q82NT is toxic; ATXN1Q82NT aggregates (Spyros Petrakis, Erich Wanker). Toxicity: transfection into COS-1 cells (monkey fibroblasts), caspase activity assay. Production of aggregates: filter retardation assay. Petrakis et al. (2012) PLoS Genetics.

interactors that change ATXN1Q82NT toxicity

[Figure panels: MED15, PUM1, GST.]

[Model schematic, built up over successive slides. Labels, in order of appearance: Normal polyQ protein; CC; polyQ disordered; CC partner; Toxic polyQ protein; polyQ alpha-helix; polyQ beta-aggregates; increased beta-aggregates; non-CC partner.]

HTT network: 509 proteins, 1319 interactions. Brain network: 88 proteins, 113 interactions. Caudate nucleus network: 66 proteins, 84 interactions. HD dysreg: 14 proteins, 13 interactions. (Erich Wanker, Matthias Futschik, David Fournier.) Stroedicke et al. (2015) Genome Research.

CRMP1, MED15. CRMP is the Drosophila protein.

Exercise 1/2. Search for a polyQ insertion in the MR family. Open the alignment of the mineralocorticoid receptor (MR1_fasta.txt) in Jalview. Find a polyQ insertion. Do you see any other biased region nearby?
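
As a programmatic cross-check of what Exercise 1/2 asks you to find by eye, the sketch below reports the longest run of Q in each sequence of an alignment, together with the alignment columns it spans. It assumes the alignment is stored as aligned FASTA with "-" or "." as gap characters. The function names are hypothetical, and the toy alignment is made up for illustration and is not the MR family data.

```python
def read_fasta(text):
    """Parse FASTA text into {name: sequence}; gap characters are kept."""
    records, name, chunks = {}, None, []
    for line in text.splitlines():
        line = line.strip()
        if line.startswith(">"):
            if name is not None:
                records[name] = "".join(chunks)
            # Use the first word of the header as the name.
            name, chunks = (line[1:].split() or [""])[0], []
        elif line and name is not None:
            chunks.append(line)
    if name is not None:
        records[name] = "".join(chunks)
    return records


def longest_q_run(aligned):
    """Longest run of Q in an aligned sequence, skipping gap characters.

    Returns (length, first_column, last_column) with 0-based alignment
    columns, or (0, None, None) if the sequence has no Q.
    """
    best = (0, None, None)
    length, start = 0, None
    for col, ch in enumerate(aligned):
        if ch in "-.":
            continue  # gaps do not interrupt a run
        if ch.upper() == "Q":
            if length == 0:
                start = col
            length += 1
            if length > best[0]:
                best = (length, start, col)
        else:
            length = 0
    return best


# Toy alignment (made up): seqA carries a Q run where seqB has only gaps.
toy = """>seqA
MKQQ--QQQAL
>seqB
MK-------AL
"""
for name, seq in read_fasta(toy).items():
    print(name, longest_q_run(seq))
# prints:
# seqA (5, 2, 8)
# seqB (0, None, None)
```

To try it on the exercise file, pass the contents of MR1_fasta.txt to `read_fasta`. A sequence whose longest Q run sits in columns where the other sequences are mostly gaps is a candidate polyQ insertion.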

Clustering proteins: FastaHerder2 (Pablo Mier). Mier and Andrade-Navarro (2016) J. Comp. Biol.

Clustering proteins: polyQ, Escherichia (Pablo Mier). Mier and Andrade-Navarro (2016) J. Comp. Biol.

Context of polyX (Pablo Mier). Mier et al. In revision.

Context of polyX: polyQ followed by polyP dependency (Pablo Mier). Mier et al. In revision.

Context of polyX: species-specific differences (Pablo Mier). Mier et al. In revision.

Composition of linkers (Daniel Brüne, Pablo Mier). Brüne et al. In preparation.

Composition of linkers (Daniel Brüne, Pablo Mier). 38 species: 9 Bacteria, 12 Archaea, 17 Eukaryota. Brüne et al. In preparation.

Composition of linkers: DNA binding domain (Daniel Brüne, Pablo Mier). Brüne et al. In preparation.

3D context of polyQ (Franziska Totzeck, Pablo Mier). Totzeck et al. Submitted.

polyQ variability Felix Korda (Joachim Burger)

Exercise 2/2. Find a 3D structure of a polyQ ortholog. Go to FASTAHERDER2: http://cbdm-01.zdv.uni-mainz.de/~munoz/fh2/ Find a cluster containing polyQ and a PDB using mode 4. Find the structure surrounding the place of polyQ insertion. Any problems? If yes, then use this example: Species “Escherichia coli”, PDB “yes”, and polyQ “yes”. Get the E. coli sequence and the one with polyQ and align them. Can you see the polyQ insertions? Compare to PDB:4JNF (from DNAK_ECOLI P0A6Y8).

Exercise 2/2. Find a 3D structure of a polyQ ortholog. FH2 mode 3 with C4YKT4 (Candida albicans, 288 aa, with two polyQ). Align and compare to P01123 (S. cerevisiae, 206 aa): alpha-helix, and polyQ inserts after. See PDB 2BCG chain Y = YPT1.