Recerca de selenoproteïnes en el genoma d’organimes eucariotes Bioinformàtica, UPF.

Slides:



Advertisements
Similar presentations
Www. GeneOntology.org Gene Ontology Collaboration.
Advertisements

Finding regulatory modules from local alignment - Department of Computer Science & Helsinki Institute of Information Technology HIIT University of Helsinki.
Global Mapping of the Yeast Genetic Interaction Network Tong et. al, Science, Feb 2004 Presented by Bowen Cui.
August 19, 2002Slide 1 Bioinformatics at Virginia Tech David Bevan (BCHM) Lenwood S. Heath (CS) Ruth Grene (PPWS) Layne Watson (CS) Chris North (CS) Naren.
[Bejerano Aut08/09] 1 MW 11:00-12:15 in Beckman B302 Profs: Serafim Batzoglou, Gill Bejerano TA: Cory McLean.
BIOE 109 Summer 2009 Lecture 4- Part I Mutation and genetic variation.
Alternative splicing and evolution Daniel Jeffares.
Comparative Genome Analysis. Comparative yeast genomics Kellis et al (2003) Nature 423,
Genome-Wide RNAi Analysis of Growth and Viability in Drosophila Cells Boutros et al.
Lecture 12 Splicing and gene prediction in eukaryotes
Selenoproteins and Selenium-dependent Redox Signaling Alter Diabetes Risk Dmitri Fomenko Department of Biochemistry and Redox Biology Center University.
Comparative Genomics of the Eukaryotes
Genome projects and model organisms Level 3 Molecular Evolution and Bioinformatics Jim Provan.
Genome of Drosophila species Olga Dolgova UAB Barcelona, 2008.
Chapter 2: From genes to Genomes. 2.1 Introduction.
Nutrient-dependent / Pheromone-controlled Adaptive Evolution J.V. Kohl Independent Researcher, Pheromones.com, Epworth, GA Contact:
1 Orthology and paralogy A practical approach Searching the primaries Searching the secondaries Significance of database matches DB Web addresses Software.
Genome Sequencing & App. of DNA Technologies Genomics is a branch of science that focuses on the interactions of sets of genes with the environment. –
Molecular Biology Primer. Starting 19 th century… Cellular biology: Cell as a fundamental building block 1850s+: ``DNA’’ was discovered by Friedrich Miescher.
Anatomy of a Genome Project A.Sequencing 1. De novo vs. ‘resequencing’ 2.Sanger WGS versus ‘next generation’ sequencing 3.High versus low sequence coverage.
1 Genome Evolution Chapter Introduction Genomes contain the raw material for evolution; Comparing whole genomes enhances – Our ability to understand.
1 Transcript modeling Brent lab. 2 Overview Of Entertainment  Gene prediction Jeltje van Baren  Improving gene prediction with tiling arrays Aaron Tenney.
Web Databases for Drosophila Introduction to FlyBase and Ensembl Database Wilson Leung6/06.
Aging and Reactive oxygen Species. Aging: What is it?  Aging, has been termed generally as a progressive decline in the ability of a physiological process.
Finding genes by comparing genomes roderic guigó i serra imim/upf/crg, barcelona.
Gene Prediction: Similarity-Based Methods (Lecture for CS498-CXZ Algorithms in Bioinformatics) Sept. 15, 2005 ChengXiang Zhai Department of Computer Science.
Comparative genomics Haixu Tang School of Informatics.
Using blast to study gene evolution – an example.
MCB 7200: Molecular Biology Biotechnology terminology Common hosts and experimental organisms Transcription and translation Prokaryotic gene organization.
A complex molecular machinery to specifically incorporate selenocysteine into proteins important for health and disease Laurence Wurth Supervisor: Christine.
David Sadava H. Craig Heller Gordon H. Orians William K. Purves David M. Hillis Biologia.blu B – Le basi molecolari della vita e dell’evoluzione The Eukaryotic.
Flexibility in energy metabolism supports hypoxia tolerance in Drosophila flight muscle: metabolomic and computational systems analysis Jacob Feala 1,2.
Chapter 21 Genomes and Their Evolution. Genomics ______________ is a new approach to biology concerned with the study of the ___________ set of __________.
Biol729 – The kinomes of model organisms. Phylogenetic comparison of the human kinome with those of yeast ( S. cerevisiae), worm (C. elegans) and fly.
Bioinformatics Workshops 1 & 2 1. use of public database/search sites - range of data and access methods - interpretation of search results - understanding.
Chapter 3 The Interrupted Gene.
What is genomics? Genes, promoters, regulatory elements, alignments, trees, …
How many genes are there?
Primary goal: Provide students with a way to understand the use/value of model organisms for gene discovery. Students will use web-based Bioinformatics.
Homework #2 is due 10/17 Bonus #1 is due 10/24 Exam key is online Office hours: M 10/ :30am 2-5pm in Bio 6.
Finding genes in the genome
1 Mona Singh What is computational biology?. 2 Mona Singh Genome The entire hereditary information content of an organism.
Ribonucleotide reductases (RNRs) catalyse the reduction of ribonucleotides to their corresponding 2`-deoxyribonucleotides and therefore play an essential.
Biotechnology and Bioinformatics: Bioinformatics Essential Idea: Bioinformatics is the use of computers to analyze sequence data in biological research.
Eukaryotic genes are interrupted by large introns. In eukaryotes, repeated sequences characterize great amounts of noncoding DNA. Bacteria have compact.
bacteria and eukaryotes
The Transcriptional Landscape of the Mammalian Genome
Genetics and Evolutionary Biology
Protein Families, Motifs & Domains.
Genes and Body plans
Eukaryotic selenoprotein genes identified so far.
Eukaryotic Gene Finding
more regulating gene expression
Genomes and Their Evolution
Prediction of selenoprotein genes in eukaryotic genomes roderic guigó i serra, bioinformatica, UPF curs 2005/ /29/2018 Bioinformatica UPF març.
Genome organization and Bioinformatics
Recerca de selenoproteïnes en el genoma d’organimes eucariotes
Homework #2 is due 10/17 Bonus #1 is due 10/24 FrakenFlowers.
Recerca de selenoproteïnes en el genoma d’organimes eucariotes
Evolution of eukaryote genomes
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Chapter 4 The Interrupted Gene.
Volume 11, Issue 3, Pages (March 2003)
Reprogrammed Genetic Decoding in Cellular Gene Expression
Identification of common factors regulating suspected gene battery
closing in on the set of human genes. The ENCODE project.
Evolutionary genetics
Size Control in Animal Development
Homework #2 is due 10/18 Bonus #1 is due 10/25 Exam key is online.
Comparison of the sequences of Fzo/Mfn and structure of mammalian Mfn2
Presentation transcript:

Recerca de selenoproteïnes en el genoma d’organimes eucariotes Bioinformàtica, UPF.

what are selenoproteins? Selenoproteins are proteins that incorporate selenocysteine, the 21st aminoacid

selenocysteine

Role of selenium Selenium is an essential nutrient for animals, microorganisms and some other eukaryotes. Selenium deficiency may lead to disease –Keshan disease (the primary symptom is myocardial necrosis, which leads to weakening of the heart) Named after a province in Keshan in China with low levels of Selenium. Studies in the Jiangsu province have indicated a reduction in the prevalence of this disease by taking selenium supplements. Excess selenium can be toxic –General Custer and the “Little Big Horn” massacre

Selenium is found in cells mostly in selenoproteins Mostly redox enzimes –Possible antioxidant protection capability Distributed in the three domains of life About 25 known selenoproteins in mammals, but the number varies for different taxa –3 selenoproteins in Drosophila melanogaster –1 selenoprotein in Caenorhabditis elegans Selenoproteins though to be essential for Metazoan life Sometimes the orthologue of a selenoprotein has Cys instead of Sec.

SelU is a selenoprotein in fishes, but it is not in humans

the selenocysteine codon?

the selenocysteine codon:UGA

recoding of UGA

Selenoprotein Biosynthesis

Selenoprotein biosynthesis Synthesis of Selenocysteine (Sec) –SPS1 –SPS2 –SLA/LP –Sec43p Incorporation of Sec into selenoproteins –SBP2 –Efsec –tRNAsec SECIS

Aren’t selenoproteins annotated? selenoproteins are usually incorrectly annotated SelenocysteineGlycineReal STOPNO SECIS!SelenocysteineGlycineReal STOPNO SECIS!

Selenoprotein identification

selenoprotein search: SECIS search SECIS came in a variety of sequences

SECIS search: PatScan

SECIS search in the Drosophila genome 35,876 potential SECIS elements 1,220 termodynamically stable

selenoprotein search: codon bias across TGA

selenoprotein search: SECIS + exon prediction 1.Predict SECIS with PatScan 2.Gene prediction with geneid (allowing TGA- interrupted exons) Geneid uses dynamic programming to chain input exons into gene structures maximizing a log-likelihood function. SECIS predictions and TGA-interrupted exons are now among the input exons. Chaining rules state that SECIS elements can only be chained if they terminate genes containing TGA exons, and that genes containing TGA exon can only be terminated by SECIS predictions.

5’3’ selenoprotein search:

5’3’ selenoprotein search:

5’3’ selenoprotein search:

Independent but coordinated TGA in-frame gene and SECIS prediction Putative selenoprotein 5’3’ selenoprotein search:

selenoprotein search in Drosophila (Castellano et al. EMBO Reports 2: , 2001) SECIS predicted35876 SECIS thermo assessment 1220 Genes predicted12194 Predicted Selenoproteins (4) Real Selenoproteins 3

dSelK

dSelH

dSelK and dSelH are ubiquitous selenoproteins

selenoprotein search in mammalian genomes Larger genome. Much more room for false positive SECIS predictions Poorer gene predicitons.

conserved SECIS between human and mouse

characterization of mammalian selenoproteins (Kryukov et al., Science 300: , 2003 )

selenoprotein search in other vertebrate genomes. SelH alignment

human vs. fugu

SelU

SelU: a novel selenoprotein family

SelU: scattered phylogenetic distribution

Copyright ©2005 by the National Academy of Sciences

Fig. 3. Subcellular localization of SelJ Fig Se labeling

SelJ and crystallins

Copyright ©2005 by the National Academy of Sciences Castellano, Sergi et al. (2005) Proc. Natl. Acad. Sci. USA 102, Fig. 4. Expression pattern of the SelJ gene during development in zebrafish embryos

the eukaryotic selenoproteome

Sergi Castellanos - Doctorat l’any PostDoc - Marla Berry, Universitat de Hawaii - Andrew G. Clark, Cornell University - Sean Eddy, Janelia Farm -Group Leader Max Plank Institute for Evolutioary Antropology, Leipzig

SelH alignment across fly genomes

SelH:alignment at the DNA level

SelH:alignment at the DNA level 3-period conservation pattern

increasing the sequences, increases the signal actgtgacattgactcccatgcaggacttgaca accgtgacattcactcccatgcaggacttgaca ** ******** *********************

increasing the sequences, increases the signal actgtgacattgactcccatgcaggacttgaca accgtgacattcactcccatgcaggacttgaca acagtgacattgacacccatgcaggacttgaca actgtgacatttactccaatacaggacttcaca ** ******** ** ***** ******** ***

increasing the sequences, increases the signal actgtgacattgactcccatgcaggacttgaca accgtgacattcactcccatgcaggacttgaca acagtgacattgacacccatgcaggacttgaca actgtgacatttactccaatacaggacttcacaactgtaacat tgactcccatgcacgacttgaca ** ** ***** ** ***** ** ***** ***

increasing the sequences, increases the signal actgtgacattgactcccatgcaggacttgaca accgtgacattcactcccatgcaggacttgaca acagtgacattgacacccatgcaggacttgaca actgtgacatttactccaatacaggacttcacaactgtaacat tgactcccatgcacgacttgacaactgtgacattgactcccat gcacgacttgact ** ** ***** ** ***** ** ***** **

increasing the sequences, increases the signal actgtgacattgactcccatgcaggacttgaca accgtgacattcactcccatgcaggacttgaca acagtgacattgacacccatgcaggacttgaca actgtgacatttactccaatacaggacttcacaactgtaacat tgactcccatgcacgacttgacaactgtgacattgactcccat gcacgacttgact actgtaactttgactcccatacaggacttgaca ** ** ** ** ** ***** ** ***** **

increasing the sequences, increases the signal actgtgacattgactcccatgcaggacttgaca accgtgacattcactcccatgcaggacttgaca acagtgacattgacacccatgcaggacttgaca actgtgacatttactccaatacaggacttcacaactgtaacat tgactcccatgcacgacttgacaactgtgacattgactcccat gcacgacttgact actgtaactttgactcccatacaggacttgaca accgtgacattgactcccatccaggacttgact ** ** ** ** ** ***** ** ***** **

increasing the sequences, increases the signal actgtgacattgactcccatgcaggacttgaca accgtgacattcactcccatgcaggacttgaca acagtgacattgacacccatgcaggacttgaca actgtgacatttactccaatacaggacttcacaactgtaacat tgactcccatgcacgacttgacaactgtgacattgactcccat gcacgacttgact actgtaactttgactcccatacaggacttgaca accgtgacattgactcccatccaggacttgactactgtgacgt tgactccgatgcaggacttgaca ** ** ** ** ** ** ** ** ***** **

scanning the fly mutiple-alingments for the 3-period conservation pattern across conserved TGA codons

glucose dehydrogenase (gld) proposed as a selenoprotein by Perlaky, S., Merritt, K., and Cavener (1998)

SelH alignment across fly genomes

willistoni lacks all known fly selenoproteins SPS2 does not exist SelK is a Cys homologue

willistoni lacks some of the genes involved in selenoprotein metabolism in addition to SPS2, tRNA-Sec EF-Sec SLA/LP The first animal known to lack selenoproteins

NHGRI, Press Release “Scientists Compare Twelve Fruit Fly Genomes” “ In a surprising finding, researchers found that the genes that produce selenoproteins appear to be absent in the D. willistoni genome. Selenoproteins are responsible for reducing excess amounts of the mineral selenium, an antioxidant found in a variety of food sources. Selenoproteins are present in all animals, including humans. D. willistoni appears to be the first animal known to lack these proteins. However, researchers suggest that D. willistoni may possibly encode selenoproteins in a different way, opening a new avenue for further research.”

Surprisingly willistoni maintains some others Secp43 is highly conserved SPS1 is highly conserved Which implies that these proteins are involved in functions other than selenoprotein biosynthesis

Alignment of SPS1 across Drosophilas (including willistoni)

Questions What are the functional consequences of the loss of selenoproteins (if any)? What is the evolutionary path leading to selenoprotein extinction? –Loss of selenoproteins  Loss of selenoprotein factors –Loss of selenoprotein factors  Loss of selenoproteins

Survival under oxidative stress C. Pallares M.Corominas, F.Serras Genetics, U. Barcelona

Survival under selenium toxicity, A. Punset, M. Corominas, F. Serras, Genetics, UB

Evolutionary path leading to selenoprotein extinction

Search for selenoproteins in other arthropoda

Evolutionary path leading to selenoprotein extinction It can not be attributed to a single evolutionary event A consequence of the relaxation of the selective constraints acting to maintain selenoproteins in other metazoans Dispensability of selenoproteins in insects, maybe related to the fundamental differences in the antioxidant defense systems between these animals and other metazoan.

Charles Chapple -Doctorat l’any PostDoc -Christine Brune INSERM, Marseille

UPF Biologia. Curs selenoproteins in protists

CRG/IMIM/UPF, Barcelona Charles Chapple Tyler Alioto Enrique Blanco Marco Mariotti Universitat de Barcelona Marta Morey Montserrat Corominas Florenci Serras Harvard Unversity, Boston University of Hawaii Nadia Morozova Marla J. Berry University of Nebraska Gregory V. Kryukov Sergey V. Novoselov Vadim N. Gladyshev IBMC, Strasbourg Alain Lescure Alain Krol MPI for Informatics, Saarbrucken Mario Albrecht Thomas Lengauer Barcelona/Hawaii/Cornell/Janelia, Sergi Castellano ACKNOWLEDGEMENTS