Bioinformatics Alternative splicing Multiple isoforms Exonic Splicing Enhancers (ESE) and Silencers (ESS) SpliceNest Lecture 13.

Slides:



Advertisements
Similar presentations
EAnnot: A genome annotation tool using experimental evidence Aniko Sabo & Li Ding Genome Sequencing Center Washington University, St. Louis.
Advertisements

Control of Gene Expression
Genomics: READING genome sequences ASSEMBLY of the sequence ANNOTATION of the sequence carry out dideoxy sequencing connect seqs. to make whole chromosomes.
Lecture 4: DNA transcription
Two short pieces MicroRNA Alternative splicing.
Peter Tsai, Bioinformatics Institute.  University of California, Santa Cruz (UCSC)  A rapid and reliable display of any requested portion of genomes.
15-20 september WABI031 A Method to Detect Gene Structure and Alternative Splice Sites by Agreeing ESTs to a Genomic Sequence Paola Bonizzoni Graziano.
1 Computational Molecular Biology MPI for Molecular Genetics DNA sequence analysis Gene prediction Gene prediction methods Gene indices Mapping cDNA on.
Introduction to Bioinformatics Lecturer: Dr. Yael Mandel-Gutfreund Teaching Assistant: Shula Shazman Sivan Bercovici Course web site :
Alignment of mRNAs to genomic DNA Sequence Martin Berglund Khanh Huy Bui Md. Asaduzzaman Jean-Luc Leblond.
1 Alternative Splicing. 2 Eukaryotic genes Splicing Mature mRNA.
1 Gene Finding Charles Yan. 2 Gene Finding Genomes of many organisms have been sequenced. We need to translate the raw sequences into knowledge. Where.
CSE182-L12 Gene Finding.
Alternative splicing and evolution Daniel Jeffares.
16 and 20 February, 2004 Chapter 9 Genomics Mapping and characterizing whole genomes.
The Influence of Alternative Splicing in Protein Structure The fact that gene number is not significantly different between mammals and some invertebrates.
Sequence Analysis. Today How to retrieve a DNA sequence? How to search for other related DNA sequences? How to search for its protein sequence? How to.
Lecture 12 Splicing and gene prediction in eukaryotes
RNA processing. RNA species in cells RNA processing.
Doug Brutlag Professor Emeritus Biochemistry & Medicine (by courtesy) Genome Databases Computational Molecular Biology Biochem 218 – BioMedical Informatics.
Genome Sequencing & App. of DNA Technologies Genomics is a branch of science that focuses on the interactions of sets of genes with the environment. –
Alternative Splicing. mRNA Splicing During RNA processing internal segments are removed from the transcript and the remaining segments spliced together.
Genome Informatics 2005 ~ 220 participants 1 keynote speaker: David Haussler 47 talks 121 posters.
LECTURE 2 Splicing graphs / Annoteted transcript expression estimation.
Arabidopsis Genome Annotation TAIR7 Release. Arabidopsis Genome Annotation  Overview of releases  Current release (TAIR7)  Where to find TAIR7 release.
Genome Sequencing & App. of DNA Technologies Genomics is a branch of science that focuses on the interactions of sets of genes with the environment. –
Ch. 21 Genomes and their Evolution. New approaches have accelerated the pace of genome sequencing The human genome project began in 1990, using a three-stage.
MPL Identification of alternative spliced mRNA variants related to cancers by genome-wide ESTs alignment KIM DAE SOO Oncogene Apr.
Chapter 21 Eukaryotic Genome Sequences
1 Transcript modeling Brent lab. 2 Overview Of Entertainment  Gene prediction Jeltje van Baren  Improving gene prediction with tiling arrays Aaron Tenney.
Web Databases for Drosophila Introduction to FlyBase and Ensembl Database Wilson Leung6/06.
Sackler Medical School
Gene Prediction: Similarity-Based Methods (Lecture for CS498-CXZ Algorithms in Bioinformatics) Sept. 15, 2005 ChengXiang Zhai Department of Computer Science.
Central dogma: the story of life RNA DNA Protein.
Complexities of Gene Expression Cells have regulated, complex systems –Not all genes are expressed in every cell –Many genes are not expressed all of.
Bioinformatics and Computational Biology
Alternative Splicing (a review by Liliana Florea, 2005) CS 498 SS Saurabh Sinha 11/30/06.
Today Elements of complex genomes Protein domains and exon shuffling
David Sadava H. Craig Heller Gordon H. Orians William K. Purves David M. Hillis Biologia.blu B – Le basi molecolari della vita e dell’evoluzione The Eukaryotic.
Genes and Genomes. Genome On Line Database (GOLD) 243 Published complete genomes 536 Prokaryotic ongoing genomes 434 Eukaryotic ongoing genomes December.
Introduction to Bioinformatics II Lecture 5 By Ms. Shumaila Azam.
Pre-mRNA secondary structures influence exon recognition Michael Hiller Bioinformatics Group University of Freiburg, Germany.
Comparative Genomics Methods for Alternative Splicing of Eukaryotic Genes Liliana Florea Department of Computer Science Department of Biochemistry GWU.
Exploring and Exploiting the Biological Maze Zoé Lacroix Arizona State University.
Research about Alternative Splicing recently 楊佳熒.
While replication, one strand will form a continuous copy while the other form a series of short “Okazaki” fragments Genetic traits can be transferred.
Bioinformatics Workshops 1 & 2 1. use of public database/search sites - range of data and access methods - interpretation of search results - understanding.
Motif Search and RNA Structure Prediction Lesson 9.
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS) LECTURE 13 ANALYSIS OF THE TRANSCRIPTOME.
Intro to Probabilistic Models PSSMs Computational Genomics, Lecture 6b Partially based on slides by Metsada Pasmanik-Chor.
UCSC Genome Browser Zeevik Melamed & Dror Hollander Gil Ast Lab Sackler Medical School.
Finding genes in the genome
Ligate tags SAGE: Procedure Digest with “Tagging enzyme” BsmFI tm Isolate mRNA, RT to cDNA Digest with “Anchoring.
Biotechnology and Bioinformatics: Bioinformatics Essential Idea: Bioinformatics is the use of computers to analyze sequence data in biological research.
Chapter 14 Opener RNA Splicing (In Eukaryotes)
Looking Within Human Genome King abdulaziz university Dr. Nisreen R Tashkandy GENOMICS ; THE PIG PICTURE.
The Transcriptional Landscape of the Mammalian Genome
Today… Review a few items from last class
Genome organization and Bioinformatics
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Introduction to Bioinformatics II
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Ensembl Genome Repository.
Chapter 4 The Interrupted Gene.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Alternative RNA Splicing
Alternative Splicing: New Insights from Global Analyses
Introduction to Alternative Splicing and my research report
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Presentation transcript:

Bioinformatics Alternative splicing Multiple isoforms Exonic Splicing Enhancers (ESE) and Silencers (ESS) SpliceNest Lecture 13

ALTERNATIVE SPLICING Two or more mRNA molecules can be produced from the same gene Number of mRNAs produced by Dscam gene in Drosophila melanogaster exceeds 38, 016 different mature transcripts! The entire Drosophila genome consists of only ~14,000 gene. Gene mRNA 1 mRNA 2

One Gene Many Proteins The classical vision ONE GENE ONE PROTEIN is not correct for at least 40-60% of studied mammalian genes Data show that many variants of mRNA and proteins can be produced from the same gene mRNA 1 Protein 1 Gene mRNA 2 Protein 2 mRNA 3 Protein 3

While gene prediction can be done relatively precisely, this may not be sufficient to predict structure of the mature mRNA Different alternative mRNA isoforms can be produced from the same gene in different tissues and in different time It means that numerous factors can enhance of silence certain splicing points Identification of these factors is essential for improving the predictive power of computer programs It is particularly important to combine experimental and computational studies in order to get progress in this field Gene prediction/identification and alternative splicing

Five common models of mRNA alternative splicing Constitutive exonAlternatively spliced exon Exon skipping/inclusion Alternative 3’ splice sites Alternative 5’ splice sites Mutually exclusive exons Intron retention

Alternative splicing of the  -tropomyosin gene mRNA

Models of serine/arginine reach protein action in Exonic Splicing Enhancer (ESE) dependent splicing U2 snRNP – small nuclear ribonucleoprotein; RRM- RNA recognition motif; RS – Arg/Ser enriched domain ESS – Exonic Splicing Silencer; THE MODELS ARE NOT MUTUALY EXCLUISIVE AND MAY HAVE NUMEROUS VARIATIONS

ESEs play important roles in constitutive and alternative splicing. A computational method, RESCUE-ESE, was developed that predicts which sequences have ESE activity by statistical analysis of exon-intron and splice site composition. When large data sets of human gene sequences were used, this method identified 10 predicted ESE motifs. Representatives of all 10 motifs were found to display enhancer activity in vivo, whereas point mutants of these sequences exhibited sharply reduced activity. The motifs identified enable prediction of the splicing phenotypes of exonic mutations in human genes Predictive identification of exonic splicing enhancers (ESE) in human genes

Consensus RNA motifs for the sites attracting four serine/arginine reach proteins acting as exonic splicing enhancers (ESE)

An expressed sequence tag (EST) is a small part of the active part of a gene, made from cDNA, which can be used to fish the rest of the gene out of the chromosome, by matching base pairs with part of the gene. ESTs and particularly consensus of sequences of clustered ESTs provide useful information about splice variants of genes. Predicted human mRNA sequences were mapped onto human genomic DNA to compute gene structure and splice variants. The results have been collected in a public database, SpliceNest, with a web based interactive graphical user interface. Similar computations can be done for several other species. Expressed Sequence Tags and splice sites

htpp://splicenest.molgen.mpg.de/

SpliceNest is a tool to explore gene structure, including alternative splicing, based on a mapping on the EST consensus sequences (contigs) from GeneNest to the complete human genome. SpliceNest is integrated with GeneNest and the SYSTERS protein sequence cluster set in one framework, permitting an overall exploration of the whole sequence space covering protein, mRNA and EST sequences, as well as genomic DNA. SpliceNest: visualizing gene structure and alternative splicing based on EST clusters

Cluster: A group of ESTs and/or mRNAs that are sufficiently similar to assume that they constitute transcripts from the same gene. Contig: A representation of a (partial) transcript summarized by a consensus sequence, created by multiple alignment of overlapping sequences.

Alternative splice candidates