Using Exons to Define Isoforms in PRO Timothy Danford Novartis Institutes for Biomedical Research PRO / AlzForum Kickoff Meeting Oct. 4, 2011.

Slides:



Advertisements
Similar presentations
Molecular Genetics DNA RNA Protein Phenotype Genome Gene
Advertisements

RNA and Protein Synthesis
© 2006 W.W. Norton & Company, Inc. DISCOVER BIOLOGY 3/e
FINAL EXAM: TAKE-HOME Assessment of Significance in Cancer Gene SNPs.
Characterizing Alternative Splicing With Respect To Protein Domains BME 220 Project Charlie Vaske.
Sequence Analysis. Today How to retrieve a DNA sequence? How to search for other related DNA sequences? How to search for its protein sequence? How to.
Transcription and Translation
2.7 DNA Replication, transcription and translation
Transcription Transcription is the synthesis of mRNA from a section of DNA. Transcription of a gene starts from a region of DNA known as the promoter.
Integration of PRO and UniProtKB Amherst, NY May 16, 2013 Cathy H. Wu, Ph.D. PRO-PO-GO Meeting.
Bikash Shakya Emma Lang Jorge Diaz.  BLASTx entire sequence against 9 plant genomes. RepeatMasker  55.47% repetitive sequences  82.5% retroelements.
발표자 석사 2 년 김태형 Vol. 11, Issue 3, , March 2001 Comparative DNA Sequence Analysis of Mouse and Human Protocadherin Gene Clusters 인간과 마우스의 PCDH 유전자.
RNA and Protein Synthesis
1 The Interrupted Gene. Ex Biochem c3-interrupted gene Introduction Figure 3.1.
COURSE OF BIOINFORMATICS Exam_31/01/2014 A.
1 Genes and How They Work Chapter Outline Cells Use RNA to Make Protein Gene Expression Genetic Code Transcription Translation Spliced Genes – Introns.
Molecular Biology in a Nutshell (via UCSC Genome Browser) Personalized Medicine: Understanding Your Own Genome Fall 2014.
Protein Synthesis 6C transcription & translation.
DNA TO RNA Transcription is the process of creating a molecule that can carry the genetic blueprint for a particular protein coding gene from the DNA.
Fea- ture Num- ber Feature NameFeature description 1 Average number of exons Average number of exons in the transcripts of a gene where indel is located.
Review of Protein Synthesis. Fig TRANSCRIPTION TRANSLATION DNA mRNA Ribosome Polypeptide (a) Bacterial cell Nuclear envelope TRANSCRIPTION RNA PROCESSING.
AP Biology Discussion Notes Friday 02/06/2015. Goals for Today Be able to describe RNA processing and why it is EVOLUTIONARILY important. In a more specific.
Cell Protein Production. Transcription : process of mRNA formation. 1. Triggered by chem. messengers from cytoplasm which bind to DNA 2. This causes release.
Complexities of Gene Expression Cells have regulated, complex systems –Not all genes are expressed in every cell –Many genes are not expressed all of.
A brief comment on nomenclature. A croe (coding region of an exon) is not exactly an exon Steve Mount ISMB SIG meeting on alternative splicing June 23,
Chapter 2 From Genes to Genomes. 2.1 Introduction We can think about mapping genes and genomes at several levels of resolution: A genetic (or linkage)
While replication, one strand will form a continuous copy while the other form a series of short “Okazaki” fragments Genetic traits can be transferred.
Bioinformatics Workshops 1 & 2 1. use of public database/search sites - range of data and access methods - interpretation of search results - understanding.
Chapter 3 The Interrupted Gene.
The Genetic Code. The DNA that makes up the human genome can be subdivided into information bytes called genes. Each gene encodes a unique protein that.
Mutations Learning Goal: Identify mutations in DNA (point mutation and frameshift mutation caused by insertion or deletion) and explain how they can affect.
Lesson Four Structure of a Gene. Gene Structure What is a gene? Gene: a unit of DNA on a chromosome that codes for a protein(s) –Exons –Introns –Promoter.
Transcription and Translation. Central Dogma of Molecular Biology  The flow of information in the cell starts at DNA, which replicates to form more DNA.
CFE Higher Biology DNA and the Genome Transcription.
DAY 2. Warm Up What type of RNA copies DNA? – mRNA What is this process called? – Transcription.
The Central Dogma of Molecular Biology DNA  RNA  Protein  Trait.
Visualization of genomic data Genome browsers. UCSC browser Ensembl browser Others ? Survey.
COURSE OF BIOINFORMATICS Exam_30/01/2014 A.
Alternative Splicing. mRNA Splicing During RNA processing internal segments are removed from the transcript and the remaining segments spliced together.
Genetic Code and Interrupted Gene Chapter 4. Genetic Code and Interrupted Gene Aala A. Abulfaraj.
Chapter – 10 Part II Molecular Biology of the Gene - Genetic Transcription and Translation.
SC.912.L.16.3 DNA Replication. – During DNA replication, a double-stranded DNA molecule divides into two single strands. New nucleotides bond to each.
HOW PROTEINS ARE MADE BY THE CELL
Lesson Four Structure of a Gene.
Lesson Four Structure of a Gene.
Sequence based searches:
Relationship between Genotype and Phenotype
Transcription and Translation
BTY100-Lec#4.2 DNA to Protein (Central Dogma).
Relationship between Genotype and Phenotype
Protein Synthesis Genetics.
Transcription and Translation
Protein Synthesis in Detail
Transcription and Translation
Transcription and Translation
Ensembl Genome Repository.
Chapter 4 The Interrupted Gene.
Week 6 Vocab Definitions
Transcription and Translation
HOPX: The Unusual Homeodomain-Containing Protein
Relationship between Genotype and Phenotype
Essential Question: How cells make proteins
Transcription and Translation
Alternative Splicing May Not Be the Key to Proteome Complexity
Transcription and Translation
Higher Biology Unit 1: 1.3 Transcription.
From gene to protein.
Unit 4 - The Natural Environment and Species Survival
Basic Local Alignment Search Tool
Sequences of two isoforms of Arhgap39/Porf-2 protein in mouse.
Presentation transcript:

Using Exons to Define Isoforms in PRO Timothy Danford Novartis Institutes for Biomedical Research PRO / AlzForum Kickoff Meeting Oct. 4, 2011

Genes vs. Proteins Gene Transcript Exon Locus Allele Variant SNP Indel Rearrangement Motif Protein Isoform Variant Domain Site Complex Motif Fragment Can we join the worlds of PRO and of Genes, at a finer-grained level than that of “full sequence?”

Isoforms in PRO Today PRO v23 (10/2/2011) [Term] id: PR: name: microtubule-associated protein tau def: "A protein that is a translation product of the MAPT gene or a 1:1 ortholog thereof." [PRO:DNx] comment: Category=gene. Flag=automatic. synonym: "MAPT" EXACT PRO-short-label [] synonym: "neurofibrillary tangle protein" EXACT [] synonym: "paired helical filament-tau" EXACT [] synonym: "PHF-tau" EXACT [] synonym: "MAPTL" RELATED [] synonym: "Mtapt" RELATED [] synonym: "MTBT1" RELATED [] synonym: "TAU" RELATED [] is_a: PR: ! protein

Isoforms in PRO Today PRO v23 (10/2/2011) [Term] id: PR: name: microtubule-associated protein tau isoform Fetal-tau def: "A microtubule-associated protein tau that is a translation product of some mRNA giving rise to a protein with the amino acid sequence represented by UniProtKB:P or a 1:1 ortholog thereof." [PRO:DAN] comment: Category=sequence. synonym: "Fetal-tau" EXACT [] is_a: PR: ! microtubule-associated protein tau

Isoforms in PRO Today PRO v23 (10/2/2011)

Isoforms in PRO Today PRO v23 (10/2/2011)

Digression: Visual Notation

Isoforms in PRO Today PRO v23 (10/2/2011)

Isoforms in PRO Today PRO v23 (10/2/2011)

Isoforms in PRO Today PRO v23 (10/2/2011)

Tau Isoforms Share Functionally-relevant Exons Fetal Tau Adult Tau Slide: Gwen Wong (AlzForum), Image: “Conserved Protein Domains in Tau Suggest Functional Differences between Protein Isoforms”

What Questions Could We Ask of PRO + Genomic Data? Which isoform corresponds to which transcript(s)? Which isoforms share a common feature? – common exons? – common domains? (pfam, interpro, etc.) Which “normal” protein isoforms overlap with SNPs or other genetic variants? – How do protein sites line up to sites on the gene? How do mouse and human proteins correspond?

“Which isoform corresponds with which transcript(s)?” Transcript Variant: This variant (4) lacks six internal coding exons, as compared to variant 6. The reading frame is not affected, and the resulting isoform (4) has identical N- and C-termini but lacks five segments, as compared to isoform 6.

Define Exons as Parts-of-Proteins

Defined class of Isoforms based on has_part and lacks_part to particular exons

Integrate Existing Isoforms

How is “MAPT Exon 2” defined? Take the “exon” definition from SO: – “A region of the transcript sequence within a gene which is not removed from the primary RNA transcript by RNA splicing.” Exon number defined relative to the full-length or “canonical” transcript – “An exon that corresponds (aligns) to the second of 13 exons in the full-length MAPT transcript...” Define the part of the protein derived from this portion of the transcript…

What Questions Could We Ask of PRO + Genomic Data? Which isoform corresponds to which transcript(s)? Which isoforms share a common feature? – common exons? – common domains? (pfam, interpro, etc.) Which “normal” protein isoforms overlap with SNPs or other genetic variants? – How do protein sites line up to sites on the gene? How do mouse and human proteins correspond?

What Questions Could We Ask of PRO + Genomic Data? Which isoform corresponds to which transcript(s)? Which isoforms share a common feature? – common exons? – common domains? (pfam, interpro, etc.) Which “normal” protein isoforms overlap with SNPs or other genetic variants? – How do protein sites line up to sites on the gene? How do mouse and human proteins correspond?

What Questions Could We Ask of PRO + Genomic Data? Which isoform corresponds to which transcript(s)? Which isoforms share a common feature? – common exons? – common domains? (pfam, interpro, etc.) Which “normal” protein isoforms overlap with SNPs or other genetic variants? – How do protein sites line up to sites on the gene? How do mouse and human proteins correspond?