Codon models R CGT CGC R D GAC GCC A Synonymous substitution Nonsynonymous substitution.

Slides:



Advertisements
Similar presentations
Uses of Cloned Genes sequencing reagents (eg, probes) protein production insufficient natural quantities modify/mutagenesis library screening Expression.
Advertisements

The genetic code.
Towards realistic codon models: among site variability and dependency of synonymous and nonsynonymous rates Itay Mayrose Adi Doron-Faigenboim Eran Bacharach.
Protein Synthesis (making proteins)
ATG GAG GAA GAA GAT GAA GAG ATC TTA TCG TCT TCC GAT TGC GAC GAT TCC AGC GAT AGT TAC AAG GAT GAT TCT CAA GAT TCT GAA GGA GAA AAC GAT AAC CCT GAG TGC GAA.
Supplementary Fig.1: oligonucleotide primer sequences.
Gene Mutations Worksheet
Transcription and Translation
Transcription & Translation Worksheet
Transcription and Translation
 Genetic information, stored in the chromosomes and transmitted to the daughter cells through DNA replication is expressed through transcription to RNA.
Today… Genome 351, 8 April 2013, Lecture 3 The information in DNA is converted to protein through an RNA intermediate (transcription) The information in.
Figure S1. Sequence alignment of yeast and horse cyt-c (Identity~60%), green highly conserved residues. There are 40 amino acid differences in the primary.
GENE MUTATIONS aka point mutations. DNA sequence ↓ mRNA sequence ↓ Polypeptide Gene mutations which affect only one gene Transcription Translation © 2010.
Transcription and Translation
IGEM Arsenic Bioremediation Possibly finished biobrick for ArsR by adding a RBS and terminator. Will send for sequencing today or Monday.
Nature and Action of the Gene
Biological Dynamics Group Central Dogma: DNA->RNA->Protein.
DNA.
More on translation. How DNA codes proteins The primary structure of each protein (the sequence of amino acids in the polypeptide chains that make up.
Undifferentiated Differentiated (4 d) Supplemental Figure S1.
Supplemental Table S1 For Site Directed Mutagenesis and cloning of constructs P9GF:5’ GAC GCT ACT TCA CTA TAG ATA GGA AGT TCA TTT C 3’ P9GR:5’ GAA ATG.
Lecture 10, CS5671 Neural Network Applications Problems Input transformation Network Architectures Assessing Performance.
Fig. S1 siControl E2 G1: 45.7% S: 26.9% G2-M: 27.4% siER  E2 G1: 70.9% S: 9.9% G2-M: 19.2% G1: 57.1% S: 12.0% G2-M: 30.9% siRNF31 E2 A B siRNF31 siControl.
GENE EXPRESSION. Gene Expression Our phenotype is the result of the expression of proteins Different alleles encode for slightly different proteins Protein.
PART 1 - DNA REPLICATION PART 2 - TRANSCRIPTION AND TRANSLATION.
TRANSLATION: information transfer from RNA to protein the nucleotide sequence of the mRNA strand is translated into an amino acid sequence. This is accomplished.
Prodigiosin Production in E. Coli Brian Hovey and Stephanie Vondrak.
Cell Division and Gene Expression
Supplementary materials
DNA & GENES. What is DNA?  DNA (deoxyribonucleic acid) is a nucleic acid  It is composed of smaller units called nucleotides  These are:  A, T, C,
Lesson Four Structure of a Gene. Gene Structure What is a gene? Gene: a unit of DNA on a chromosome that codes for a protein(s) –Exons –Introns –Promoter.
©1998 Timothy G. Standish From DNA To RNA To Protein Timothy G. Standish, Ph. D.
Suppl. Figure 1 APP23 + X Terc +/- Terc +/-, APP23 + X Terc +/- G1Terc -/-, APP23 + X G1Terc -/- G2Terc -/-, APP23 + X G2Terc -/- G3Terc -/-, APP23 + and.
W ARM -U P / EOC P REP 1) The random distribution of homologous chromosomes during the metaphase of meiosis is called what? A. Crossing overB. Budding.
RA(4kb)- Atggagtccgaaatgctgcaatcgcctcttctgggcctgggggaggaagatgaggc……………………………………………….. ……………………………………………. ……………………….,……. …tactacatctccgtgtactcggtggagaagcgtgtcagatag.
Example 1 DNA Triplet mRNA Codon tRNA anticodon A U A T A U G C G
Name of presentation Month 2009 SPARQ-ed PROJECT Mutations in the tumor suppressor gene p53 Pulari Thangavelu (PhD student) April Chromosome Instability.
DNA, RNA and Protein.
The response of amino acid frequencies to directional mutation pressure in mitochondrial genomes is related to the physical properties of the amino acids.
Lesson Four Structure of a Gene.
Lesson Four Structure of a Gene.
Translation PROTEIN SYNTHESIS.
RNA and Protein Synthesis
Protein Synthesis DNA RNA Protein.
Modelling Proteomes.
Supplementary information Table-S1 (Xiao)
Sequence – 5’ to 3’ Tm ˚C Genome Position HV68 TMER7 Δ mt. Forward
Supplemental Table 3. Oligonucleotides for qPCR
GENE MUTATIONS aka point mutations © 2016 Paul Billiet ODWS.
It’s All About Proteins
Supplementary Figure 1 – cDNA analysis reveals that three splice site alterations generate multiple RNA isoforms. (A) c.430-1G>C (IVS 6) results in 3.
DNA By: Mr. Kauffman.
Protein Synthesis Review Answers
Gene architecture and sequence annotation
PROTEIN SYNTHESIS RELAY
More on translation.
Transcription You’re made of meat, which is made of protein.
Molecular engineering of photoresponsive three-dimensional DNA
SC-100 Class 25 Molecular Genetics
Fundamentals of Protein Structure
Transcription and Translation
Central Dogma and the Genetic Code
Python.
Structure of the 5′ Portion of the Human Plakoglobin Gene
Bellringer Please answer on your bellringer sheet:
Station 2 Protein Synethsis.
RNA.
Protein Synthesis Genes: They’re all about ‘dem Proteins!
Presentation transcript:

Codon models R CGT CGC R D GAC GCC A Synonymous substitution Nonsynonymous substitution

Na ï ve assumption: no selection against synonymous substitutions Selection sequence position rate of synonymous substitutions

Synonymous purifying selection (conservation)  Protein folding  Splicing regulatory elements  mRNA structure  Overlapping genes  Codon bias Species 1 Species 2 Species 3 T A ACT GCC ACG GCT ACA GCA T A L T S I CTT ACA AGC ATC L T S I G R GGG CGT GGT CGG GGA CGA G R sequence position

How should we model synonymous selection?

Testing for synonymous selection H0: free from synonymous selection → constant Ks H1: under synonymous selection → variable Ks likelihood ratio test

Research objective Quantify and characterize the magnitude and role of synonymous purifying selection

Comparative sequence data S.cerevisiae S.paradoxusS.mikataeS.bayanusS.castelli > 20 million years 70%-90% coding DNA sequence identity

Comparative sequence data 5,135 datasets of multiple sequence alignments + phylogenies (5,182 of ~6,000 S. cerevisiae genes) Obtained from Wapinski et al., Nature 2007 GATCGATTC GATCGATTA GATCGGTCC GCTCGGTCC GATAGACATGATAGACAT ?

Under synonymous selection Not under synonymous selection 54.4% (2,794) 45.6% (2,341)

position Under significant synonymous selection Under synonymous selection Not under synonymous selection 42% (2,154) 45.6% (2,341) 12.4% (640)

Synonymous selection underlies codon bias Different organisms prefer specific codons over others that encode the same amino acid R:S. cerevisiae AGA48% AGG21% CGA7% CGC6% CGG4% CGU14%

Codon bias maintains translational efficiency Translation speed Translation accuracy

Codon adaptation index (CAI) quantifies codon bias Sharp and Li. Nucleic Acids Res, 1987

Genes under synonymous selection are codon biased

GAT CAA AAT TTT GCT TCA TCT GGT GAT CAA AAT TTT GCG TCG TCC GGA GAT CAA AAT TTT GCA TCT TCC GGC GAT CAA ACT TTT GCG TCC TCA GGC Codons under synonymous selection are biased *

Synonymous selection underlies codon bias position

Codon bias (synonymous selection) derives from protein structure Translation speedTranslation accuracy

S. cerevisiae mitochondrial NADP(+)-dependent isocitrate dehydrogenase (PDB: 2QFY) Codon bias at the protein 3D structure

S. cerevisiae mitochondrial NADP(+)-dependent isocitrate dehydrogenase (PDB: 2QFY) codon bias core > codon bias surface

S. cerevisiae mitochondrial NADP(+)-dependent isocitrate dehydrogenase (PDB: 2QFY) codon bias interface > codon bias surface

MDR1 is a member of the ABC transporter family. They pump drugs out of the cell utilizing ATP, which change conformation of the protein. These proteins were shown to induce multi-drug resistance in various cancers.

C3435T is a synonymous SNP that was reported to be a risk factor for several diseases such as Parkinson’s diseases, colon cancer, and renal epithelial tumor. It can be either because: 1.Change in mRNA level 2.Change in splicing 3.Linkage disequilibrium with other causative SNPs 4.Something else

FACS analysis. In purple – cell transfected with empty vector All other colors – cell trasfected with a vector containing MDR1 (various haplotypes) MDR1 pumps the drug (Bodipy) out of the cells. Bodipy

All other colors – cell trasfected with a vector containing MDR1 – various haplotypes The inhibitor works differently on the various haplotypes

Trypsin works differently on the various haplotypes

They showed that synonymous substitutions did not change protein levels but rather the structure. This was shown by differential response to specific antibodies. Important for linking SNPs to diseases.

Conservation of Ks in pol Mayrose et al. Bioinformatics/ISMB (2007)

DNA flap cPPT CTS ? Conservation of Ks in pol (zoom in)

cPPT A This region serves as a primer for the reverse transcriptase in the synthesis of the plus- strand DNA. cPPT

CTS = Central Termination Sequence A The CTS is involved in the nuclear import of the HIV-1 genome. CTS

???? In Pol one region is of unknown function

Kudla et al. showed that the levels of GFP – which is a protein whose gene can easily be inserted into a host genome and its levels can then be easily quantified, are strongly affected by the secondary structure of the 5 ’ end of the mRNA.

Stable mRNANon stable mRNA Non- stable mRNA secondary structure at the 5 ’ end -> higher GFP level.

Mechanism: stable secondary structures at the 5 ’ end of the mRNA obstruct ribosome binding to the mRNA and result with lower protein levels

Based on that we hypothesized that the 5 ’ end of the mRNA should show signals of strong synonymous selection. This is exactly what we found in our yeast data … In addition, we found that the codon bias is reduced at this region, as to allow non- stable mRNA structures.