Presentation is loading. Please wait.

Presentation is loading. Please wait.

Codon models R CGT CGC R D GAC GCC A Synonymous substitution Nonsynonymous substitution.

Similar presentations


Presentation on theme: "Codon models R CGT CGC R D GAC GCC A Synonymous substitution Nonsynonymous substitution."— Presentation transcript:

1

2 Codon models R CGT CGC R D GAC GCC A Synonymous substitution Nonsynonymous substitution

3 Na ï ve assumption: no selection against synonymous substitutions Selection sequence position rate of synonymous substitutions

4 Synonymous purifying selection (conservation)  Protein folding  Splicing regulatory elements  mRNA structure  Overlapping genes  Codon bias Species 1 Species 2 Species 3 T A ACT GCC ACG GCT ACA GCA T A L T S I CTT ACA AGC ATC L T S I G R GGG CGT GGT CGG GGA CGA G R sequence position

5 How should we model synonymous selection?

6 Testing for synonymous selection H0: free from synonymous selection → constant Ks H1: under synonymous selection → variable Ks likelihood ratio test

7 Research objective Quantify and characterize the magnitude and role of synonymous purifying selection

8 Comparative sequence data S.cerevisiae S.paradoxusS.mikataeS.bayanusS.castelli > 20 million years 70%-90% coding DNA sequence identity

9 Comparative sequence data 5,135 datasets of multiple sequence alignments + phylogenies (5,182 of ~6,000 S. cerevisiae genes) Obtained from Wapinski et al., Nature 2007 GATCGATTC GATCGATTA GATCGGTCC GCTCGGTCC GATAGACATGATAGACAT ?

10 Under synonymous selection Not under synonymous selection 54.4% (2,794) 45.6% (2,341)

11 position Under significant synonymous selection Under synonymous selection Not under synonymous selection 42% (2,154) 45.6% (2,341) 12.4% (640)

12

13 Synonymous selection underlies codon bias Different organisms prefer specific codons over others that encode the same amino acid R:S. cerevisiae AGA48% AGG21% CGA7% CGC6% CGG4% CGU14%

14 Codon bias maintains translational efficiency Translation speed Translation accuracy

15 Codon adaptation index (CAI) quantifies codon bias Sharp and Li. Nucleic Acids Res, 1987

16 Genes under synonymous selection are codon biased

17

18 GAT CAA AAT TTT GCT TCA TCT GGT GAT CAA AAT TTT GCG TCG TCC GGA GAT CAA AAT TTT GCA TCT TCC GGC GAT CAA ACT TTT GCG TCC TCA GGC Codons under synonymous selection are biased *

19 Synonymous selection underlies codon bias position

20 Codon bias (synonymous selection) derives from protein structure Translation speedTranslation accuracy

21 S. cerevisiae mitochondrial NADP(+)-dependent isocitrate dehydrogenase (PDB: 2QFY) Codon bias at the protein 3D structure

22 S. cerevisiae mitochondrial NADP(+)-dependent isocitrate dehydrogenase (PDB: 2QFY) codon bias core > codon bias surface

23 S. cerevisiae mitochondrial NADP(+)-dependent isocitrate dehydrogenase (PDB: 2QFY) codon bias interface > codon bias surface

24

25

26

27 MDR1 is a member of the ABC transporter family. They pump drugs out of the cell utilizing ATP, which change conformation of the protein. These proteins were shown to induce multi-drug resistance in various cancers.

28 C3435T is a synonymous SNP that was reported to be a risk factor for several diseases such as Parkinson’s diseases, colon cancer, and renal epithelial tumor. It can be either because: 1.Change in mRNA level 2.Change in splicing 3.Linkage disequilibrium with other causative SNPs 4.Something else

29 FACS analysis. In purple – cell transfected with empty vector All other colors – cell trasfected with a vector containing MDR1 (various haplotypes) MDR1 pumps the drug (Bodipy) out of the cells. Bodipy

30 All other colors – cell trasfected with a vector containing MDR1 – various haplotypes The inhibitor works differently on the various haplotypes

31 Trypsin works differently on the various haplotypes

32 They showed that synonymous substitutions did not change protein levels but rather the structure. This was shown by differential response to specific antibodies. Important for linking SNPs to diseases.

33

34 Conservation of Ks in pol Mayrose et al. Bioinformatics/ISMB (2007)

35 DNA flap cPPT CTS ? Conservation of Ks in pol (zoom in)

36 cPPT A This region serves as a primer for the reverse transcriptase in the synthesis of the plus- strand DNA. cPPT

37 CTS = Central Termination Sequence A The CTS is involved in the nuclear import of the HIV-1 genome. CTS

38 ???? In Pol one region is of unknown function

39

40

41 Kudla et al. showed that the levels of GFP – which is a protein whose gene can easily be inserted into a host genome and its levels can then be easily quantified, are strongly affected by the secondary structure of the 5 ’ end of the mRNA.

42 Stable mRNANon stable mRNA Non- stable mRNA secondary structure at the 5 ’ end -> higher GFP level.

43 Mechanism: stable secondary structures at the 5 ’ end of the mRNA obstruct ribosome binding to the mRNA and result with lower protein levels

44 Based on that we hypothesized that the 5 ’ end of the mRNA should show signals of strong synonymous selection. This is exactly what we found in our yeast data … In addition, we found that the codon bias is reduced at this region, as to allow non- stable mRNA structures.


Download ppt "Codon models R CGT CGC R D GAC GCC A Synonymous substitution Nonsynonymous substitution."

Similar presentations


Ads by Google