The phylogenetics project data revealed! October 4, 2010 BIOS E-127
Geographic Distribution Pathogen = Salmonella Typhi
Why so little turnover? Letters indicate antibiotic resistant haplotypes Selection less effective due to “carriers” with no symptoms!
Codon Usage Bias & Selection October 4, 2010 BIOS E-127
Degenerate Code
Codon Usage Bias Non-random usage of codons Rickettsia (bacteria) - Winkler 1988 Percent codon usage in family Codons for each amino acid Preferred arginine codon Unpreferred arginine codon
Why non-random codon usage? Let’s look at tRNA
Codons correspond to anticodons on tRNAs
Unequal numbers of tRNAs There are six leucine codons So, there are six corresponding tRNAs tRNA copy number differences E. Coli - Ikemura 1981
Species differ in codon usage bias Without considering selection, why might different species have different codon usage patterns? Massey Leucine Codons
Different GC content alters codon usage Mutational Bias 232 Eubacterial genomes - Guchte 2006
Is there selection on codon bias? Four levels of evidence: 1.Genome level 2.Species level 3.Tissue level 4.Gene level Evidence: Non-random patterns of codon usage that are difficult to explain without selection
Selection on codons 1.Throughout genomes Same organism, so not explained by differences in tRNA abundance or mutational bias 2. Between species Why does the pattern decrease from E. coli to Human? Drummond and Wilke 2008 expression optimal codons High Expression level genes have high codon bias
Selection on codons 3. Within tissues Plotkin et al 2004 Red = selectively expressed in human testes Blue – selectively expressed In human ovaries
Selection on codons 4. Within genes Conserved sites more likely to use preferred codons Likely to find Preferred codon Likely to find Unpreferred codon Species in alignemnt: Os = Oryza sativa, Pp = Physcomitrella patens, Dd = Dictyostelium discoideum, Hs = Homo sapiens, Dr = Danio reiro, Dm = Drosophila melanogaster, Sc = Saccharomyces cerevisiae, Sp = Schizosaccharomyces pombe, Pf = Plasmodium falciparumGene = ubiquitin/SUMO-1 like protein Wang et. al. 2001
Why is there selection on codons? Many hypotheses relate to speed and accuracy of translation Translation Speed: Use of rare, unpreferred codons means the ribosome has to wait until a rare tRNA passes by. Translation Accuracy: While waiting, the liklihood that the wrong tRNA inserts the wrong amino acid may increase.
Codon Bias and Translation Speed Faster translation could allow more protein to be made: Might explain why highly expressed genes have more preferred codons *Does not explain within gene codon bias Slow translation may be desirable when translating certain parts of the protein: Translation speed may correlate with folding Some parts fold quickly, but some fold slowly Means sometimes unpreferred codons are advantageous Selection results from the cost of improper folding How could translation speed pose a selective cost or advantage? Hypotheses
Costs of improper protein folding
Codon Bias and Translation Accuracy Errors in translation affect folding of protein product: Loss of function Wasted resources Harmful misfolded byproducts Errors in translation affect function directly: Protein folds but cannot do it’s job How do these address correlation between highly expressed genes and codon bias? How could translation accuracy pose a selective cost or advantage? Hypotheses
Weak selection Is population size dependent – WHY? Smaller populations are more stochastic: 7 heads and 3 tails in 10 coin flips - possible vs. 70,000 heads and 30,000 tails in 100,000 coin flips – rare!
Weak selection Is population size dependent Other possible types of weak selection: Do humans and mammals have selection on codons? Much stronger in microbes! Allow us to study these patterns -ascertain mechanistic basis for selection -determine average effect of mutation Genome size Number of introns Urrutia & Hurst 2003
1.Could effect dN/dS (or other measures of selection)...but, It’s very weak selection that doesn’t effect all genes 1.Useful to detect HGT 2.Helpful to consider when building genetically modified organisms 4.Medical applications – Coleman et al Implications of selection on synonymous sites Pride et al 2006