Lecture 25 - Phylogeny Based on Chapter 23 - Molecular Evolution Copyright © 2010 Pearson Education Inc.

Slides:



Advertisements
Similar presentations
Genetica per Scienze Naturali a.a prof S. Presciuttini Homologous genes Genes with similar functions can be found in a diverse range of living things.
Advertisements

Phylogenetic Tree A Phylogeny (Phylogenetic tree) or Evolutionary tree represents the evolutionary relationships among a set of organisms or groups of.
Bioinformatics Phylogenetic analysis and sequence alignment The concept of evolutionary tree Types of phylogenetic trees Measurements of genetic distances.
. Class 9: Phylogenetic Trees. The Tree of Life Evolution u Many theories of evolution u Basic idea: l speciation events lead to creation of different.
An Introduction to Phylogenetic Methods
 Aim in building a phylogenetic tree is to use a knowledge of the characters of organisms to build a tree that reflects the relationships between them.
Lecture 3 Molecular Evolution and Phylogeny. Facts on the molecular basis of life Every life forms is genome based Genomes evolves There are large numbers.
1 General Phylogenetics Points that will be covered in this presentation Tree TerminologyTree Terminology General Points About Phylogenetic TreesGeneral.
Phylogenetic reconstruction
Molecular Evolution Revised 29/12/06
Current Approaches to Whole Genome Phylogenetic Analysis Hongli Li.
BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference.
UPGMA and FM are distance based methods. UPGMA enforces the Molecular Clock Assumption. FM (Fitch-Margoliash) relieves that restriction, but still enforces.
Review of cladistic technique Shared derived (apomorphic) traits are useful in understanding evolutionary relationships Shared primitive (plesiomorphic)
Genetica per Scienze Naturali a.a prof S. Presciuttini Human and chimpanzee genomes The human and chimpanzee genomes—with their 5-million-year history.
Bioinformatics and Phylogenetic Analysis
The Tree of Life From Ernst Haeckel, 1891.
Branch lengths Branch lengths (3 characters): A C A A C C A A C A C C Sum of branch lengths = total number of changes.
With astonishing advance of the Human Genome Project, essentially all human genomic sequences are available in public databases. The major task for the.
Alignment III PAM Matrices. 2 PAM250 scoring matrix.
Topic : Phylogenetic Reconstruction I. Systematics = Science of biological diversity. Systematics uses taxonomy to reflect phylogeny (evolutionary history).
Phylogenetic trees Sushmita Roy BMI/CS 576
Sequencing a genome and Basic Sequence Alignment
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Multiple Sequence Alignment CSC391/691 Bioinformatics Spring 2004 Fetrow/Burg/Miller (Slides by J. Burg)
Multiple Sequence Alignments and Phylogeny.  Within a protein sequence, some regions will be more conserved than others. As more conserved,
Phylogenetic analyses Kirsi Kostamo. The aim: To construct a visual representation (a tree) to describe the assumed evolution occurring between and among.
Phylogeny Estimation: Traditional and Bayesian Approaches Molecular Evolution, 2003
Terminology of phylogenetic trees
Molecular phylogenetics
Protein Evolution and Sequence Analysis Protein Evolution and Sequence Analysis.
Molecular evidence for endosymbiosis Perform blastp to investigate sequence similarity among domains of life Found yeast nuclear genes exhibit more sequence.
Molecular basis of evolution. Goal – to reconstruct the evolutionary history of all organisms in the form of phylogenetic trees. Classical approach: phylogenetic.
Phylogenetic Analysis. General comments on phylogenetics Phylogenetics is the branch of biology that deals with evolutionary relatedness Uses some measure.
Computational Biology, Part D Phylogenetic Trees Ramamoorthi Ravi/Robert F. Murphy Copyright  2000, All rights reserved.
Bioinformatics 2011 Molecular Evolution Revised 29/12/06.
20.1 Structural Genomics Determines the DNA Sequences of Entire Genomes The ultimate goal of genomic research: determining the ordered nucleotide sequences.
 Read Chapter 4.  All living organisms are related to each other having descended from common ancestors.  Understanding the evolutionary relationships.
Molecular phylogenetics 4 Level 3 Molecular Evolution and Bioinformatics Jim Provan Page and Holmes: Sections
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Chapter 8 Molecular Phylogenetics: Measuring Evolution.
Introduction to Phylogenetics
Calculating branch lengths from distances. ABC A B C----- a b c.
Chapter 24: Molecular and Genomic Evolution CHAPTER 24 Molecular and Genomic Evolution.
Chapter 10 Phylogenetic Basics. Similarities and divergence between biological sequences are often represented by phylogenetic trees Phylogenetics is.
Why do trees?. Phylogeny 101 OTUsoperational taxonomic units: species, populations, individuals Nodes internal (often ancestors) Nodes external (terminal,
Table 8.3 & Alberts Fig.1.38 EVOLUTION OF GENOMES C-value paradox: - in certain cases, lack of correlation between morphological complexity and genome.
Copyright © 2008 Pearson Education, Inc., publishing as Pearson Benjamin Cummings PowerPoint ® Lecture Presentations for Biology Eighth Edition Neil Campbell.
Phylogeny Ch. 7 & 8.
Phylogeny & Systematics
Evolution at the Molecular Level. Outline Evolution of genomes Evolution of genomes Review of various types and effects of mutations Review of various.
Ayesha M.Khan Spring Phylogenetic Basics 2 One central field in biology is to infer the relation between species. Do they possess a common ancestor?
ASSEMBLY AND ALIGNMENT-FREE METHOD OF PHYLOGENY RECONSTRUCTION FROM NGS DATA Huan Fan, Anthony R. Ives, Yann Surget-Groba and Charles H. Cannon.
Evolution at the Molecular Level. Outline Evolution of genomes Evolution of genomes Review of various types and effects of mutations Review of various.
Modelling evolution Gil McVean Department of Statistics TC A G.
VI. Mutation A.Overview B.Changes in Ploidy C.Changes in ‘Aneuploidy’ (changes in chromosome number) D. Change in Gene Number/Arrangement.
Molecular Evolution. Study of how genes and proteins evolve and how are organisms related based on their DNA sequence Molecular evolution therefore is.
Phylogeny and the Tree of Life
Bioinformatics Overview
Introduction to Bioinformatics Resources for DNA Barcoding
Evolutionary genomics can now be applied beyond ‘model’ organisms
Evolution of eukaryotic genomes
Phylogenetic basis of systematics
Methods of molecular phylogeny
The Tree of Life From Ernst Haeckel, 1891.
Molecular Evolution.
Evolutionary genetics
Chapter 6 Clusters and Repeats.
1. Unequal Crossing-Over a. process: If homologs line up askew:
Presentation transcript:

Lecture 25 - Phylogeny Based on Chapter 23 - Molecular Evolution Copyright © 2010 Pearson Education Inc.

1. Introduction

2. Substitutions in Protein and DNA sequences

3. Comparing Sequences Using Sequence Alignments

4. Substitutions and the Jukes-Cantor Model

5. Rates of Nucleotide Substitution

6. Comparative Genomics 1. Genome sequencing provides a map to genes but does not reveal their function. Comparative genome analysis: a. Compares genes with low evolutionary rate and high functional significance. b. Pseudogenes, which are free to mutate, are used to calculate expected mutation rates. c. Regions of high sequence similarity in distantly related species are likely to contain functional genes. 2. Between mice and humans, for example, pseudogenes show about five times as many changes as regions that encode proteins or regulate gene expression. 3. Natural selection evaluates the consequences of an enormous number of changes, on an evolutionary time scale. 4. Comparative genome analysis can point the way to meaningful experiments by: a. Saving the effort of saturation mutagenesis. b. Allowing use of model organisms (e.g., yeast).

7. Codon Usage Bias

8. Variation in Evolutionary Rates between Genes

9. Molecular Clocks

10. Molecular Phylogeny

11. Phylogenetic Trees N R = (2n - 3)! / [2 n-2 (n - 2)!] N U = (2n - 5)! / [2 n-3 (n - 3)!]

12. Number of Possible Trees # sequences# unrooted# rooted trees , ,395135, ,1352,027, ,027,02534,459,425 N R = (2n - 3)! / [2 n-2 (n - 2)!] N U = (2n - 5)! / [2 n-3 (n - 3)!]

13. Gene versus Species Treees

14. Reconstruction Methods Many possibilities exist for the REAL phylogenetic trees, and it is generally impossible to know which is the true tree that represents actual events in evolution. Most phylogenetic trees generated with molecular data are considered inferred trees. Computer algorithms that generate these inferred trees use three types of approaches: –Distance matrix methods. –Parsimony-based methods. –Maximum likelihood methods. We will examine each of these mehtods briefly.

15. Distance Matrix Phylogenetic Reconstruction Let C i and C j be two disjoint clusters: 1 d i,j = ————————  p  q d p,q, where p  C i and q  C j |Ci| × |Cj| In words: calculate the average over all pairwise inter-cluster distances for all taxa under consideration. CiCi CjCj

16. Parsimony-Based approaches to Phylogenetic Reconstruction

17. Parsimony based Approaches II

18. Maximum Likelihood Approaches to Phylogenetic Tree Reconstruction Maximum likelihood approaches offer a purely statistical alternative method to reconstruct phylogenies. –In a set of sequence alignments, probability is considered for every possible nucleotide substitution. –Transitions, for example, are three times more likely than transversions, and so individuals with transition divergences may be considered more closely related than those with transversions. Complications of the maximum likelihood method include: –Lack of knowledge about the ancestral sequence. –The probability that multiple substitutions have occurred. –The fact that sites are not necessarily independent or equivalent. A vast number of trees are possible with this computation- intensive method. The one with the highest aggregate probability is most likely to reflect the true phylogenetic tree.

19. Bootstrapping and Tree Reliability Large numbers (e.g., >30 species) of long sequences are difficult to analyze, even with fast computers and streamlined algorithms. Neither distance matrix nor maximum parsimony methods can guarantee the correct tree; but generally, if a similar tree results from both of these fundamentally different methods, it is considered fairly reliable. The confidence level for portions of inferred trees can be determined by bootstrap tests, in which a subset of the original data is drawn with replacement and a new tree inferred. Caution is needed in interpreting bootstrap results.

20. The Tree of Life

21. Multigene Families

22a. Gene Duplication and Gene Conversion 1.Duplication frees a copy of the sequence to undergo changes, since a functional copy will still exist. a.Most changes would produce less functional products, or even nonfunctional pseudogenes. b.A few changes, however, might alter function and/or pattern of expression to something more advantageous for the organism. Selection would allow these genes to become widespread in the population. Gene X Gene x’

22b. Gene Duplication and Gene Conversion 2.Misalignment between a pseudogene and a functional copy can result in gene conversion through recombination events. a.The allele on one homolog is copied and replaces the DNA sequence of the allele on the other homolog; it is not reciprocal exchange. b.Gene conversion gives organisms even more opportunities to create a gene with a new function. 3.Gene conversion continues to operate in modern humans. An example is two genes for red-green color vision on the X chromosome that undergo gene conversion in most of the known cases of spontaneous deficiencies in green color vision.

23. Domain (Exon) Shuffling 1. Often, less than an entire gene is duplicated, resulting in copies of protein domains. a. Example - human serum albumin. b. Internal duplication not a rapid method of producing proteins with new functions. 2. Most complex proteins arise from assemblages of several protein domains performing different functions a. The beginnings and ends of exons and protein domains often correspond. b. Gilbert (1978) - most gene families arose through domain shuffling involving duplication and rearrangement. c. Domain shuffling theory proposes that introns were a feature of early life on Earth, even though they are now missing from prokaryotes. d. Numerous examples of complex genes made from segments of other genes are known, and clearly some novel functions have been created in this way.