Chapter 4: Distance-Based Methods of Phylogenetics. Guan-Shieng Huang (黃光璿), Department of Computer Science and Information Engineering, National Chi Nan University, 2004/03/29.


2 Motivation
Evolutionary events on genomes:
- substitutions
- insertions
- deletions
- rearrangements
We focus on cluster analysis in this chapter.

3 4.1 History of Molecular Phylogenetics
Taxonomists:
- naming and grouping organisms
- traditional approach based on anatomical differences
Linnaeus's system:
- kingdom, phylum, class, order, family, genus, species
Darwin
Nuttall (1902-1904):
- humans and apes diverged most recently (judged from immunological evidence)

4 1950s
- protein electrophoresis (size, charge)
1960s
- protein sequencing
1970s
- genomic information: restriction enzymes came first, DNA sequencing later

5 Comparative anatomy

6 Python and human

7 Linnaeus
- 18th-century naturalist
- led untrained students around the world to collect specimens; a third of the students died during the expeditions
- established the binomial system of nomenclature: genus name + species name


9 4.2 Advantages of Molecular Phylogenies
Fundamental:
- evolution is defined as genetic change
- molecular clock hypothesis (Chap. 3)
In the early days, taxonomists inferred genotypes from phenotypes.
- phenotype: how an organism looks
- genotype: the genes that give rise to its physical appearance

10 Later, the following were also studied:
- behavior
- ultrastructure
- biochemical characteristics

11 Problems that traditional approaches cannot resolve:
- convergent evolution, e.g. eyes in humans, flies, and mollusks
- many organisms, such as bacteria, lack easily studied phenotypic features
- comparing distantly related organisms (bacteria, worms, mammals): few characteristics in common

Phylogenetic Trees


Terminology of Tree Reconstruction
Phylogenetic tree, or dendrogram:
- nodes: taxonomic units
- branches
- terminal nodes: collected data (I, II, III, IV, V)
- internal nodes: inferred ancestors (A, B, C, D)
Newick format:
- (((I, II), (III, IV)), V)
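The Newick string on this slide can be read mechanically. The sketch below is a minimal, topology-only parser (no branch lengths, no internal labels); the function names `parse_newick`, `leaves`, and `count_internal` are illustrative and not from the slides.

```python
def parse_newick(text):
    """Parse a topology-only Newick string into nested tuples of leaf names."""
    text = text.strip().rstrip(";")
    pos = 0

    def skip_ws():
        nonlocal pos
        while pos < len(text) and text[pos].isspace():
            pos += 1

    def parse_node():
        nonlocal pos
        skip_ws()
        if pos >= len(text):
            raise ValueError("unexpected end of input")
        if text[pos] == "(":
            pos += 1  # consume "("
            children = [parse_node()]
            skip_ws()
            while pos < len(text) and text[pos] == ",":
                pos += 1  # consume ","
                children.append(parse_node())
                skip_ws()
            if pos >= len(text) or text[pos] != ")":
                raise ValueError("missing ')'")
            pos += 1  # consume ")"
            return tuple(children)
        # Otherwise this is a leaf: read up to the next delimiter.
        start = pos
        while pos < len(text) and text[pos] not in ",()":
            pos += 1
        return text[start:pos].strip()

    return parse_node()


def leaves(tree):
    """Terminal nodes, left to right."""
    if isinstance(tree, str):
        return [tree]
    return [name for child in tree for name in leaves(child)]


def count_internal(tree):
    """Number of internal nodes (inferred ancestors), root included."""
    if isinstance(tree, str):
        return 0
    return 1 + sum(count_internal(child) for child in tree)


tree = parse_newick("(((I, II), (III, IV)), V)")
print(leaves(tree), count_internal(tree))
```

For the slide's tree this gives five terminal nodes and four internal nodes, matching the labels I to V and A to D.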

15 Bifurcating: one lineage splits into two. Multifurcating: one lineage splits into three or more.
Scaled trees:
- branch lengths are proportional to the differences between pairs of neighboring nodes
Additive (in a scaled tree):
- the path length between two nodes reflects their accumulated difference
Unscaled trees:
- convey only relative kinship

Rooted and Unrooted Trees


18 $N_R$ = number of rooted binary trees; $N_U$ = number of unrooted binary trees
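For n labeled terminal nodes, the standard counts are $N_R = (2n-3)! / (2^{n-2}(n-2)!)$ and $N_U = (2n-5)! / (2^{n-3}(n-3)!)$. The sketch below evaluates them; the function names are illustrative, and the formulas are the textbook ones, assumed to be what this slide shows.

```python
from math import factorial


def num_rooted(n):
    """Number of rooted binary trees with n labeled leaves (n >= 2)."""
    if n < 2:
        raise ValueError("need at least 2 leaves")
    return factorial(2 * n - 3) // (2 ** (n - 2) * factorial(n - 2))


def num_unrooted(n):
    """Number of unrooted binary trees with n labeled leaves (n >= 3)."""
    if n < 3:
        raise ValueError("need at least 3 leaves")
    return factorial(2 * n - 5) // (2 ** (n - 3) * factorial(n - 3))


for n in (3, 4, 5, 10):
    print(n, num_rooted(n), num_unrooted(n))
```

The counts grow very quickly (already 34,459,425 rooted trees for 10 leaves), which is why exhaustive search over topologies is impractical.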

Gene vs. Species Trees
Gene tree:
- built from a single homologous gene
Species tree:
- best obtained from analysis of multiple genes
Note:
- Evolution occurs at the level of populations of organisms, not at the level of individuals.
- A gene tree and a species tree are therefore not the same thing.


Character and Distance Data
Characters:
- DNA sequences, protein sequences, color, behavior, response time, ...
Distance:
- overall, pairwise difference
Character data can be converted into distance data.
Pheneticists prefer distance-based methods; cladists prefer character-based methods.
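As an illustration of turning character data into distance data, the sketch below computes the proportion of differing sites (the p-distance) between aligned sequences. The sequences and function names are made up for the example, and in practice a correction for multiple substitutions (Chap. 3) would usually be applied on top.

```python
def p_distance(seq_a, seq_b):
    """Fraction of aligned positions at which two sequences differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    diffs = sum(1 for a, b in zip(seq_a, seq_b) if a != b)
    return diffs / len(seq_a)


def distance_matrix(seqs):
    """Pairwise p-distances for a dict {name: aligned sequence}."""
    names = list(seqs)
    return {a: {b: p_distance(seqs[a], seqs[b]) for b in names} for a in names}


# Hypothetical aligned sequences, for illustration only.
seqs = {"I": "ACGTACGT", "II": "ACGTACGA", "III": "ACGAACTA"}
dm = distance_matrix(seqs)
for name, row in dm.items():
    print(name, row)
```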

Distance Matrix Methods
- UPGMA (Unweighted-Pair-Group Method with Arithmetic mean)
- Transformed Distance Method
- Neighbor's Relation Method
- Neighbor-Joining Method

UPGMA
- Unweighted-Pair-Group Method with Arithmetic mean
- 1960s
- assumes a constant rate of evolution across all lineages


29 The definition of $d_{ij}$, and how it can be calculated
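For reference, the usual UPGMA definition of the distance between two clusters $C_i$ and $C_j$, as given in Durbin et al. and assumed here to be what this slide refers to, is the average over all cross-cluster pairs. When $C_i$ and $C_j$ are merged into $C_k$, the distance from $C_k$ to any other cluster $C_l$ follows as a size-weighted average:

```latex
d_{ij} = \frac{1}{|C_i|\,|C_j|} \sum_{p \in C_i,\; q \in C_j} d_{pq},
\qquad
d_{kl} = \frac{d_{il}\,|C_i| + d_{jl}\,|C_j|}{|C_i| + |C_j|}
\quad \text{for } C_k = C_i \cup C_j .
```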


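31 Ultrametric Test
A matrix is ultrametric iff, for every three taxa i, j, k, $d_{ij} \le \max(d_{ik}, d_{jk})$; equivalently, the two largest of the three pairwise distances are equal.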
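A minimal sketch of the test, using the standard three-point formulation (for every triple, the two largest of the three pairwise distances must be equal). The function name, the tolerance parameter, and the example matrices are assumptions made for illustration.

```python
from itertools import combinations


def is_ultrametric(d, tol=1e-9):
    """Check the three-point condition on a symmetric distance matrix."""
    n = len(d)
    for i, j, k in combinations(range(n), 3):
        # Sort the three pairwise distances; the two largest must agree.
        _, middle, largest = sorted((d[i][j], d[i][k], d[j][k]))
        if abs(largest - middle) > tol:
            return False
    return True


ultra = [
    [0, 2, 6, 10],
    [2, 0, 6, 10],
    [6, 6, 0, 10],
    [10, 10, 10, 0],
]
not_ultra = [
    [0, 2, 6],
    [2, 0, 5],
    [6, 5, 0],
]
print(is_ultrametric(ultra), is_ultrametric(not_ultra))
```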

32 Theorem UPGMA can reconstruct the correct phylogenetic tree as long as the distance matrix is ultrametric.
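A compact sketch of UPGMA, assuming the size-weighted average update for merged clusters. The function name `upgma` and the nested-tuple output are illustrative choices; ties are broken by whichever pair is found first.

```python
def upgma(names, d):
    """Cluster taxa by UPGMA.

    names: list of taxon names; d: symmetric matrix as a list of lists.
    Returns (tree, root_height), where tree is nested 2-tuples of names.
    """
    # cluster id -> (subtree, number of leaves, node height)
    clusters = {i: (name, 1, 0.0) for i, name in enumerate(names)}
    # distances keyed by (smaller id, larger id)
    dist = {(i, j): d[i][j]
            for i in range(len(names)) for j in range(i + 1, len(names))}
    next_id = len(names)

    while len(clusters) > 1:
        # Pick the closest pair of clusters.
        (i, j), d_ij = min(dist.items(), key=lambda item: item[1])
        tree_i, size_i, _ = clusters.pop(i)
        tree_j, size_j, _ = clusters.pop(j)
        # Distance from the merged cluster to every remaining cluster.
        for other in clusters:
            d_i = dist.pop((min(i, other), max(i, other)))
            d_j = dist.pop((min(j, other), max(j, other)))
            dist[(other, next_id)] = (size_i * d_i + size_j * d_j) / (size_i + size_j)
        del dist[(i, j)]
        # The new node sits at half the distance between the merged clusters.
        clusters[next_id] = ((tree_i, tree_j), size_i + size_j, d_ij / 2)
        next_id += 1

    tree, _, height = next(iter(clusters.values()))
    return tree, height


# Hypothetical ultrametric matrix, for illustration only.
matrix = [
    [0, 2, 6, 10],
    [2, 0, 6, 10],
    [6, 6, 0, 10],
    [10, 10, 10, 0],
]
result_tree, root_height = upgma(["A", "B", "C", "D"], matrix)
print(result_tree, root_height)
```

On this ultrametric input the sketch groups A with B first, then C, then D, as the theorem leads one to expect.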

33 Estimation of Branch Lengths
Branch lengths are estimated once the tree topology is given.
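The simplest case is three taxa A, B, C joined at one internal node, with branch lengths a, b, c. Since $d_{AB} = a + b$, $d_{AC} = a + c$, and $d_{BC} = b + c$, the three lengths can be solved for directly. The sketch below does this; the function name and the example distances are illustrative, and whether this is the exact calculation on the slide is an assumption.

```python
def three_point_branch_lengths(d_ab, d_ac, d_bc):
    """Solve a+b=d_ab, a+c=d_ac, b+c=d_bc for the branch lengths (a, b, c)."""
    a = (d_ab + d_ac - d_bc) / 2
    b = (d_ab + d_bc - d_ac) / 2
    c = (d_ac + d_bc - d_ab) / 2
    return a, b, c


a, b, c = three_point_branch_lengths(22, 39, 41)
print(a, b, c)
```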

Transformed Distance Method
Weakness of UPGMA:
- it assumes a constant rate of evolution across all lineages
- remedy: modify the distance matrix so that UPGMA performs better

35 Outgroup (J. Farris, 1977)
Branch length

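36 It gives only a tree topology and does not provide estimates of branch lengths (Nei, 1987).
The transformed matrix is ultrametric.
A matrix is ultrametric iff, for every three taxa i, j, k, $d_{ij} \le \max(d_{ik}, d_{jk})$; equivalently, the two largest of the three pairwise distances are equal.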
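A sketch of the transformation, assuming the commonly cited form $d'_{ij} = (d_{ij} - d_{iD} - d_{jD})/2 + \bar{d}_D$, where D is the outgroup and $\bar{d}_D$ is the mean distance from the outgroup to the ingroup taxa. The slide's own formula is not preserved, so this form, the function name, and the example distances are all assumptions.

```python
def transformed_distances(d, outgroup):
    """Apply the outgroup-based transformation to a dict-of-dicts matrix."""
    ingroup = [t for t in d if t != outgroup]
    mean_out = sum(d[t][outgroup] for t in ingroup) / len(ingroup)
    return {
        i: {
            j: 0.0 if i == j
            else (d[i][j] - d[i][outgroup] - d[j][outgroup]) / 2 + mean_out
            for j in ingroup
        }
        for i in ingroup
    }


# Hypothetical additive distances: A and B are true neighbors, but B evolves
# fast, so the raw A-C distance (4) is smaller than the raw A-B distance (5).
d = {
    "A": {"A": 0, "B": 5, "C": 4, "D": 8},
    "B": {"A": 5, "B": 0, "C": 7, "D": 11},
    "C": {"A": 4, "B": 7, "C": 0, "D": 8},
    "D": {"A": 8, "B": 11, "C": 8, "D": 0},
}
t = transformed_distances(d, outgroup="D")
print(t)
```

After the transformation the A-B distance is the smallest and the two larger distances are equal, so UPGMA on the transformed matrix pairs A with B.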


Neighbor's Relation Method
Four-point condition: if A and B are neighbors and C and D are neighbors, then
$d_{AB} + d_{CD} < d_{AC} + d_{BD}$ and $d_{AB} + d_{CD} < d_{AD} + d_{BC}$
hold if the tree is additive.

39 Given any four points, say A, B, C, D, compare the three sums $d_{AB} + d_{CD}$, $d_{AC} + d_{BD}$, and $d_{AD} + d_{BC}$. The smallest sum indicates how to pair up the points.

40 S. Sattath & A. Tversky, 1977
For every four points, say A, B, C, D, compute the three sums $d_{AB} + d_{CD}$, $d_{AC} + d_{BD}$, and $d_{AD} + d_{BC}$. The two pairs in the smallest sum are treated as neighbors, and each of them scores 1. After all possible quadruples have been tried, the pair with the highest score is grouped.
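The scoring step can be sketched as follows. The function name, the tie handling (the first minimal pairing wins), and the example matrix are assumptions; the matrix is additive and was built from a hypothetical tree in which (A, B) and (D, E) are the true neighbor pairs.

```python
from collections import Counter
from itertools import combinations


def neighbor_relation_scores(names, d):
    """Score every pair by how often it is paired in the best quartet split."""
    scores = Counter()
    for a, b, c, e in combinations(range(len(names)), 4):
        # The three ways to split four taxa into two pairs.
        pairings = [((a, b), (c, e)), ((a, c), (b, e)), ((a, e), (b, c))]
        best = min(
            pairings,
            key=lambda p: d[p[0][0]][p[0][1]] + d[p[1][0]][p[1][1]],
        )
        for i, j in best:
            scores[(names[i], names[j])] += 1
    return scores


names = ["A", "B", "C", "D", "E"]
matrix = [
    [0, 3, 5, 5, 8],
    [3, 0, 6, 6, 9],
    [5, 6, 0, 6, 9],
    [5, 6, 6, 0, 5],
    [8, 9, 9, 5, 0],
]
scores = neighbor_relation_scores(names, matrix)
print(scores.most_common())
```

Here (A, B) and (D, E) each win three of the five quadruples, while every other pair wins at most one.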


44 The lengths of the branches can be determined by the outgroup method.

45 Theorem If a matrix is additive, then its phylogenetic tree (unrooted, binary) can be reconstructed correctly and uniquely by the Neighbor's Relation Method.

Neighbor-Joining Methods


48 where $L$ is the set of all leaves (7.4)


50 Theorem If a matrix is additive, then its phylogenetic tree (unrooted, binary) can be reconstructed correctly and uniquely by the Neighbor-Joining Method.
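A sketch of neighbor joining in the form used by Durbin et al.: with $r_i = \frac{1}{|L|-2}\sum_{k \in L} d_{ik}$, join the pair minimizing $D_{ij} = d_{ij} - (r_i + r_j)$. The slide's own formulas are not preserved, so this formulation is an assumption, as are the function name, the internal node names `N0`, `N1`, ..., and the example matrix (additive, built from a hypothetical tree).

```python
def neighbor_joining(names, d):
    """Return the unrooted tree as a list of edges (node, node, length)."""
    dist = {a: {b: d[i][j] for j, b in enumerate(names)}
            for i, a in enumerate(names)}
    active = list(names)
    edges = []
    next_internal = 0

    while len(active) > 2:
        n = len(active)
        # Average distance of each leaf to the others, scaled by 1/(n-2).
        r = {i: sum(dist[i][k] for k in active) / (n - 2) for i in active}
        pairs = ((a, b) for idx, a in enumerate(active) for b in active[idx + 1:])
        i, j = min(pairs, key=lambda p: dist[p[0]][p[1]] - r[p[0]] - r[p[1]])

        k = f"N{next_internal}"
        next_internal += 1
        # Branch lengths from the joined leaves to the new internal node.
        d_ik = (dist[i][j] + r[i] - r[j]) / 2
        d_jk = dist[i][j] - d_ik
        edges.append((i, k, d_ik))
        edges.append((j, k, d_jk))

        # Distances from the new node to every remaining leaf.
        dist[k] = {k: 0.0}
        for m in active:
            if m not in (i, j):
                d_km = (dist[i][m] + dist[j][m] - dist[i][j]) / 2
                dist[k][m] = d_km
                dist[m][k] = d_km
        active = [m for m in active if m not in (i, j)] + [k]

    a, b = active
    edges.append((a, b, dist[a][b]))
    return edges


names = ["A", "B", "C", "D", "E"]
matrix = [
    [0, 3, 5, 5, 8],
    [3, 0, 6, 6, 9],
    [5, 6, 0, 6, 9],
    [5, 6, 6, 0, 5],
    [8, 9, 9, 5, 0],
]
edges = neighbor_joining(names, matrix)
for edge in edges:
    print(edge)
```

Because the example matrix is additive, the recovered leaf branch lengths (A: 1, B: 2, C: 3, D: 1, E: 4) match the tree it was generated from, in line with the theorem.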

Maximum Likelihood Approaches
- purely statistical method
- multiple substitutions
- sites are not necessarily independent
- no single substitution model is yet close to general acceptance

Multiple Sequence Alignment

53 References and image sources
1. Dan E. Krane and Michael L. Raymer, Fundamental Concepts of Bioinformatics, Benjamin/Cummings.
2. R. Durbin, S. Eddy, A. Krogh, G. Mitchison, Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids, Cambridge University Press.
3. Sylvia S. Mader, Biology, 8th edition, McGraw-Hill.