Evolutionary Interpretation of Log Odds Scores for alignment Alexei Drummond Department of Computer Science.

Slides:



Advertisements
Similar presentations
1 Number of substitutions between two protein- coding genes Dan Graur.
Advertisements

Computing a tree Genome 559: Introduction to Statistical and Computational Genomics Prof. James H. Thomas.
Fa07CSE 182 CSE182-L4: Database filtering. Fa07CSE 182 Summary (through lecture 3) A2 is online We considered the basics of sequence alignment –Opt score.
Computational Biology, Part 7 Similarity Functions and Sequence Comparison with Dot Matrices Robert F. Murphy Copyright  1996, All rights reserved.
1 Introduction to Sequence Analysis Utah State University – Spring 2012 STAT 5570: Statistical Bioinformatics Notes 6.1.
Bayesian Evolutionary Distance P. Agarwal and D.J. States. Bayesian evolutionary distance. Journal of Computational Biology 3(1):1— 17, 1996.
Computing a tree Genome 559: Introduction to Statistical and Computational Genomics Prof. James H. Thomas.
OUTLINE Scoring Matrices Probability of matching runs Quality of a database match.
Phylogenetic Trees Lecture 4
Lecture 8 Alignment of pairs of sequence Local and global alignment
University of British Columbia Department of Computer Science Tamara Munzner Interactive Visualization of Evolutionary Trees and Gene Sequences February.
Sequence Similarity Searching Class 4 March 2010.
CPM '05 Sensitivity Analysis for Ungapped Markov Models of Evolution David Fernández-Baca Department of Computer Science Iowa State University (Joint work.
From population genetics to variation among species: Computing the rate of fixations.
Fa05CSE 182 CSE182-L4: Scoring matrices, Dictionary Matching.
Pairwise profile alignment Usman Roshan BNFO 601.
Molecular Evolution, Part 2 Everything you didn’t want to know… and more! Everything you didn’t want to know… and more!
. Computational Genomics Lecture #3a (revised 24/3/09) This class has been edited from Nir Friedman’s lecture which is available at
Bioinformatics Unit 1: Data Bases and Alignments Lecture 3: “Homology” Searches and Sequence Alignments (cont.) The Mechanics of Alignments.
Sequence Alignments Revisited
Alignment III PAM Matrices. 2 PAM250 scoring matrix.
Pairwise alignment Computational Genomics and Proteomics.
Roadmap The topics:  basic concepts of molecular biology  more on Perl  overview of the field  biological databases and database searching  sequence.
(a) (b) (c) (d). What is (1,2,3)  (3,4,2)? (a) (1, 2, 3, 4) (b) (1,2)  (3,4) (c) (1,3,4,2) (d) (3,1)  (4,2)
Michael Cummings David Reisman University of South Carolina Genomes and Genomics Chapter 15.
Computer Science: A Structured Programming Approach Using C1 3-7 Sample Programs This section contains several programs that you should study for programming.
Pairwise alignments Introduction Introduction Why do alignments? Why do alignments? Definitions Definitions Scoring alignments Scoring alignments Alignment.
Evolution and Scoring Rules Example Score = 5 x (# matches) + (-4) x (# mismatches) + + (-7) x (total length of all gaps) Example Score = 5 x (# matches)
Lecture 3: Markov models of sequence evolution Alexei Drummond.
Pairwise Sequence Alignment. The most important class of bioinformatics tools – pairwise alignment of DNA and protein seqs. alignment 1alignment 2 Seq.
Bioinformatics 2011 Molecular Evolution Revised 29/12/06.
Eric C. Rouchka, University of Louisville Sequence Database Searching Eric Rouchka, D.Sc. Bioinformatics Journal Club October.
What is Genetic Research?. Genetic Research Deals with Inherited Traits DNA Isolation Use bioinformatics to Research differences in DNA Genetic researchers.
CHAPTER 13: THE ENVIRONMENT AND CHANGE OVER TIME
1 Evolutionary Change in Nucleotide Sequences Dan Graur.
Comp. Genomics Recitation 3 The statistics of database searching.
Construction of Substitution Matrices
Sequence Alignment Csc 487/687 Computing for bioinformatics.
Protein Secondary Structure, Bioinformatics Tools, and Multiple Sequence Alignments Finding Similar Sequences Predicting Secondary Structures Predicting.
Sequence alignment. aligned sequences substitution model.
July, 2000Guang Jin Statistics in Applied Science and Technology Chapter 7 - Sampling Distribution of Means.
MAT 4830 Mathematical Modeling
Pairwise Sequence Analysis-III
November 18, 2000ICTCM 2000 Introductory Biological Sequence Analysis Through Spreadsheets Stephen J. Merrill Sandra E. Merrill Marquette University Milwaukee,
Sequence Alignment.
Natural Selection Learning Target: I can apply scientific ideas to construct an explanation to infer evolutionary relationships. Friday-Monday 12/5-8/14.
Construction of Substitution matrices
Doug Raiford Phage class: introduction to sequence databases.
Step 3: Tools Database Searching
Molecular Evolution. Study of how genes and proteins evolve and how are organisms related based on their DNA sequence Molecular evolution therefore is.
Evolutionary Change in Sequences
Substitution Matrices and Alignment Statistics BMI/CS 776 Mark Craven February 2002.
Vineet Bafna. How can we compute the local alignment itself?
WARM UP List anything and everything you know about chromosomes, mitosis, meiosis, or cell division.
Inferring phylogenetic trees: Distance methods
Molecular Evolutionary Analysis
Distances.
Goals of Phylogenetic Analysis
AP Biology – Lecture #4 Big Idea #1 – Evolution Change in the genetic makeup of a population over time IS evolution. Adapted from Rebecca Rehder Wingerden.
Distance based phylogeny reconstruction
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Molecular Evolution.
30% grade = class presentations
A. Pearson, Department of Earth & Planetary Sciences
Structural evidence: Embryonic similarities Vestigial organs
Sequence Similarity Andrew Torda, wintersemester 2006 / 2007, Angewandte … What is the easiest information to find about a protein ? sequence history.
For First Place Most Times Up at the Table
Evidence for Evolution
Similarities Differences
Introduction to bioinformatics Lecture 5 Pair-wise sequence alignment
Presentation transcript:

Evolutionary Interpretation of Log Odds Scores for alignment Alexei Drummond Department of Computer Science

Evolutionary interpretation of match/mismatch scores xy xy a, b not homologous a, b homologous t xy xy (d=0.1 is roughly 90% similarity) d = average number of changes per site

Jukes Cantor Model All mutations are equally likely –x  y at the same rate for all x, y All nucleotides are equally likely (equal base frequencies: –{0.25, 0.25, 0.25, 0.25} for DNA –{0.05,…,0.05} for Proteins DNA Proteins

Evolutionary interpretation of match/mismatch scores (DNA) xy xy (d=0.1 is roughly 90% similarity) d = average number of changes per site

Evolutionary interpretation of match/mismatch scores (DNA)