VARIATION IN CONSERVATION AMONG DIFFERENT GENES WITHIN THE HERPES SIMPLEX VIRUS TYPE 1, AND ITS CORRELATION WITH FUNCTION Samantha Nadeau & Kerri Callahan.

Slides:



Advertisements
Similar presentations
Sequence comparison: Significance of similarity scores Genome 559: Introduction to Statistical and Computational Genomics Prof. James H. Thomas.
Advertisements

Permutation Tests Hal Whitehead BIOL4062/5062.
Comparative genomics Joachim Bargsten February 2012.
Plant of the day! Pebble plants, Lithops, dwarf xerophytes Aizoaceae
Introduction to Bioinformatics
Molecular Clock I. Evolutionary rate Xuhua Xia
Testing Differences Among Several Sample Means Multiple t Tests vs. Analysis of Variance.
The Cobweb of life revealed by Genome-Scale estimates of Horizontal Gene Transfer Fan Ge, Li-San Wang, Junhyong Kim Mourya Vardhan.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS Model Selection Anders Gorm Pedersen Molecular Evolution Group Center for Biological Sequence Analysis Technical.
Current Approaches to Whole Genome Phylogenetic Analysis Hongli Li.
Methods of identification and localization of the DNA coding sequences Jacek Leluk Interdisciplinary Centre for Mathematical and Computational Modelling,
Copyright, ©, 2002, John Wiley & Sons, Inc.,Karp/CELL & MOLECULAR BIOLOGY 3E Footprints and Shadows Looking for Functional Pieces Within Genomes.
CSE 221: Probabilistic Analysis of Computer Systems Topics covered: Statistical inference (Sec. )
Molecular Evolution with an emphasis on substitution rates Gavin JD Smith State Key Laboratory of Emerging Infectious Diseases & Department of Microbiology.
Course overview Tuesday lecture –Those not presenting turn in short review of a paper using the method being discussed Thursday computer lab –Turn in short.
Different chi-squares Ulf H. Olsson Professor of Statistics.
Positive selection A new allele (mutant) confers some increase in the fitness of the organism Selection acts to favour this allele Also called adaptive.
Review: The Logic Underlying ANOVA The possible pair-wise comparisons: X 11 X 12. X 1n X 21 X 22. X 2n Sample 1Sample 2 means: X 31 X 32. X 3n Sample 3.
Inference of Genealogies for Recombinant SNP Sequences in Populations Yufeng Wu Computer Science and Engineering Department University of Connecticut
Finding Sequence Motifs in Alu Transposons that Enhance the Expression of Nearby Genes Kendra Baughman York Marahrens’ Lab UCLA.
Biological (genomic) information Dan Janies
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS Probabilistic modeling and molecular phylogeny Anders Gorm Pedersen Molecular Evolution Group Center for Biological.
Materials and Methods Abstract Conclusions Introduction 1. Korber B, et al. Br Med Bull 2001; 58: Rambaut A, et al. Nat. Rev. Genet. 2004; 5:
Sequence comparison: Significance of similarity scores Genome 559: Introduction to Statistical and Computational Genomics Prof. James H. Thomas.
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Multiple Sequence Alignments and Phylogeny.  Within a protein sequence, some regions will be more conserved than others. As more conserved,
Laboratory Training for Field Epidemiologists Typing May 2007 Sequencing and Phylogeny.
Multiple testing correction
Whole genome comparison Kelley Crouse And Greg Matuszek.
Fluidity of the 16S rRNA Gene Sequence within Aeromonas Strains Alessia Morandi Institute for Infectious Diseases University of Berne.
Computational Biology, Part D Phylogenetic Trees Ramamoorthi Ravi/Robert F. Murphy Copyright  2000, All rights reserved.
Lecture 25 - Phylogeny Based on Chapter 23 - Molecular Evolution Copyright © 2010 Pearson Education Inc.
3- RIBOSOMAL RNA GENE RECONSTRUCITON  Phenetics Vs. Cladistics  Homology/Homoplasy/Orthology/Paralogy  Evolution Vs. Phylogeny  The relevance of the.
PHYLOGENETICS CONTINUED TESTS BY TUESDAY BECAUSE SOME PROBLEMS WITH SCANTRONS.
Introduction to Phylogenetic Trees
I. Statistical Tests: A Repetive Review A.Why do we use them? Namely: we need to make inferences from incomplete information or uncertainty þBut we want.
Calculating branch lengths from distances. ABC A B C----- a b c.
4 male, 4 female LCLs HumanChimpanzeeRhesus Macaque Expression: RNAseq Active Gene Marks: Pol II (ChIPseq) H3K4me3 (ChIPseq) Repressed Region Mark: H3K27me3.
Measures of Conserved Synteny Work was funded by the National Science Foundation’s Interdisciplinary Grants in the Mathematical Sciences All work is joint.
The Tree of Life How do we select a gene sequence for comparison?
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Paper Review on Cross- species Microarray Comparison Hong Lu
CENTER FOR BIOLOGICAL SEQUENCE ANALYSIS Probabilistic modeling and molecular phylogeny Anders Gorm Pedersen Molecular Evolution Group Center for Biological.
ASSEMBLY AND ALIGNMENT-FREE METHOD OF PHYLOGENY RECONSTRUCTION FROM NGS DATA Huan Fan, Anthony R. Ives, Yann Surget-Groba and Charles H. Cannon.
1 Discovery of Conserved Sequence Patterns Using a Stochastic Dictionary Model Authors Mayetri Gupta & Jun S. Liu Presented by Ellen Bishop 12/09/2003.
Bioinformatics Overview
Hierarchical Segmentation of Polarimetric SAR Images
Sequence and structure relatedness of matrix protein of human respiratory syncytial virus with matrix proteins of other negative-sense RNA viruses  K.
A Hybrid Algorithm for Multiple DNA Sequence Alignment
..,../' CJ " · "' ; '..
Methods of molecular phylogeny
Model Selection In multiple regression we often have many explanatory variables. How do we find the “best” model?
POINT ESTIMATOR OF PARAMETERS
-­,-­, '.-.- ·'·' '·..'·..... '-.: - - (p - C!J " 1.,.c_.. If ( '.. ' " ' ' " ' I.
I. Statistical Tests: Why do we use them? What do they involve?
Pairwise Sequence Alignment (cont.)
Insights into the Evolution of Longevity from the Bowhead Whale Genome
Sequence and structure relatedness of matrix protein of human respiratory syncytial virus with matrix proteins of other negative-sense RNA viruses  K.
Maximum likelihood (ML) unrooted tree based on the full-length 16S rRNA genes (A) and 31 conserved single-copy genes (B) showing the phylogenetic position.
Phylogenetic tree of 38 Pseudomonas type strains, based on the V3-V5 region sequence of the 16S rRNA gene (V3 primer, positions 442 to 492; and V5 primer,
Insights into the Evolution of Longevity from the Bowhead Whale Genome
Phylogenetic comparison of the capsule biosynthesis (cap) gene locus among selected Pasteurella multocida strains. Phylogenetic comparison of the capsule.
Evaluation of power for linkage disequilibrium mapping
Core genome phylogeny of V. anguillarum strains.
Phylogenetic analyses of alphacoronaviruses based on complete genome and ORF1ab protein sequence. Phylogenetic analyses of alphacoronaviruses based on.
Comparison of Nonpareil Nd sequence diversity and 16S rRNA gene OTU Shannon H′ taxonomic diversity indices on 90 metagenomes. Comparison of Nonpareil Nd.
Sequence Analysis Alan Christoffels
Phylogenetic comparison among selected Pasteurella multocida and Haemophilus influenzae species with completed genome sequences. Phylogenetic comparison.
S protein sequence-based phylogenetic analyses of alphacoronaviruses.
(A) Bayesian phylogenetic tree of the H gene nucleotide alignment from tigers Pt2004 and Pt and representative CDV sequences obtained from GenBank.
Presentation transcript:

VARIATION IN CONSERVATION AMONG DIFFERENT GENES WITHIN THE HERPES SIMPLEX VIRUS TYPE 1, AND ITS CORRELATION WITH FUNCTION Samantha Nadeau & Kerri Callahan

INTRODUCTION o Most research done on comparison between HSV-1 vs. HSV-2 o Compared dN/dS values o Chose three genes/gene complexes  UL20  UL49  UL15; UL28; UL33

BACKGROUND o Three main morphological regions to virion o Each gene involves a different region  UL20  UL49  UL15; UL28; UL33

HYPOTHESIS o UL20 gene will be least conserved (greatest dN/dS value) o Null: dN=dS

METHODS o Partial genome of strains CJ394, CJ360, CJ311, CJ790, OD4, and TFT401 o Genes UL15, UL20, UL28, UL33, UL49 o 3 sequence alignments: 1.UL20 2.UL15; UL28; UL33 3.UL49

METHODS: ANALYSIS o Codon-based-z-test of selection o Selection at codons via HyPhy o Pairwise distance estimates o Maximum likelihood trees o Null: dN=dS

RESULTS: CODON-BASED-Z-TEST OF SELECTION GeneZ-Statistic HA: dN>dS Prob UL UL15; UL28; UL UL

RESULTS: CODON-BASED-Z-TEST OF SELECTION GeneZ-Statistic HA: DN=/=DS Prob UL UL15; UL28; UL UL

RESULTS: CODON-BASED-Z-TEST OF SELECTION GeneZ statistic HA: dN<dS Prob UL UL15; UL28; UL UL

RESULTS: CODON SELECTION ESTIMATE (HYPHY) GenedN/dS value UL UL15; UL28; UL33 complex0.972 UL

RESULTS: PAIRWISE DISTANCE MATRICES

UL20 MAXIMUM LIKELIHOOD TREE

UL15; UL28; UL33 TREE

UL49 MAXIMUM LIKELIHOOD TREE

TAKE HOME MESSAGE o Can’t compare UL15, UL28, and UL33 against each other to test conservation o UL20 is less highly conserved than UL49 o UL49 is most highly conserved