Admixture in Horse Breeds Illustrated from Single Nucleotide Polymorphism Data César Torres, Yaniv Brandvain University of Minnesota, Department of Plant.

Slides:



Advertisements
Similar presentations
The Human Genome Project Main reference: Nature (2001) 409,
Advertisements

Lecture 2 Strachan and Read Chapter 13
After 13 years of scientist work predominatly in USA & UK the DNA sequence of the human genome was completed in 2003 Any ideas how they did it? What would.
CZ5225 Methods in Computational Biology Lecture 9: Pharmacogenetics and individual variation of drug response CZ5225 Methods in Computational Biology.
Combinatorial Algorithms for Haplotype Inference Pure Parsimony Dan Gusfield.
Sharlee Climer, Alan R. Templeton, and Weixiong Zhang
Multiple Comparisons Measures of LD Jess Paulus, ScD January 29, 2013.
Phylogenetic reconstruction
Reconstructing and Using Phylogenies
MALD Mapping by Admixture Linkage Disequilibrium.
Plant of the day! Pebble plants, Lithops, dwarf xerophytes Aizoaceae
Predicting the Function of Single Nucleotide Polymorphisms Corey Harada Advisor: Eleazar Eskin.
Genetica per Scienze Naturali a.a prof S. Presciuttini Human and chimpanzee genomes The human and chimpanzee genomes—with their 5-million-year history.
Biology and Bioinformatics Gabor T. Marth Department of Biology, Boston College BI820 – Seminar in Quantitative and Computational Problems.
FINAL EXAM: TAKE-HOME Assessment of Significance in Cancer Gene SNPs.
Dispersal models Continuous populations Isolation-by-distance Discrete populations Stepping-stone Island model.
Evolutionary Genome Biology Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen, Hungary, May 2006.
Inferring human demographic history from DNA sequence data Apr. 28, 2009 J. Wall Institute for Human Genetics, UCSF.
Human Migrations Saeed Hassanpour Spring Introduction Population Genetics Co-evolution of genes with language and cultural. Human evolution: genetics,
Genotyping of James Watson’s genome from Low-coverage Sequencing Data Sanjiv Dinakar and Yözen Hernández.
Inference of Genealogies for Recombinant SNP Sequences in Populations Yufeng Wu Computer Science and Engineering Department University of Connecticut
Scientific FieldsScientific Fields  Different fields of science have contributed evidence for the theory of evolution  Anatomy  Embryology  Biochemistry.
Computational Molecular Biology Biochem 218 – BioMedical Informatics Simple Nucleotide.
Hidenki Innan and Yuseob Kim Pattern of Polymorphism After Strong Artificial Selection in a Domestication Event Hidenki Innan and Yuseob Kim A Summary.
Exploring Current DNA Research of Longhorn Cattle.
Haplotype Blocks An Overview A. Polanski Department of Statistics Rice University.
Doug Brutlag 2011 Genomics & Medicine Doug Brutlag Professor Emeritus of Biochemistry &
SNPs Daniel Fernandez Alejandro Quiroz Zárate. A SNP is defined as a single base change in a DNA sequence that occurs in a significant proportion (more.
National Taiwan University Department of Computer Science and Information Engineering Haplotype Inference Yao-Ting Huang Kun-Mao Chao.
A single-nucleotide polymorphism tagging set for human drug metabolism and transport Kourosh R Ahmadi, Mike E Weale, Zhengyu Y Xue, Nicole Soranzo, David.
Biology 101 DNA: elegant simplicity A molecule consisting of two strands that wrap around each other to form a “twisted ladder” shape, with the.
Abstract To understand the population dynamics in rice we conducted detailed sequence study in 32 accessions of rice (10 japonica, 7 indica, 10 Asiatic.
CS177 Lecture 10 SNPs and Human Genetic Variation
SNP Haplotypes as Diagnostic Markers Shrish Tiwari CCMB, Hyderabad.
Gene Hunting: Linkage and Association
Genome-Wide Association Study (GWAS)
National Taiwan University Department of Computer Science and Information Engineering Pattern Identification in a Haplotype Block * Kun-Mao Chao Department.
Chapter 24: Molecular and Genomic Evolution CHAPTER 24 Molecular and Genomic Evolution.
Managing Next Generation Sequence Data with GMOD Dave Clements 1, Scott Cain 2, Paul Hohenlohe 3, Nicholas Stiffler 3, Paul Etter 3, Eric Johnson 3, William.
Methods in genome wide association studies. Norú Moreno
Julia N. Chapman, Alia Kamal, Archith Ramkumar, Owen L. Astrachan Duke University, Genome Revolution Focus, Department of Computer Science Sources
Lecture 6. Functional Genomics: DNA microarrays and re-sequencing individual genomes by hybridization.
2007 Melvin Tooker Animal Improvement Programs Laboratory USDA Agricultural Research Service, Beltsville, MD, USA
Linear Reduction Method for Tag SNPs Selection Jingwu He Alex Zelikovsky.
Genetic differentiation of caribou herds and reindeer in Northern Alaska Karen H. Mager, Kevin E. Colson, and Kris J. Hundertmark Institute of Arctic Biology,
The International Consortium. The International HapMap Project.
Copyright © 2010 Pearson Education, Inc. publishing as Benjamin Cummings Lectures by Greg Podgorski, Utah State University Current Issues in Biology, Volume.
Linkage Disequilibrium and Recent Studies of Haplotypes and SNPs
Serial Founder Effects in Linguistics and Genetics Claire Bowern (with Keith Hunley and Meghan Healy) Yale and University of New Mexico Feb 9, 2012 Based.
Evolutionary Genome Biology Gabor T. Marth, D.Sc. Department of Biology, Boston College
Single Nucleotide Polymorphisms (SNPs) By Amira Jhelum Rahul Shweta.
Reconstructing and Using Phylogenies 16. Concept 16.1 All of Life Is Connected through Its Evolutionary History All of life is related through a common.
Evolution for Beginners. What is evolution? A basic definition of evolution… “…evolution can be precisely defined as any change in the frequency of alleles.
Population Structure High population divergence at the state level Populations from western Indiana genetically differed from the BONWR population Genetic.
Inferences on human demographic history using computational Population Genetic models Gabor T. Marth Department of Biology Boston College Chestnut Hill,
Date of download: 7/2/2016 Copyright © 2016 American Medical Association. All rights reserved. From: How to Interpret a Genome-wide Association Study JAMA.
Pharmacogenetics: Implications of race and ethnicity on defining genetic profiles for personalized medicine  Victor E. Ortega, MD, Deborah A. Meyers,
1.2-Determining How Species are Related
By Michael Fraczek and Caden Boyer
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Volume 26, Issue 7, Pages (April 2016)
Sriram Sankararaman, Swapan Mallick, Nick Patterson, David Reich 
Haplotypes at ATM Identify Coding-Sequence Variation and Indicate a Region of Extensive Linkage Disequilibrium  Penelope E. Bonnen, Michael D. Story,
Selina Vattathil, Joshua M. Akey  Cell 
Unit Genomic sequencing
Selecting a Maximally Informative Set of Single-Nucleotide Polymorphisms for Association Analyses Using Linkage Disequilibrium  Christopher S. Carlson,
Sriram Sankararaman, Swapan Mallick, Nick Patterson, David Reich 
Volume 26, Issue 23, Pages (December 2016)
Giulio Genovese, Robert E. Handsaker, Heng Li, Eimear E
Evolution Biology Mrs. Johnson.
Presentation transcript:

Admixture in Horse Breeds Illustrated from Single Nucleotide Polymorphism Data César Torres, Yaniv Brandvain University of Minnesota, Department of Plant Biology, LSSURP Sciences Program INTRODUCTION More than 300 horse breeds exist in the modern world. Many of these horse breeds are thought to have closely related evolutionary paths, but with the horse genome being sequenced recently (2007), genome-wide single nucleotide polymorphism (SNP) studies confirming evolutionary relationships are lacking. This project determines whether there is admixture between horse breeds that are thought to be evolutionary related by using the program Treemix to create a phylogenic tree, which illustrates admixture in the multiple horse breeds. Allele frequencies across 34,648 SNP sites formatted from the genotype of 36 breeds (795 individuals in total) obtained with an Illumina Beadchip, were used as the input data for the program. We found evidence for admixture events between several breeds of horses, most of which were admixed with the Thoroughbred. Some horses were found to not have evidence of admixture. While for the Caspian, a breed rediscovered in 1965, some admixture evidence was found, implying they might not be 100% pure Caspian. This was confirmed with F3 statistics tests (three population test). These findings will facilitate future studies of the origin of the domestic horse and its evolutionary path by providing new insights on each breed and how they are related to each other. In the future we will use denser chips to obtain more SNP sites allowing us to find regions of introgression and determine whether or not they are coding sequences. METHODS Samples of 36 horse breeds (795 individuals) genotyped on the Illumina 50k SNP Beadchip (DNA Microarray) were obtained and formatted into allele frequencies. Out of the 50k SNP sites, only 34,648 were used due to triple allelic sites being removed. The program Treemix used the allele frequencies at the 34k SNP sites to infer the patterns of population splits and admixture. The program uses the similarities between allele frequencies to determine which horse breeds are related to each other and if there is admixture between any of them illustrating it with a red arrow. These trees were used to base our hypotheses. F3 statistic tests (also known as three population tests) were used to further confirm if there was admixture between the horse breeds. This test compares three populations or breeds in the form of (A; B, C), where a negative F3 statistic value implies admixture in individuals of the group A. RESULTS FIGURE 1: Treemix tree with multiple admixture arrows. Thoroughbred can be seen involved in multiple admixture patterns. FIGURE 2: F3 statistic results for horses thought to have admixture from the Thoroughbred. FIGURE 3: F3 statistic results for horses thought to have no admixture. CONCLUSIONS A diverse number of horse breeds including the Quarter Horse, Swiss Warmblood, Paint Horse and Hanoverian; seem to have admixture with the Thoroughbred from US and UK. This can be confirmed by their negative f3 statistic values ranging as low as for the Swiss Warmblood. A very interesting result is that the Thoroughbred US seemed to yield a more negative f3 statistic value than the Thoroughbred UK when compared with the Quarter Horse. This implies that the Thoroughbred US might have some admixture with the Quarter Horse. Also both the Thoroughbred US and Thoroughbred UK were shown to have admixture from each other. In addition, no evidence was found of admixture in the Mangalarga Paulista, Exmoor, North Swedish Horse, Puerto Rican Paso Fino, Arabian or Icelandic, implying that these breeds might be pure. As of the Caspian breed, few negative f3 statistic values were found, illustrating traces of admixture. This is evidence for thinking that the horses rediscovered might not be pure Caspian horses. FUTURE PLANS Since a higher number of SNP sites are needed to determine regions of introgression, denser chips will be used. Denser chips will also allow to determine whether or not these sites are are coding sequences. Linkage disequilibrium dating can be used to determine at what time the breeds’ history the admixture occurred. REFERENCES/ ACKNOWLEDGMENTS Pickrell and Pritchard (2012). Inference of population splits and mixtures from genome-wide allele frequency data. PLoS Genetics. In Press. Sponsored by NSF-REU in Molecular Genetics & Proteomics Funded by UPR-PRISE Program NIH Grant #R25GM A A A A B B No admixture 50% snps Breed 1 Breed 2 Breed 3 A A A A B B Admixture from Breed 3 to Breed 2 <50% snps >50% snps Breed 1 Breed 2 Breed 3 Admixture among horse breeds F3 Rank F3 value (<0 suggests admixture) o Quarter Horse ; Thoroughbred, _______ o Swiss Warmblood ; Thoroughbred, _______ o Paint Horse ; Thoroughbred, _______ o Hanoverian ; Thoroughbred, _______ o Other F3 Breeds: Putatively pure horse breeds Mangalarga Paulista Exmoor North Swedish Horse Puerto Rican Paso Fino Arabian IcelandicCaspian