LSJ3 TH1 CBS130569 BC7 BC4 AB1 AB2 LSJ1 TH2 SO1 SO6 LEC1 LEC2 ONT6

Slides:



Advertisements
Similar presentations
Genetic Statistics Lectures (5) Multiple testing correction and population structure correction.
Advertisements

Ortholog vs. paralog? 1. Collect Sequence Data Good Dataset
The Midwest Region Kansas, Missouri, North Dakota, South Dakota, Nebraska, Illinois, Wisconsin, Ohio, Indiana, Iowa, Minnesota, Michigan.
 Aim in building a phylogenetic tree is to use a knowledge of the characters of organisms to build a tree that reflects the relationships between them.
1 General Phylogenetics Points that will be covered in this presentation Tree TerminologyTree Terminology General Points About Phylogenetic TreesGeneral.
Molecular Evolution Revised 29/12/06
Current Approaches to Whole Genome Phylogenetic Analysis Hongli Li.
. Phylogeny II : Parsimony, ML, SEMPHY. Phylogenetic Tree u Topology: bifurcating Leaves - 1…N Internal nodes N+1…2N-2 leaf branch internal node.
Dispersal models Continuous populations Isolation-by-distance Discrete populations Stepping-stone Island model.
I.1 ii.2 iii.3 iv.4 1+1=. i.1 ii.2 iii.3 iv.4 1+1=
I.1 ii.2 iii.3 iv.4 1+1=. i.1 ii.2 iii.3 iv.4 1+1=
Lecture 13 – Performance of Methods Folks often use the term “reliability” without a very clear definition of what it is. Methods of assessing performance.
Phylogenetic analyses Kirsi Kostamo. The aim: To construct a visual representation (a tree) to describe the assumed evolution occurring between and among.
Molecular phylogenetics
Site and Situation Factors
Phylogenetic trees School B&I TCD Bioinformatics May 2010.
Lecture 25 - Phylogeny Based on Chapter 23 - Molecular Evolution Copyright © 2010 Pearson Education Inc.
Bioinformatics 2011 Molecular Evolution Revised 29/12/06.
Gene tree discordance and multi-species coalescent models Noah Rosenberg December 21, 2007 James Degnan Randa Tao David Bryant Mike DeGiorgio.
Testing alternative hypotheses. Outline Topology tests: –Templeton test Parametric bootstrapping (briefly) Comparing data sets.
Supplementary Fig. S1. 16S RNA Neighbor-joining (NJ) tree of Brevibacterium metallicus sp. nov. NM2E3 T (in bold) and related species of genus Brevibacterium.
1 CAP5510 – Bioinformatics Phylogeny Tamer Kahveci CISE Department University of Florida.
Bayesian Optimization Algorithm, Decision Graphs, and Occam’s Razor Martin Pelikan, David E. Goldberg, and Kumara Sastry IlliGAL Report No May.
Choosing the most parsimonious tree
The Big Issues in Phylogenetic Reconstruction Randy Linder Integrative Biology, University of Texas
Building Phylogenies Maximum Likelihood. Methods Distance-based Parsimony Maximum likelihood.
Basics in R part 2. Variable types in R Common variable types: Numeric - numeric value: 3, 5.9, Logical - logical value: TRUE or FALSE (1 or 0)
Lecture 19 – Species Tree Estimation
From: On the Origin of Darwin's Finches
8/10/16 Lesson 1-1: States and Regions
2012 IFTA / IRP MANAGERS’AND LAW ENFORCEMENT WORKSHOP
Introduction to Bioinformatics Resources for DNA Barcoding
Phylogenetic basis of systematics
Figure 1. Map showing the sampling sites of Hexaplex trunculus
Fig. 1. —Midpoint-rooted Bayesian phylogram of the 128 unique control region mitochondrial DNA (mtDNA) haplotypes of Glaucomys sabrinus analyzed in this.
F igure 1. Current distribution of extant plethodontid salamanders
Fig. 1 Combined phylogenetic and RelTime timetree based on the topology obtained from the “best” tree (Bayesian, GTR Gamma, 54 taxa mt + nuDNA alignment,
Gene-sequence analysis reveals at least three species hidden in Zausodes arenicolus Erin Easton November 13, 2008.
Using Bioinformatics to Study Evolutionary Relationships
26.3 Shared Characters Are Used To Construct Phylogenetic Trees
FP1 Matrices Introduction
2,820 children living in poverty 17% of all residents live in poverty
2,910 children living in poverty 13% of all residents live in poverty
children living in poverty 17% of all residents live in poverty
Volume 25, Issue 24, Pages (December 2015)
Volume 24, Issue 7, Pages (March 2014)
Introductions by Region…
Brian K. Maples, Simon Gravel, Eimear E. Kenny, Carlos D. Bustamante 
A. Papa, K. Dumaidi, F. Franzidou, A. Antoniadis 
The number of immigrants admitted to Canada each year evolves with projected population growth. Under the low assumption for patterns of immigration set.
Chapter 19 Molecular Phylogenetics
Volume 25, Issue 22, Pages (November 2015)
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John.
A Flexible Bayesian Framework for Modeling Haplotype Association with Disease, Allowing for Dominance Effects of the Underlying Causative Variants  Andrew.
Volume 17, Issue 5, Pages (March 2007)
Phylogenetic tree of perA A2-domain DNA sequence.
Development of a real-time PCR assay for the specific detection and identification of Streptococcus pseudopneumoniae using the recA gene  V. Sistek, M.
Volume 8, Issue 3, Pages (March 2015)
Volume 16, Issue 15, Pages R577-R578 (August 2006)
Evolutionary relationships between sporadic and epidemic strains of healthcare- associated methicillin-resistant Staphylococcus aureus  M. Hallin, O. Denis,
Program Compliance Review Update
Neighbor-joining tree based on nifH for 20 organisms along with the maximum-likelihood sequence obtained from aligning the soil data to the gene sequence.
Novel West Nile virus lineage 1a full genome sequences from human cases of infection in north-eastern Italy, 2011  L. Barzon  Clinical Microbiology and.
Giulio Genovese, Robert E. Handsaker, Heng Li, Eimear E
Lactase Haplotype Diversity in the Old World
Analysis of C. canadensis mitochondria.
Messages through Bottlenecks: On the Combined Use of Slow and Fast Evolving Polymorphic Markers on the Human Y Chromosome  Peter de Knijff  The American.
5,070 children living in poverty 15% of all residents live in poverty
Haplotypes in the Dystrophin DNA Segment Point to a Mosaic Origin of Modern Human Diversity  Ewa Ziętkiewicz, Vania Yotova, Dominik Gehl, Tina Wambach,
Bruce Rannala, Jeff P. Reeve  The American Journal of Human Genetics 
Presentation transcript:

LSJ3 TH1 CBS130569 BC7 BC4 AB1 AB2 LSJ1 TH2 SO1 SO6 LEC1 LEC2 ONT6 ONT3 KEN1 KEN3 PEN1 PEN2 MIN2 MIN1 MIS2 MIS1 ND2 WIS2 ALB2 ALB1 SAS5 SAS3 BC12 CBS130562 ND1 WIS1 QC9 AGASSIZ CBS100042 QC8 QC11 ARCADIA DAVID5 pn02.02 pn01.03 20.0 98/1 100/1 From USA From Canada Sphaerulina populicola Sphaerulina musiva Sphaerulina populi Fig. S1 ITSrDNA maximum parsimony phylogram of Sphaerulina species. Statistical support for the topology is presented when above 70% for bootstrap and posterior probabilities above 95%. Bootstrap values are in front of the forward slash whilst posterior probabilities are after the forward slash. Isolate names in bold were sequenced as part of this study. All other isolate sequences were obtained from GenBank. The aligned ITS dataset contained 42 taxa, 395 characters of which 58 were informative. Parsimony analysis produced 10 trees of 63 steps (CI=0.952, R1=0.975, g1=-3.537537, minimum length= 60, maximum length=179) (tree 1 of 10 is presented here).

CBS 130562 SO4 LSJ2 BC7 LSJ3 BC9 BC8 BC6 BC10 BC5 BC1 BC2 BC3 BC14 BC4 BC15 BC11 BC12 BC13 CBS 130569 LSJ1 QC8 QC11 QC9 CBS 391.59 CBS 100042 AGASSIZ ARCADIA DAVID5 a. Sphaerulina musiva 100 72 Sphaerulina populicola 92 100 Fig. S2 Concatenated five-gene region maximum parsimony (a) and Bayesian (b) phylogram of Sphaerulina species. Statistical support for the topology is presented when bootstrap values are above 70% and posterior probabilities are above 95%. Isolate names in bold were sequenced as part of this study. All other isolate sequences were obtained from GenBank. For the maximum parsimony tree, the combined dataset of five gene regions contained 1764 characters of which 82 were informative and produced 400 equal MP trees of 141 steps (CI=0.74, RI=0.91 g1=-1.0635, minimum length=105, maximum length=522) (tree 1 of 400 is presented). Sphaerulina populi b. 0.02 BC5 BC1 BC2 BC3 BC14 CBS 130562 CBS 130569 LSJ1 BC11 BC4 BC15 BC12 BC13 SO4 LSJ2 BC7 LSJ3 BC9 BC6 BC8 BC10 AGASSIZ CBS100042 QC9 ARCADIA DAVID5 QC8 QC11 CBS391.59 Sphaerulina musiva Sphaerulina populicola Sphaerulina populi 1 0.99 0.98 0.97 From eastern Canada From western Canada

Fig. S3 Statistical parsimony network of the 73 Sphaerulina musiva isolates using the sequence dataset (concatenated sequence of eight genes used in the population genetic analysis). Each hatch mark indicates a single mutation.

Fig. S4 STRUCTURE analysis (with no priors) of 301 unlinked SNPs found amongst clone-corrected populations of 65 Sphaerulina musiva isolates. (a) Log likelihood values of DK values against the ranging K values for the STRUCTURE analysis. (b) Standard deviation of the mean likelihoods against the ranging K values for the STRUCTURE analysis. (c) STRUCTURE barplots from K2- K6; supported K are highlighted in orange. Abbreviations: LSJ= Leclercville, LEC= Lac Saint-Jean, SO= Saint-Ours, TH= Thetford-Mines, AB= Abitibi-Témiscamingue, KEN= Kentucky, PEN= Pennsylvania and MISS= Missouri, ND= North Dakota, WIS= Wisconsin and MIN= Minnesota.

Fig. S5 Scenarios tested. 1-4) Round One- Determining the source population: 1) independent introduction scenario, 2) three-way split at t7, 3) three-way split at T6, 4) serial introduction scenario; 5-14) Round Two- Formation of MWUS: 5-8) Single-source population contributing to the formation of the MWUS group- Class I (without a bottleneck) and Class II (with a bottleneck), 9-14) Two-source populations contributing to the formation of the MWUS group- Class III (without a bottleneck) and Class IV (with a bottleneck); 15-29) Round Three- Formation of the BC population: 15-20) Single source population contributing to the formation of the BC population- Class I (without a bottleneck) and Class II (with a bottleneck), 21-29) Two source populations contributing to the formation of the BC population- Class III (without a bottleneck) and Class IV (with a bottleneck); 30-34) Round Four- Formation of the Alberta population: Class I (without a bottleneck) and Class II (with a bottleneck). *Tested with and without a bottleneck after the formation of each population. # Indicates the admixed population is formed with an admixture rate relative to the (ra) population (and 1-rate to the other contributing population) this was tested in both directions. Abbreviations: ?= unsampled ghost population; EC= eastern Canada [QUE= Québec (Lac Saint-Jean, Leclercville, Saint-Ours and Thetford-Mines) and ONT= Ontario]; EUS= eastern US (KEN= Kentucky, PEN= Pennsylvania and MISS= Missouri); MWUS=mid-west US (ND= North Dakota, WIS= Wisconsin and MIN= Minnesota); and WC= western Canada (SAS= Saskatchewan, ALB= Alberta and BC= British Columbia).

Round 1: ancestral populations 1. 2. 3. 4. t9 19000-100000 t8 10000-26500 t7 6-10000 t6 6-10000 PI PJ PK PI PJ PK PI PJ PK PI PJ PK pd Class I* Class II* Class III* Class IV* Round 2: formation of MWUS 7. SAS WUS EUS EC 8. 6. 5. t9 19000-100000 WUS EUS SAS EC SAS WUS EUS EC WUS EUS SAS EC t8 10000-26500 t7 6-10000 t6 6-10000 t5 6-10000 t4 6-10000 pd Class I and Class II*: single source t9 19000-100000 t8 10000-26500 t7 6-10000 t6 6-10000 pd t4 6-10000 t5 6-10000 10. 11. 9. WUS EUS SAS EC 1- ra ra WUS EUS SAS 1- ra ra EC WUS EUS SAS 1- ra ra EC t9 19000-100000 t8 10000-26500 t7 6-10000 t6 6-10000 pd t4 6-10000 t5 6-10000 12. WUS EUS SAS EC 1- ra ra 13. EC WUS EUS SAS 1- ra ra 14. EC WUS EUS SAS 1- ra ra Class III and Class IV*#: multiple sources Fig. S5 Scenarios tested. 1-4) Round One- Determining the source population: 1) independent introduction scenario, 2) three-way split at t7, 3) three-way split at T6, 4) serial introduction scenario; 5-14) Round Two- Formation of MWUS: 5-8) Single-source population contributing to the formation of the MWUS group- Class I (without a bottleneck) and Class II (with a bottleneck), 9-14) Two-source populations contributing to the formation of the MWUS group- Class III (without a bottleneck) and Class IV (with a bottleneck). Continued next page. *Tested with and without a bottleneck after the formation of each population. # Indicates the admixed population is formed with an admixture rate relative to the (ra) population (and 1-rate to the other contributing population) this was tested in both directions. Abbreviations: ?= unsampled ghost population; EC= eastern Canada [QUE= Québec (Lac Saint-Jean, Leclercville, Saint-Ours and Thetford-Mines) and ONT= Ontario]; EUS= eastern US (KEN= Kentucky, PEN= Pennsylvania and MISS= Missouri); MWUS=mid-west US (ND= North Dakota, WIS= Wisconsin and MIN= Minnesota); and WC= western Canada (SAS= Saskatchewan, ALB= Alberta and BC= British Columbia)

Round 3: formation of BC 15. EC EUS SAS BC 16. 17. t9 19000-100000 t8 10000-26500 t7 6-10000 t6 6-10000 pd t4 6-10000 t5 6-10000 t3 6-10000 t2 6-100 EC EUS SAS BC ra ra-1 ra ra-1 ra ra-1 WUS BC EC EUS WUS SAS WUS 20. t9 19000-100000 t8 10000-26500 t7 6-10000 t6 6-10000 pd t4 6-10000 t5 6-10000 t3 6-10000 t2 6-100 18. EC EUS SAS BC 19. EC SAS EUS ra ra-1 WUS BC ra ra-1 WUS ra ra-1 WUS EC EUS BC SAS Class I and Class II*: single source 22. 23. 21. t9 19000-100000 t8 10000-26500 t7 6-10000 t6 6-10000 pd t4 6-10000 t5 6-10000 t3 6-10000 t2 6-100 EC SAS EUS ra ra-1 WUS BC EC SAS EUS ra ra-1 WUS BC EC SAS EUS ra ra-1 WUS BC t9 19000-100000 t8 10000-26500 t7 6-10000 t6 6-10000 pd t4 6-10000 t5 6-10000 t3 6-10000 t2 6-100 24. EC SAS EUS ra ra-1 WUS BC 25. 26. EUS SAS BC ra ra-1 EC SAS EUS ra ra-1 WUS BC ra ra-1 EC WUS Class III and Class IV*#: multiple sources Fig. S5 cont 15-29) Round Three- Formation of the BC population: 15-20) Single source population contributing to the formation of the BC population- Class I (without a bottleneck) and Class II (with a bottleneck), 21-29) Two source populations contributing to the formation of the BC population- Class III (without a bottleneck) and Class IV (with a bottleneck). Continued next page. *Tested with and without a bottleneck after the formation of each population. # Indicates the admixed population is formed with an admixture rate relative to the (ra) population (and 1-rate to the other contributing population) this was tested in both directions. Abbreviations: ?= unsampled ghost population; EC= eastern Canada [QUE= Québec (Lac Saint-Jean, Leclercville, Saint-Ours and Thetford-Mines) and ONT= Ontario]; EUS= eastern US (KEN= Kentucky, PEN= Pennsylvania and MISS= Missouri); MWUS=mid-west US (ND= North Dakota, WIS= Wisconsin and MIN= Minnesota); and WC= western Canada (SAS= Saskatchewan, ALB= Alberta and BC= British Columbia)

Round 3: formation of BC cont. 27. 28. 29. t9 19000-100000 t8 10000-26500 t7 6-10000 t6 6-10000 pd t4 6-10000 t5 6-10000 t3 6-10000 t2 6-100 EC EUS SAS BC ra ra-1 WUS EUS SAS ra ra-1 BC EC EUS SAS BC ra ra-1 WUS ra ra-1 WUS ra ra-1 EC Class III and Class IV*#: multiple sources cont Round 4: formation of Alberta t9 19000-100000 t8 10000-26500 t7 6-10000 t6 6-10000 pd t4 6-10000 t5 6-10000 t3 6-10000 t2 6-100 EC EUS SAS BC ra ra-1 ALB WUS 30. EC EUS SAS BC ra ra-1 ALB WUS 31. EC EUS SAS BC ra ra-1 ALB WUS 32. t1 6-100 33. 34. EC EUS SAS BC ra ra-1 ALB WUS t9 19000-100000 t8 10000-26500 t7 6-10000 t6 6-10000 pd t4 6-10000 t5 6-10000 t3 6-10000 t2 6-100 EC EUS SAS BC ra ra-1 ALB WUS t1 6-100 Class I and Class II*: single source Fig. S5 cont 21-29) Two source populations contributing to the formation of the BC population- Class III (without a bottleneck) and Class IV (with a bottleneck); 30-34) Round Four- Formation of the Alberta population: Class I (without a bottleneck) and Class II (with a bottleneck). *Tested with and without a bottleneck after the formation of each population. # Indicates the admixed population is formed with an admixture rate relative to the (ra) population (and 1-rate to the other contributing population) this was tested in both directions. Abbreviations: ?= unsampled ghost population; EC= eastern Canada [QUE= Québec (Lac Saint-Jean, Leclercville, Saint-Ours and Thetford-Mines) and ONT= Ontario]; EUS= eastern US (KEN= Kentucky, PEN= Pennsylvania and MISS= Missouri); MWUS=mid-west US (ND= North Dakota, WIS= Wisconsin and MIN= Minnesota); and WC= western Canada (SAS= Saskatchewan, ALB= Alberta and BC= British Columbia)