DNA barcoding and evolutionary relationships in Accipiter Brisson,1760 (Aves, Falconiformes: Accipitridae) with a focus on African and Eurasian representatives.

Slides:



Advertisements
Similar presentations
Classification of Organisms
Advertisements

Introduction Classification Phylogeny Cladograms Quiz
DNA barcoding and evolutionary relationships in Accipiter Brisson,1760 (Aves, Falconiformes: Accipitridae) with a focus on African and Eurasian representatives.
Taxonomy & Phylogeny Classification of Organisms.
Lichens and Ascomycota broadly Alternative markers to COI ITS.
 Aim in building a phylogenetic tree is to use a knowledge of the characters of organisms to build a tree that reflects the relationships between them.
THE EVOLUTIONARY HISTORY OF BIODIVERSITY
Fig Copyright © 2008 Pearson Education, Inc., publishing as Pearson Benjamin Cummings Overview: Investigating the Tree of Life Phylogeny is the.
Nomenclature is the science of naming organisms Evolution has created an enormous diversity, so how do we deal with it? Names allow us to talk about groups.
Plant Molecular Systematics (Phylogenetics). Systematics classifies species based on similarity of traits and possible mechanisms of evolution, a change.
Reconstructing and Using Phylogenies
Phylogeny and Systematics
1 Apply Concepts To an evolutionary taxonomist, what determines whether two species are in the same genius 2 Explain What is a derived character 3 Review.
Molecular Evolution Revised 29/12/06
Ten species in one: DNA barcoding reveals cryptic species in the neotropical skipper butterfly Astraptes Fulgerator Paul Hebert, Erin Penton, John Burns,
Bell Work Dogs of a certain breed can have black fur or white fur. Black fur is dominant, but the breeder only wants puppies with white fur. Cross two.
Chapter 2 Opener How do we classify organisms?. Figure 2.1 Tracing the path of evolution to Homo sapiens from the universal ancestor of all life.
Topic : Phylogenetic Reconstruction I. Systematics = Science of biological diversity. Systematics uses taxonomy to reflect phylogeny (evolutionary history).
Phylogeny & The Tree of Life. Phylogeny  The evolutionary history of a species or group of species.
Phylogeny and the Tree of Life
Systematics the study of the diversity of organisms and their evolutionary relationships Taxonomy – the science of naming, describing, and classifying.
Classification and Systematics Tracing phylogeny is one of the main goals of systematics, the study of biological diversity in an evolutionary context.
Chapter 26: Phylogeny and the Tree of Life Objectives 1.Identify how phylogenies show evolutionary relationships. 2.Phylogenies are inferred based homologies.
Bioinformatics 2011 Molecular Evolution Revised 29/12/06.
DNA barcoding: bane or boon (or both) for taxonomy? Donal A. Hickey, Concordia University, Montreal. Collaborators: Mehrdad Hajibabaei and Gregory Singer.
Warm-Up 1.Contrast adaptive radiation vs. convergent evolution? Give an example of each. 2.What is the correct sequence from the most comprehensive to.
PHYLOGENY AND THE TREE OF LIFE Chapter 26 Sections 1-3 and 6.
Diversity of Living Things 1.1: Biodiversity. Biodiversity Number and variety of species and ecosystems on Earth By the end of 2010, 1.7 million species.
Classification Section 18.2 & Phylogeny: Evolutionary relationships among organisms Biologists group organisms into categories that represent lines.
Classification Chapter 18.
Phylogeny & the Tree of Life
PHYLOGENY AND THE TREE OF LIFE CH 26. I. Phylogenies show evolutionary relationships A. Binomial nomenclature: – Genus + species name Homo sapiens.
Classification. Cell Types Cells come in all types of shapes and sizes. Cell Membrane – cells are surrounded by a thin flexible layer Also known as a.
Classification.
Identifying, Naming, and Classifying Species
Ayesha M.Khan Spring Phylogenetic Basics 2 One central field in biology is to infer the relation between species. Do they possess a common ancestor?
Chapter 25: Phylogeny and Systematics. “Taxonomy is the division of organisms into categories based on… similarities and differences.” p. 495, Campbell.
Phylogenetic analysis of flatfish species (Teleostei, Pleuronectiformes) based on cytochrome oxidase 1 (Co-1) and cytochrome b (Cyt-b) genes Sharina S.N.,
Chapter 26 Phylogeny and Systematics. Tree of Life Phylogeny – evolutionary history of a species or group - draw information from fossil record - organisms.
PHYOGENY & THE Tree of life Represent traits that are either derived or lost due to evolution.
Evolution common ancestor. Alvin Faulhafer Oleva Rose RobertMarjorie Ernest PatriciaWayne Linda Miller BradleyTimothy Diana Hyde.
Chapter 26 Phylogeny and the Tree of Life
Reconstructing and Using Phylogenies 16. Concept 16.1 All of Life Is Connected through Its Evolutionary History All of life is related through a common.
Phylogeny & Systematics The study of the diversity and relationships among organisms.
Section 2: Modern Systematics
Phylogeny and the Tree of Life
Introduction to Bioinformatics Resources for DNA Barcoding
Phylogeny & the Tree of Life
Classification of Organisms
Section 2: Modern Systematics
5.4 Cladistics.
Biological Classification: The science of taxonomy
Phylogeny and the Tree of Life
Warm-Up Contrast adaptive radiation vs. convergent evolution? Give an example of each. What is the correct sequence from the most comprehensive to least.
Hierarchical Classification vs. Systematics
Systematics: Tree of Life
Warm-Up Contrast adaptive radiation vs. convergent evolution? Give an example of each. What is the correct sequence from the most comprehensive to least.
Test 1 Outline.
Warm-Up Contrast adaptive radiation vs. convergent evolution? Give an example of each. What is the correct sequence from the most comprehensive to least.
Systematics: Tree of Life
Phylogeny and the Tree of Life
18.2 Modern Systematics I. Traditional Systematics
Warm-Up Contrast adaptive radiation vs. convergent evolution? Give an example of each. What is the correct sequence from the most comprehensive to least.
18.2 Modern Systematics I. Traditional ______________
Comparing read recruitment, de novo, and insertion tree strategies for phylogenetic diversity computation. Comparing read recruitment, de novo, and insertion.
Warm-Up Contrast adaptive radiation vs. convergent evolution? Give an example of each. What is the correct sequence from the most comprehensive to least.
Molecular data assisted morphological analyses
Warm-Up Contrast adaptive radiation vs. convergent evolution? Give an example of each. What is the correct sequence from the most comprehensive to least.
Chapter 26 Phylogeny and the Tree of Life
Chapter 20 Phylogeny and the Tree of Life
Presentation transcript:

DNA barcoding and evolutionary relationships in Accipiter Brisson,1760 (Aves, Falconiformes: Accipitridae) with a focus on African and Eurasian representatives Title page with from left to right Accipiter fasciatus, trivirgatus, madagascariensis, minullus and cooperii.

Nomenclature A. gentilis: Nl; havik, Eng; gosshawk, Fr; Autour des palombes A. nisus: Nl; sperwer, Eng; sparrowhawk, Fr; epervier d’Europe Buteo buteo: Nl; buizerd, Eng; Common buzzard Fr; Buse variable Buteo albicaudatus; Nl; witstaart buizerd, Eng; white tailed hawk, Fr; Buse à queue blanche Justification for using the latin names throughout the presentation rather than vernacular names. Latin is the language used in science at least for names of species and names in different languages are confusing.

Introductory “sheet”. (Accipiter are bird hunters)

Content DNA barcoding a short introduction Coverage Methods General DNA barcoding Mini barcodes 10X threshold, best compromise threshold BCM and BM criteria Character based identification Coverage Sample origins The dataset Methods Character based method (CB) The 10 X intraspecific distance threshold (10 X) The Best compromise threshold (BCTh) Best match and best close match explained Results for DNA barcoding Character based identification Different thresholds Phylogeny reconstruction 291 bp dataset 647 bp dartaset Discussion Conclusion

DNA barcoding Identification of species with an agreed upon (part of a) gene (COI) 5’-3’ part of Cytochrome C Oxidase subunit I ~650bp in length Polymerase chain reactions and Sanger sequencing What marker do we use, shortly mention PCR and that it is a common technique. The picture serves as the basis for the lower picture where already a first start of DNA barcoding may be seen (as colours the colourfull lines each represent a specimen)

Mini barcodes Genetic material from museums is often degraded Shorter fragments are needed E.g. sequences of 250-300bp or, in some cases even shorter, also work for identification purposes Explain that through time and certain environmental influences (temp, water, radioation) DNA degrades. This illustrates the reason for selecting short fragments, because longer fragments are perfectly possible with fresher material. Explain that the shorter sequences mentioned are still part of the DNA barcode region. This is why they are usefull as well for identifying species. Sonet et al. in press

Sample coverage and origins African and European taxa are well represented The new world partially represented Australia and S-E Asia poorly represented Most samples from RMCA/RBINS (n=83/9) Supplemented with available genbank sequences (n=28) Most, currently recognized, species of African and European Accipiter are represented in this study by one or more samples. The new world samples are almost all from genbank (except A. gundlachii). Unfortunatly Australia ans SE Asia are underrepresented. Samples are from all sources, however most of them are from RMCA/RBINS samples. Explain that about half of the samples included is more than 20 years old, which means that they are certainly degraded. Even the moderns samples are degraded when they are not kept properly.

The resulting dataset 25 European and African Accipiter species with a total of 140 specimens sequenced for a 291bp (mini) and 647bp (standard) BC fragment of the COI gene three datasets analysed: dataset A: 25 species, 291bp dataset B: 19 species, 647bp dataset C: 19 species, 291bp This sheet continues from the previous one. It details the exact amount of samples used for this study and the division in datasets. Dataset C is used the investigate the effect of species coverage and sequence length in more detail. This is not an important part of this presentation.

Character based indentification Easy it is about detecting unique differences May be unreliable with insufficient sampling No threshold whatsoever is used Here the first analysis method is introduced. Character based identification is intuitive and straight forward. One just compares sequences and notes the differences. Note that you may compare as many sequences together as you like from as many species, as long as a given position uniquely separates it from the others. Note that the position or substitution type does not matter at all.

Average 10 X intraspecific distances and the barcoding gap Arbitrary Does not always hold (Tephritidae) However, works well in birds Speak shortly about distances (added up differences, see previous sheet) and that thresholds are determined which makes analysis easier and less ad hoc than the previous method. With these methods it already matters what the changes in base comopistion are (mention the K2P parameter here). Mention that it seemed like a good idea to use the average 10 X intra-specific distance value as a good idea for species boundaries. However it is not based on anything other than a majority of tested species met this criteria. Note also that such a value has nothing to do with the biology of a group under study. Then tell that it does not always hold, in particular when intraspecific distances are very small or very big. Note that when analysing DNA barcodes you would like to have a clear difference in the distribution of intra and interspecific distances, but that the figure below approaches the real situations better and that this is an illustration of the difference between the theoretical and the real datasets (which are rarely, if ever, ideal).

The Best compromise threshold (BCTh) Based on the intercept between cumulative intra and interspecific distances Based on the dataset under study rather than a predetermined threshold Results from one dataset not applicable to another Here introduce a different methods. Explain that weak overlap is preferable because you want to identify a high percentage of samples. Refer to the fact that this is not always the case, however this method bases its threshold on the dataset under study and is set up to achieve the highest possible percentage. This allows to also detect new species, id errors, cryptic species are species cluster that are not well defined. Note that this approach only goes for the group/dataset under study. Inclusion of even a few species (especially in a small datset) can lead to a shifting of the threshold, however it reflects the new best optimum rather than simply a predetermined value. Explain what is where. Under circle one are the taxa that are identified on (for instance) morphological basis as species however their sequences overlap with other (ambiguous). This also includes the wrongly identified species. Circle two is where the potentially new taxa are, here taxa with high intraspecific distances are. Give an example if necesary. Example two sequences identified as one species differ more from each other than the threshold. What are they? Lefébure et al. 2006

BM and Best Close match Best Match (BM), assignes the species name of its best-matching sequence regardless of how similar the query and reference sequences were. Identification is considered correct when both sequences were from the same species, incorrect if the query species differed from the closest reference species or ambiguous if multiple species were the BM of the query species Best close match (BCM) also considers the threshold This sheet introduces the methods to analyse the dataset with differnt methods. The difference between the methods is in discriminating impossible identifications (pointing to single sequences and new potentially new species). Een nieuwe soort, ‘foutief’ geïdentificeerd als soort A, heeft dan een sequentie met een groter verschil dan de gekozen drempelwaarde.

BM vs. BCM example 4 sequences 3 species: A. francesiae (n=2) A. badius (n=1) & A. toussenelii (n=1) BM would assign A. badius and A. toussenelii to the “ambiguous” or “wrong” (dependent on the threshold) categories BCM would classify them as impossible when the difference is larger than the threshold This example dataset is an example and can be used to illustrate the differences between the approaches. Consider the different results for the different approaches with the thresholds of 3% and 10% respectively.

Results character based identification This shows a few results of groups that are closely related or poorly supported by the phylogeny (illustrated later in the presentation). This is more nuanced that blindly looking at the entire sequence. Rather it focuses on the relevant part.

Results character based identification Note that although A. unduliventer has only one unique character, that if you choose only part of the sequences it is still very separable. (Of course a more samples are included it may occur that one A. unduliventer has a mutation at site 96 as well and then the poistion is not unique anymore. Finally identifying either of the three species is possible even if you look at a small part of the gene, lets say the last three bases. The 2nd example show an entire sequence of unique differences between A. minullus and A. erythropus and is straightforward.

Results BCTh vs. 10 X distance threshold An overview of the results. Yellow corresponds the the BCTh and grey to the 10 X thresholds in both figures. Also not the differecences between the BM and the BCM approaches. Here BCM identifies sequences that would otherwise be wrong or ambiguous

Results BCTh vs. 10 X distance threshold Different thresholds Partial overlap Caused by A. nisus/rufiventris A. virgatus/gularis A. cooperii/gundlachi At or just below the BC treshold are A. badius subsp A. gentilis subsp A closer look at the 291bp dataset and the DNA barcoding results. This shows species with very similar sequences (i.e. they are wrongly identified or ambiguous.). Then The A. badius shows the differences between the threshold and shows that they may classified with A. brevipes under the 10 X criterium and as such would be ambiguous or that they would be classified as correct with the BCTh. Even though A. badius sequences may end up in the “incorrect” categorie there because some sequences are now more different from each other than the BCTh threshold.

Results BCTh vs. 10 X distance threshold: Accipiter nisus example A. rufiventris A. nisus These species have similar sequences and are located below the threshold for species identification

Results BCTh vs. 10 X distance threshold: A. Badius and A Results BCTh vs. 10 X distance threshold: A. Badius and A. brevipes example These species are located at or slightly above the BCTh and below the 10 X distances threshold.

Phylogeny reconstruction Hypothesis of evolution Clades are units of supported taxa (related by direct descent) This dataset is only suited for low taxonomic level inferences One genegene tree rather than species tree Comparison with the current taxonomy based on morphology and morphometrics This explain the possibilities and limitations of the dataset under study when applying phylogenetic analyses to it. The first two points define to terms the 2nd two explain the limitations and the thirds servers to explain the use.

Phylogeny based on 291bp 11 supported clades [tachiro] A. minullus A. tachiro subclades A. minullus A. francesiae/ soloensis [cooperii] A. badius/ brevipes A. nisus/ striatus/ erythronemius A. ovampensis/ madagascariensis [virgatus] including A. fasciatus An overview of the phylogeny based on 291bp of COI. Dashed boxes indicate clades supported by three (any three) of the analysis methods used. They are still counted as supported, thus the number of 11 is reached

Phylogeny based on 291bp a closer look This details two clades of the phylogeny.

Phylogeny based on 291bp a closer look: Accipiter tachiro example The clade with A. tachiro in it is worth noting because it clearly includes A. castanilius as sister to all the other taxa and it shows the distinctive position of A. unduliventerwithin the wider A. tachiro clade.

Phylogeny based on 291bp a closer look: Accipiter badius and A Phylogeny based on 291bp a closer look: Accipiter badius and A. brevipes example The clade with A. badius and A. brevipes is remarkable because A. badius is not supported as monophyletic, rather the relationship with A. brevipes remains unresolved

Phylogeny based on 647bp 19 supported clades [tachiro] A. tachiro A. toussenelii & subsp A. soloensis & A. francesiae subsp [gentilis] [cooperii] [nisus] (without A. ovampensis/ madagascariensis [minullus] (without A. erythropus) [virgatus] including A. fasciatus An overview of the phylogeny based on 647bp of COI. Dashed boxes indicate clades supported by three (any three) of the analysis methods used. They are still counted as supported, thus the number of 19 is reached, this more than the 11 with the 291bp dataset. It shows that longer sequences carry more signal.

Phylogeny based on 647bp a closer look Detailed observations on the phylogeny.

Phylogeny based on 647bp a closer look: The example from Accipiter soloensis and A. francesiae The clade of A. francesiae with A. soloensis is remarkable because A. soloensis is from Asia and A. francesiae is from the Comoros and Madagascar. They are sisterspecies. This implies some colonisatioin event in the past. Presumable from Asia, although without prior knowledge that cannot be concluded from this dataset. The last common ancestor must have occured over a wider range or may have had different migration routes. Also worth noting is the support of some of A. francesiae’s subspecies.

Phylogeny based on 647bp a closer look: an example with Accipiter gentilis and A. melanoleucus The clade with A. gentilis and A. melanoleucus is noteworthy because it remains unresolved. A. gentilis atricapillis is not closer related to A. gentilis gentilis than it is to A. melanoleucus. This implies some morphological evolution in A. melanoleucus. This clade clearly deserves more study. Based on these results the species or subspecies status of either of the taxa is debatable. (A. melanoleucus is a species and A. g. gentilis and A.g. atricapillus are ranked as susbspecies)

Discussion Thresholds Species concepts Support values Sampling The 10 X threshold would have considered A. brevipes and A. badius, as well as A. gentilis and A. melanoleucus as belonging to the same species and Species concepts Morphological Distances based Phylogenetic Support values Models and interpretation remain issues of discussion Sampling Incomplete or lacking in some cases Take home message: The different threshold may have (subtle) influence on the outcome of the DNA barcoding analysis The species concept used and tested against must always be kept in memory and must be well defined. The three mentioned here are all compared in the study. Morphological was being tested. The distances based appraoch was treated via DNA barcodig and the phylogenetic concept with the phylogeny reconstruction The values remain debated, but the ones observed here are generally very good and as all four usually confirm each other there is some confidence in the observed patterns Sampling must be increased as it can be seen that sister species have spread all over the world (also not this in the A. nisus/striatus/erythronemius clade)

Conclusions Molecular techniques provide a powerfull method for reviewing taxonomy Museum material works! The content of the dataset matters COI can identify 19 out of the 25 species included COI carries a phylogenetic signal suitable for molecular taxonomy on a low taxonomic level Parts of genus Accipiter may need revision

Thank you for your attention If this ever happens to you, now you know what birds to look for for help Thank you for your attention