Language evolution Brian O’Meara EEB464 Fall 2016.

Could languages evolve? A: Yes B: No

Could languages evolve through natural selection? A: Yes B: No

Gavin et al. 2013

Pagel 2009

The Austronesian language family is one of the largest in the world, with around 1200 languages spread from Taiwan to New Zealand and Madagascar to Easter Island. We have constructed a large database of Austronesian basic vocabulary (23, 26), which stores 210 items of basic vocabulary from each language, including words for animals, kinship terms, simple verbs, colors, and numbers. Basic vocabulary is both relatively stable over time and generally less likely to be borrowed between languages (27). From this database, a team of linguists identified the sets of homologous words (“cognates”) following the linguistic comparative method (28). We extracted the cognate sets for 400 well-attested languages for analysis. These languages comprise a third of the entire family and include a representative sample of each recognized Austronesian subgroup.

We included two non-Austronesian languages as outgroups to “root” the trees: an archaic variant of the Sino-Tibetan language Chinese that was spoken between 2300 and 2900 years B.P. and the Tai-Kadai language Buyang (28). These languages are not traditionally part of the Austronesian family, but a number of cognates have been identified (29). The cognate sets for all 210 meanings across these 400 languages were encoded into a binary matrix. Identified “borrowings” between languages were removed from further analyses. Simulation studies have shown that the amount of undetected borrowing needs to be very substantial (>20%) to substantially bias either the tree topology or the date estimates (30). The resulting matrix contained a total of 34,440 characters (twice the length of whole mitochondrial genomes), and 6436 of these characters were parsimony informative.

Gray et al. 2009

Activity: figure out common words in multiple languages, put on board
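The binary encoding described in the Gray et al. passage can be sketched in a few lines of Python. This is a minimal illustration and not the authors' pipeline. The language names, meanings, and cognate-class labels (`h1`, `t1`, and so on) are hypothetical placeholders, and the function names are invented for this example. It assumes each cognate set becomes one presence/absence character, and that a binary character is parsimony informative when each state occurs in at least two languages. Missing entries are coded as 0 here for simplicity; a real analysis would normally code them as unknown.

```python
def encode_cognates(cognate_sets):
    """Turn {language: {meaning: set of cognate-class labels}} into a binary matrix.

    Each (meaning, cognate class) pair becomes one column; a cell is 1 if the
    language has a word in that cognate class, otherwise 0.
    """
    languages = sorted(cognate_sets)
    characters = sorted({
        (meaning, cls)
        for entries in cognate_sets.values()
        for meaning, classes in entries.items()
        for cls in classes
    })
    matrix = [
        [1 if cls in cognate_sets[lang].get(meaning, set()) else 0
         for meaning, cls in characters]
        for lang in languages
    ]
    return languages, characters, matrix


def parsimony_informative(matrix):
    """Return column indices where both states occur in at least two rows."""
    informative = []
    for j in range(len(matrix[0])):
        ones = sum(row[j] for row in matrix)
        zeros = len(matrix) - ones
        if ones >= 2 and zeros >= 2:
            informative.append(j)
    return informative


# Hypothetical toy data: five languages, two meanings.
toy = {
    "LangA": {"hand": {"h1"}, "two": {"t1"}},
    "LangB": {"hand": {"h1"}, "two": {"t1"}},
    "LangC": {"hand": {"h2"}, "two": {"t1"}},
    "LangD": {"hand": {"h2"}, "two": {"t2"}},
    "LangE": {"hand": {"h3"}, "two": {"t1"}},
}

languages, characters, matrix = encode_cognates(toy)
informative = parsimony_informative(matrix)

for lang, row in zip(languages, matrix):
    print(lang, row)
print("informative characters:", [characters[j] for j in informative])
```

In the toy data only the `h1` and `h2` classes for "hand" are informative: a class found in a single language, or in all but one, cannot favor one tree over another under parsimony. That is why only a fraction of the characters in a matrix like this one count as parsimony informative.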

Gray et al. 2009

Pagel et al. 2013

Pagel et al. 2013

Pagel et al. 2013

Altschuler et al. 2013

Altschuler et al. 2013