Dan Wright Developing Algorithms for Computational Comparative Diachronic Historical Linguistics.

Slides:



Advertisements
Similar presentations
Every edge is in a red ellipse (the bags). The bags are connected in a tree. The bags an original vertex is part of are connected.
Advertisements

Phone An Animated and Narrated Glossary of Terms used in Linguistics presents.
Phonics Information.
Major branches of phonetics 1. Experimental – How are speech sounds studied? 2. Articulatory – How are speech sounds produced? 3. Acoustic – What is the.
Emerging Spelling: Stages and Teaching Strategies
Teppo Räisänen School of Business and Information Management Oulu University of Applied Sciences.
CIED 4013: Capstone Course for Foreign Language Licensure Language Sounds Chapter Three Dr. Freddie A. Bowles.
NLP and Speech Course Review. Morphological Analyzer Lexicon Part-of-Speech (POS) Tagging Grammar Rules Parser thethe – determiner Det NP → Det.
Phonetics The study of productive sounds within a language 2 Basic types of sounds in English: Consonants (C): restriction on airflow Vowels (V): no restriction.
SPEECH PERCEPTION The Speech Stimulus Perceiving Phonemes Top-Down Processing Is Speech Special?
Introduction to Speech Production Lecture 1. Phonetics and Phonology Phonetics: The physical manifestation of language in sound waves. –How sounds are.
JPN494/598: History of the Japanese Language Introduction.
Chapter three Phonology
Lecture 5: Chapter 4: The sounds of language Lecturer: Haifa Alroqi
The Effect of Incongruent Visual Cues on the Heard Quality of Front Vowels Hartmut Traunmüller Niklas Öhrström Dept. of Linguistics, University of Stockholm.
The IPA Chart An Animated and Narrated Glossary of Terms used in Linguistics presents.
Phonics Instruction II: Moving on to Long Vowels.
EDC 424 Spring 2014 JMaggiacomo Development of Orthographic Knowledge.
Chapter 1: Structure of English
Diachronic Change in Loanword Constraint Rankings An analysis of multiple outputs for the same input in English Loanwords in Korean.
Phonetics and Phonology
Overview: Humans are unique creatures. Everything we do is slightly different from everyone else. Even though many times these differences are so minute.
1 Speech Perception 3/30/00. 2 Speech Perception How do we perceive speech? –Multifaceted process –Not fully understood –Models & theories attempt to.
Explanation. -Status of linguistics now and before 20 th century - Known as philosophy in the past, now new name – Linguistics - It studies language in.
Phonemic Awareness = Phonics. Phonemic Awareness w The understanding that spoken words are made up of a series of discrete sounds Is different from Phonics:
Investigating the Ancient Meroitic Language Using Statistical Natural Language Techniques: Zipf’s Law and Word Co-Occurrences Reginald Smith August 10,
Managing XML and Semistructured Data Lecture 13: XDuce and Regular Tree Languages Prof. Dan Suciu Spring 2001.
Parsing Lecture 5 Fri, Jan 28, Syntax Analysis The syntax of a language is described by a context-free grammar. Each grammar rule has the form A.
Split infinitive You need to explain your viewpoint briefly (unsplit infinitive) You need to briefly explain your viewpoint (split infinitive) Because.
Alphabet Fun Hillary Bordeaux. Essential Questions: Why do we need to know the alphabet? When do we use the alphabet in our everyday life?
Chapter 2: Linguistic Organization Mafuyu Kitahara
A Survey of English Lexicology
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Learning Phonetic Similarity for Matching Named Entity.
Hello, Everyone! Part I Review Review questions 1.In what ways can English consonants be classified? 2. In what ways can English vowels be classified?
Chapter II phonology II. Classification of English speech sounds Vowels and Consonants The basic difference between these two classes is that in the production.
Essential Question Why is the Alphabet important? Why do we need to know how to write each letter?. Unit Question Can you show me how to write each letter.
LIN 3201 Sounds of Human Language Sayers -- Week 1 – August 29 & 31.
Levels of Linguistic Analysis
Proposed Vedic Sanskrit Coding Scheme: Some suggestions Akshar Bharati Amba Kulkarni Department of Sanskrit Studies University of Hyderabad Hyderabad
26/01/20161Gianluca Demartini Ranking Categories for Faceted Search Gianluca Demartini L3S Research Seminars Hannover, 09 June 2006.
1 Friends and Neighbors on the Web Presentation for Web Information Retrieval Bruno Lepri.
BINARY TREES Objectives Define trees as data structures Define the terms associated with trees Discuss tree traversal algorithms Discuss a binary.
Phonetics and Phonology.
STD Approach Two general approaches: word-based and phonetics-based Goal is to rapidly detect the presence of a term in a large audio corpus of heterogeneous.
Providing Learning Innovations and Curriculum Solutions Strengthening Our Teaching Skills in Reading & Writing Mary Mount Easter Institute Bogota, Columbia.
Ɑ rt ɪ ky ə leš ə n d ɪ s ɔ rd ə r l ɛ s ə n ( Articulation Disorder Lesson ) By: Juan Palma.
TDS-Curator DANS MPI for Psycholinguistics Utrecht Institute of Linguistics OTS languagelink.let.uu.nl/tds/ 9/21/20101CLARIN-NL - Call 1 - ISOcat status.
The Core of Linguistics. Phonetics Speech sounds are produced by human beings. Then transmitted through the medium of air in the form of sound waves,
English Pronunciation Clinic Week 1: Phonemes
Korean Phoneme Discrimination
Adding s or ies to words ending in Y
Improving voice and diction Introduction
an Introduction to English
Phonetics Lauren Dobbs.
Segments and Divergences.
PHONETICS They spell it "da Vinci" and pronounce it "da Vinchy". Foreigners always spell better than they pronounce. (Mark Twain)
College of Engineering
Consonant variegations in first words: Infants’ actual productions of
1. Phonetics 1.1 Introduction
An Animated and Narrated Glossary of Terms used in Linguistics
Job Google Job Title: Linguistic Project Manager
Cuddington & Dinton C of E school
عمادة التعلم الإلكتروني والتعليم عن بعد
Levels of Linguistic Analysis
Rohit Kumar *, Amit Kataria, Sanjeev Sofat
Kindergarten/1st Grade
Fundamentals of Sensation and Perception
English phonetic symbols (Consonant Symbols)
Morphology Mrs. Veena Dixit 14/9/04 Mrs. Veena Dixit 14/9/04
English vs Spanish!. Extra Facts Spanish is a Romance language and is part Spanish is a Romance language and is part of the Indo-
Presentation transcript:

Dan Wright Developing Algorithms for Computational Comparative Diachronic Historical Linguistics

Historical Linguistics ● Historical linguistics is the study of how language changes over time. ● Languages split into groups, forming a hierarchy or web of languages, each related to its ancestors ● All changes in language are completely regular, so they can be analyzed and to a degree discovered from the current state of the descendant languages.

Phonetics ● The fundamental unit of language is the phoneme. ● In order to analyze language, one must first devise a method to deal with phonemes. ● Phonemes can be classified on five axes, using the separations of the International Phonetic Alphabet.

Phoneme Categorization

Phoneme Storage Vowels Roundedness Openness Frontness Offset Consonants Voicedness Place of Articulation Method of Articulation Not used Vowel or Consonant

Correspondence ● My first attempts to analyze the web- structures of languages was by measuring correspondence between languages. ● I ran lists of words through algorithms which measured how much certain phonemes and axial structures matched up. ● I attempted to build a web of languages from the bottom up, connecting languages through correspondence.

● But there is a better way!

From the Top Down! ● My second approach to web formation was to start with all of the languages in one organization. ● I then separated them into languages which are more related to each other than a regressed hypothetical ancestor language. ● This was recursively applied to the new families.

Conclusion ● My top-down approach was able to somewhat reliably separate languages into their actual categories based on phonetics alone.