CS626-449: Speech, NLP and the Web/Topics in AI Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture-27: Phonology (quiz took place on 12/10/09; Lect 26.

Slides:



Advertisements
Similar presentations
CS : Speech, NLP and the Web/Topics in AI
Advertisements

Pushpak Bhattacharyya CSE Dept., IIT Bombay 31st March, 2011
Introduction to linguistics
Phonetics.
Hello, Everyone! Review questions  Give examples to show the following features that make human language different from animal communication system:
CS460/626 : Natural Language Processing/Speech, NLP and the Web (Lecture 35– Phonetics and phonology; syllabification) Pushpak Bhattacharyya CSE Dept.,
Phonetics The study of the sounds of spoken language.
Pushpak Bhattacharyya CSE Dept. IIT Bombay 1st Nov, 2012
CS : Speech, NLP and the Web/Topics in AI
Digital Systems: Hardware Organization and Design
Today  Parts of vocal tract used in producing consonants  Articulatory Description of consonants Readings: it’s all about air!
Phonetics (Part 1) Dr. Ansa Hameed.
Speech Anatomy and Articulation
Lecture 2: Phonology (1) Shao Junzong.
English Phonetics and Phonology Lesson 3B
Chapter 6 Features PHONOLOGY (Lane 335).
Recap: Vowels & Consonants V – central “sound” of the syllable C – outer “shell” of the syllable (C) V (C) (C)(C)(C)V(C)(C)(C)
Phonetics and Phonology 1.4; 3.1, 3.2, 3.3, 3.4, 3.5 (ex.) 4.1, 4.2, 4.3; Ref. 3.8 Homework: 3.6, #1-7, #8 (choose any three) [Mar 5]
Chapter 2 Introduction to articulatory phonetics
Chapter 3 Phonetics: Describing Sounds. Phonetics -study of speech sounds Sounds and symbols --use a system of written symbols --one sound represents.
Phonetics III: Dimensions of Articulation October 15, 2012.
Linguistics I Chapter 4 The Sounds of Language.
Speech Sounds of American English and Some Iranian Languages
The sounds of language Phonetics Chapter 4.
MTP I Stage Project Presentation Guided by- Presented by- Prof. Pushpak Bhattacharyya Abhijeet Padhye Department of Computer Science and Engineering Indian.
English Pronunciation Practice A Practical Course for Students of English By Wang Guizhen Faculty of English Language & Culture Guangdong University of.
Phonetics and Phonology
Phonetics Phonetics: It is the science of speech sounds. It is the study of the production and reception of speech sounds. It is concerned with the sounds.
LING 001 Introduction to Linguistics Fall 2010 Sound Structure I: Phonetics Articulatory phonetics Phonetic transcription Jan. 25.
Chapter 2 Speech Sounds Phonetics and Phonology
1 4. Consonants  Consonants are produced ‘ by a closure in the vocal tract, or by a narrowing which is so marked that air cannot escape without producing.
1 L103: Introduction to Linguistics Phonetics (consonants)
Introduction to Linguistics Ms. Suha Jawabreh Lecture # 7.
Phonological and Phonemic Awareness Jeanne M. Maggiacomo Spring 2014 EDC424.
1 Phonetics and Phonemics. 2 Phonetics and Phonemics : Phonetics The principle goal of Phonetics is to provide an exact description of every known speech.
Phonetics Class # 2 Chapter 6. Homework (Ex. 1 – page 268)  Judge [d ] or [ ǰ ]  Thomas [t]  Though [ ð ]  Easy [i]  Pneumonia [n]  Thought [ θ.
English Phonetics and Phonology
SPEECH ORGANS & ARTICULATION
CS460/626 : Natural Language Processing/Speech, NLP and the Web Lecture 28, 29: Phonetics, Phonology and Speech; introduce transliteration Pushpak Bhattacharyya.
Phonetics: Dimensions of Articulation October 13, 2010.
Phonetics 2. Phonology 2.1 The phonic medium of language Sounds which are meaningful in human communication constitute the phonic medium of language.
WEBSITE Please use this website to practice what you learn during lessons 1.
Introduction to Language Phonetics 1. Explore the relationship between sound and spelling Become familiar with International Phonetic Alphabet (IPA )
Phonetics Definition Speech Organs Consonants vs. Vowels
Matakuliah: G0922/Introduction to Linguistics Tahun: 2008 Session 3 Phonetics: Consonants.
ACE TESOL Diploma Program – London Language Institute OBJECTIVES You will understand: 1. How each of the phonemes in English is articulated 2. The differences.
Phonetics Overview/review Transcription Describing Phones Drills Overview/review Transcription Describing Phones Drills.
Ch4 – Features Features are partly acoustic partly articulatory aspects of sounds but they are used for phonology so sometimes they are created to distinguish.
CS : Speech, NLP and the Web/Topics in AI Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture-25: Vowels cntd and a “grand” assignment.
Phonetics Description and articulation of phones.
CS : Speech, NLP and the Web/Topics in AI Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture-19: Speech: Phonetics (Using Ananthakrishnan’s presentation.
Welcome to all.
ARTICULATORY PHONETICS
ARTICULATORY PHONETICS
Phonetics Dimensions of Articulation
Introduction to Linguistics
Sounds of Language: fənɛ́tɪks
Introduction to Linguistics
Consonant articulation
Essentials of English Phonetics
The articulation of consonants
Overview/review Transcription Describing Consonants
Structure of Spoken Language
Phonetics: The Sounds of Language
Speech is made up of sounds.
Manner of Articulation
Phonetics and Phonemics
CONSONANTS ARTICULATORY PHONETICS. Consonants When we pronounce consonants, the airflow out of the mouth is completely blocked, greatly restricted, or.
PHONETICS AND PHONOLOGY INTRODUCTION TO LINGUISTICS Lourna J. Baldera BSED- ENGLISH 1.
Presentation transcript:

CS : Speech, NLP and the Web/Topics in AI Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture-27: Phonology (quiz took place on 12/10/09; Lect 26 on 9/10/09 was on HMM, jointly with CS621) (Thanks to material from my studnets Abhijeet Padhye and Ankit Agarwal)

What is Phonology Phonetics: Study of sounds produced by the articulatory system (place and manner of articualtion) Phonology: Study of sound units combine to form bigger units like syllables

Places and Manners of Articulation mainly from Speech and Natural Language Processing: Jurafski and Martin, 2 nd Edition Recap of phonetics

Ancient 5 x 5 Indian Classification of Consonants Group क वर्गकखगघङ Velar च वर्गचछजझञ Palatal ट वर्गटठडढण Alveolar त वर्गतथदधन Dental प वर्गपफबभम Labial

Phonteic Symbols and IPA notation

IPA: vowels

Places of articulation

Place of Articulation Labial: Two lips coming together –[p] as in possum, [b] as in bear Dental: Tongue against the teeth –[th] of thing or the [dh] of though Alveolar: Alveolar ridge is the portion of the roof of the mouth just behind the upper teeth; tip of the tongue against the alveolar ridge. –Phones [s], [z], [t], and [d] Palatal: Roof of the mouth; blade of the tongue against this rising back of the alveolar ridge –sounds [sh] (shrimp), [ch] (china), [zh] (Asian), and [jh] (jar) Velar: Movable muscular flap at the back of the roof of the mouth; back of the tongue up against the velum –sounds [k] (cuckoo), [g] (goose), and [N] (kingfisher) Glottal: closing the glottis (by bringing the vocal folds together) –glottal stop [q] (IPA [P]) is made by

Manner of Articulation: Stops and Nasals All consonants are produced by restriction of airflow; Manner of Articulation; how the restriction is produced: –complete or partial stoppage A stop is a consonant in which airflow is completely blocked for a short time English has voiced stops like [b], [d], and [g] as well as unvoiced stops like [p], [t], and [k]. Stops are also called plosives Nasal sounds [n], [m], and [ng] are made by lowering the velum and allowing air to pass into the nasal cavity

Fricatives Fricatives, airflow is constricted but not cut off completely. The turbulent airflow that results from the constriction produces a characteristic “hissing” sound. –The English labiodental fricatives [f] and [v] are produced by pressing the lower lip against the upper teeth, allowing a restricted airflow between the upper teeth. The dental fricatives [th] and [dh] allow air to flow around the tongue between the teeth. –The alveolar fricatives [s] and [z] are produced with the tongue against the alveolar ridge, forcing air over the edge of the teeth. –In the palato-alveolar fricatives [sh] and [zh] the tongue is at the back of the alveolar ridge forcing air through a groove formed in the tongue.

Manner of Articulation: Affricates, Laterals/Liquids and Taps/Flaps Affricates are stops followed immediately by fricatives –English [ch] (chicken); Marathi chaa (e.g., gharaachaa; of the house) Lateral or Liquids: tip of the tongue up against the alveolar ridge or the teeth, with one or both sides of the tongue lowered to allow air to flow over it –[l] (learn) Tap or flap: quick motion of the tongue against the alveolar ridge –[dx] (IPA [R]) –The consonant in the middle of the word lotus ([l ow dx ax s]) is a tap in most dialects of American English –speakers of many UK dialects would use a [t] instead of a tap in this word.

Articulation of consonants: Larynx action/glottis state Vocal cords are pulled apart. The air passes freely through the glottis. This is called the voicelessness state and sounds produced with this configuration of the vocal cords are called voiceless: p t k f θ s ʃ t ʃ Vocal cords are pulled close together. The air passing through the glottis causes the vocal cords to vibrate. This is called the voicing state and sounds produced with this configuration of the vocal cords are called voiced: b d g v ð z ʒ d ʒ Vocal cords are apart at the back and pulled together at the front. This is called the whisper state. Vocal cords assume the voicing state but are relaxed. This is called the murmur state.

Pushpak Bhattacharyya Vowels (1/2)

Pushpak Bhattacharyya Vowels (2/2)

Phonology: Syllables

Basic of syllables “Syllable is a unit of spoken language consisting of a single uninterrupted sound formed generally by a Vowel and preceded or followed by one or more consonants.”  Vowels are the heart of a syllable (Most Sonorous Element) (svayam raajate iti svaraH)  Consonants act as sounds attached to vowels.

Syllable structure  A syllable consists of 3 major parts:-  Onset (C)  Nucleus (V)  Coda (C)  Vowels sit in the Nucleus of a syllable  Consonants may get attached as Onset or Coda.  Basic structure - CV

Possible syllable structures  The Nucleus is always present  Onset and Coda may be absent  Possible structures  V  CV  VC  CVC

syllable theories  Prominence Theory  E.g. entertaining /entəte ɪ n ɪ ŋ/  The peaks of prominence: vowels /e ə e ɪ ɪ /  Number of syllables: 4  Chest Pulse Theory  Based on muscular activities  Sonority Theory  Based on relative soundness of segment within words

Introduction to sonority theory “The Sonority of a sound is its loudness relative to other sounds with the same length, stress and speech.”  Some sounds are more sonorous  Words in a language can be divided into syllables  Sonority theory distinguishes syllables on the basis of sounds.

Sonority hierarchy  Defined on the basis of amount of sound associated  The sonority hierarchy is as follows:-  Vowels (a, e, i, o, u)  Liquids (y, r, l, v)  Nasals (n, m)  Fricatives (s, z, f,…..sh, th etc.)  Affricates (ch, j)  Stops (b, d, g, p, t, k)

Sonority scale  Obstruents can be further classified into:-  Fricatives  Affricates  Stops

Sonority theory & syllables “A Syllable is a cluster of sonority, defined by a sonority peak acting as a structural magnet to the surrounding lower sonority elements.”  Represented as waves of sonority or Sonority Profile of that syllable Nucleus Onset Coda

Sonority sequencing principle “The Sonority Profile of a syllable must rise until its Peak(Nucleus), and then fall.” Peak (Nucleus) Onset Coda

examples  ABHIJEET  Sonority Profile 1 AIE E H J B T  Sonority Profile 2 AIE E H J B T

Maximal onset principle “The Intervocalic consonants are maximally assigned to the Onsets of syllables in conformity with Universal and Language-Specific Conditions.”  Determines underlying syllable division  Example  DIPLOMA DIPLOMA &DIPLOMA

Syllable Structure: amore detailed look Count of no. of syllables in a word is roughly/intuitively the no. of vocalic segments in a word. Thus, presence of a vowel is an obligatory element in the structure of a syllable. This vowel is called “nucleus”. Basic Configuration: (C)V(C). Part of syllable preceding the nucleus is called the onset. Elements coming after the nucleus are called the coda. Nucleus and coda together are referred to as the rhyme. S ≡ Syllable, O ≡ Onset R ≡ Rhyme, N ≡ Nucleus Co ≡ Coda

Syllable Structure: Examples ‘word’ ‘sprint’

Syllable Structure: Examples ‘may’ ‘opt’ ‘air’  No Coda.  No Onset.  No Coda, No Onset.

Syllable Structure Open Syllable: ends in vowel Closed syllable: ends in consonant or consonant cluster Light Syllable: A syllable which is open and ends in a short vowel –General Description – CV. –Example, ‘air’. Heavy Syllable: Closed syllables or syllables ending in diphthong –Example: ‘opt’ –Example, ‘may’

Syllabification: Determining Syllable Boundaries Given a string of syllables (word), what is the coda of one and the onset of another? In a sequence such as VCV, where V is any vowel and C is any consonant, is the medial C the coda of the first syllable (VC.V) or the onset of the second syllable (V.CV)? To determine the correct groupings, there are some rules, two of them being the most important and significant: –Maximal Onset Principle, –Sonority Hierarchy

Discussion on the assignment

Data The Carnegie Mellon University Pronouncing Dictionary machine-readable pronunciation dictionary for North American English that contains over 125,000 words and their transcriptions. The current phoneme set contains 39 phonemes

“Parallel” Corpus Phoneme Example Translation AA odd AA D AE at AE T AH hut HH AH T AO ought AO T AW cow K AW AY hide HH AY D B be B IY

“Parallel” Corpus cntd Phoneme Example Translation CH cheese CH IY Z D dee D IY DH thee DH IY EH Ed EH D ER hurt HH ER T EY ate EY T F fee F IY G green G R IY N HH he HH IY IH it IH T IY eat IY T JH gee JH IY

The tasks First obtain the Carnegie Mellon University's Pronouncing Dictionary Train and Test the following Statistical Machine Learning Algorithms HMM - For HMM you can use either Natural Language Toolkit or you can use GIZA++ with MOSES. MEMM - For MEMM use MaxEnt package. CRF - Use CRF++

Tasks (cntd) Feed Forward Neural Network. For this you can use either SCILAB or JavaNNS Train and Test the following Knowledge Based Learning Algorithms –Decision Tree –Decision List For the Knowledge based learning Algorithms use the Weka package.

Tasks (cntd) Report all the results using 5-fold cross Validation Compare all the results obtained in the previous steps in terms of –Precision –Recall –F-Score Finally do a detailed error analysis.