Statistical learning, cross- constraints, and the acquisition of speech categories: a computational approach. Joseph Toscano & Bob McMurray Psychology.

Slides:



Advertisements
Similar presentations
Change-Point Detection Techniques for Piecewise Locally Stationary Time Series Michael Last National Institute of Statistical Sciences Talk for Midyear.
Advertisements

Psych 156A/ Ling 150: Acquisition of Language II Lecture 5 Sounds of Words.
Tone perception and production by Cantonese-speaking and English- speaking L2 learners of Mandarin Chinese Yen-Chen Hao Indiana University.
{ “Age” Effects on Second Language Acquisition Examination of 4 hypotheses related to age and language learning
Voice Onset Time (VOT) An Animated and Narrated Glossary of Terms used in Linguistics presents.
Psych 156A/ Ling 150: Acquisition of Language II Lecture 3 Sounds.
Infant sensitivity to distributional information can affect phonetic discrimination Jessica Maye, Janet F. Werker, LouAnn Gerken A brief article from Cognition.
Probability Distributions CSLU 2850.Lo1 Spring 2008 Cameron McInally Fordham University May contain work from the Creative Commons.
Ling 240: Language and Mind Acquisition of Phonology.
Speech perception 2 Perceptual organization of speech.
Method Participants Fifty-six undergraduate students (age range 19-37), 14 in each of the four language groups (monolingual, Spanish-English bilingual,
Psych 156A/ Ling 150: Acquisition of Language II Lecture 4 Sounds.
The Perception of Speech. Speech is for rapid communication Speech is composed of units of sound called phonemes –examples of phonemes: /ba/ in bat, /pa/
Perception of syllable prominence by listeners with and without competence in the tested language Anders Eriksson 1, Esther Grabe 2 & Hartmut Traunmüller.
Phonetic Detail in Developing Lexicon Daniel Swingley 2010/11/051Presented by T.Y. Chen in 599.
Machine Learning: Connectionist McCulloch-Pitts Neuron Perceptrons Multilayer Networks Support Vector Machines Feedback Networks Hopfield Networks.
Sonority as a Basis for Rhythmic Class Discrimination Antonio Galves, USP. Jesus Garcia, USP. Denise Duarte, USP and UFGo. Charlotte Galves, UNICAMP.
Psych 156A/ Ling 150: Acquisition of Language II
Characterisation of individuals’ formant dynamics using polynomial equations Kirsty McDougall Department of Linguistics University of Cambridge
Psych 56L/ Ling 51: Acquisition of Language Lecture 8 Phonological Development III.
Acoustic Continua and Phonetic Categories Frequency - Tones.
1 LM Approaches to Filtering Richard Schwartz, BBN LM/IR ARDA 2002 September 11-12, 2002 UMASS.
Second Language Proficiency Places Cognitive Constraints on Sentence Processing Noriko Hoshino Department of Psychology The Pennsylvania State University.
Pattern Recognition. Introduction. Definitions.. Recognition process. Recognition process relates input signal to the stored concepts about the object.
Auditory-acoustic relations and effects on language inventory Carrie Niziolek [carrien] may 2004.
Today Concepts underlying inferential statistics
BPT 2423 – STATISTICAL PROCESS CONTROL.  Frequency Distribution  Normal Distribution / Probability  Areas Under The Normal Curve  Application of Normal.
Psych 56L/ Ling 51: Acquisition of Language Lecture 8 Phonological Development III.
Psych 156A/ Ling 150: Psychology of Language Learning
Sebastián-Gallés, N. & Bosch, L. (2009) Developmental shift in the discrimination of vowel contrasts in bilingual infants: is the distributional account.
Speech Perception 4/6/00 Acoustic-Perceptual Invariance in Speech Perceptual Constancy or Perceptual Invariance: –Perpetual constancy is necessary, however,
Learning Objectives Copyright © 2004 John Wiley & Sons, Inc. Sample Size Determination CHAPTER Eleven.
Infant Speech Perception & Language Processing. Languages of the World Similar and Different on many features Similarities –Arbitrary mapping of sound.
Psych 156A/ Ling 150: Psychology of Language Learning Lecture 5 Sounds III.
VOT trumps other measures in predicting Korean children’s early mastery of tense stops Eun Jong Kong Mary E. Beckman Jan Edwards LSA2010 January 7 th.
Adaptive Design of Speech Sound Systems Randy Diehl In collaboration with Bjőrn Lindblom, Carl Creeger, Lori Holt, and Andrew Lotto.
Learning Objectives Copyright © 2002 South-Western/Thomson Learning Sample Size Determination CHAPTER thirteen.
Is phonetic variation represented in memory for pitch accents ? Amelia E. Kimball Jennifer Cole Gary Dell Stefanie Shattuck-Hufnagel ETAP 3 May 28, 2015.
Adele E. Goldberg. How argument structure constructions are learned.
Acoustic Cues to Laryngeal Contrasts in Hindi Susan Jackson and Stephen Winters University of Calgary Acoustics Week in Canada October 14,
1. Background Evidence of phonetic perception during the first year of life: from language-universal listeners to native listeners: Consonants and vowels:
Ch 3 Slide 1 Is there a connection between phonemes and speakers’ perception of phonetic differences? (audibility of fine distinctions) Due to phonology,
Compression No. 1  Seattle Pacific University Data Compression Kevin Bolding Electrical Engineering Seattle Pacific University.
SPEECH AND WRITING. Spoken language and speech communication In a normal speech communication a speaker tries to influence on a listener by making him:
3308 First Language acquisition Acquisition of sounds Perception Sook Whan Cho Fall, 2012.
Psych 156A/ Ling 150: Psychology of Language Learning Lecture 6 Sounds of Words I.
Chapter 9 Probability. 2 More Statistical Notation  Chance is expressed as a percentage  Probability is expressed as a decimal  The symbol for probability.
The long-term retention of fine- grained phonetic details: evidence from a second language voice identification training task Steve Winters CAA Presentation.
L i a b l eh kC o m p u t i n gL a b o r a t o r y Test Economics for Homogeneous Manycore Systems Lin Huang† and Qiang Xu†‡ †CUhk REliable computing laboratory.
Psych 156A/ Ling 150: Psychology of Language Learning Lecture 3 Sounds II.
Acoustic Continua and Phonetic Categories Frequency - Tones.
1 Cross-language evidence for three factors in speech perception Sandra Anacleto uOttawa.
Neurophysiologic correlates of cross-language phonetic perception LING 7912 Professor Nina Kazanina.
Psych 156A/ Ling 150: Psychology of Language Learning Lecture 2 Sounds I.
Source of change –Combination of feedback and explain- experimenter’s-reasoning led to greater learning than feedback alone Path of change –Children relied.
Number Sense Disambiguation Stuart Moore Supervised by: Anna Korhonen (Computer Lab)‏ Sabine Buchholz (Toshiba CRL)‏
Fita Ariyana Rombel 7 (Thursday 9 am).
Stop-Consonant Perception in 7.5-month-olds: Evidence for gradient categories Bob McMurray & Richard N. Aslin Department of Brain and Cognitive Sciences.
Psych 156A/ Ling 150: Psychology of Language Learning Lecture 3 Sounds I.
Maintainance and Reliability Pertemuan 26 Mata kuliah: J Manajemen Operasional Tahun: 2010.
Quality Control Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill.
Against formal phonology (Port and Leary).  Generative phonology assumes:  Units (phones) are discrete (not continuous, not variable)  Phonetic space.
Exemplar Dynamics VOT of stops varies in languages VOT of stops varies in languages So people learn language specific VOT So people learn language specific.
2nd Language Learning Chapter 2 Lecture 4.
Properties of the Normal Distribution
Psychology 202a Advanced Psychological Statistics
Abstraction versus exemplars
Brain Mechanisms in Early Language Acquisition
Distribution Model A smooth representation of the distribution of ALL individuals in the POPULATION Quantitative Value
Noriko Hoshino Department of Psychology
Presentation transcript:

Statistical learning, cross- constraints, and the acquisition of speech categories: a computational approach. Joseph Toscano & Bob McMurray Psychology Department University of Iowa

Acknowledgements Acknowledgements: –Dick Aslin –The MACLab

Learning phonetic categories Infants are initially able to discriminate many different phonetic contrasts. They must learn which ones are relevant to their native language. This is accomplished within the first year of life, and infants quickly adopt the categories present in their language (Werker & Tees, 1984).

Learning phonetic categories What is needed for statistical learning? A signal and a mechanism –Availability of statistics (signal) –Sensitivity to statistics (mechanism) continuous sensitivity to VOT ability to track frequencies and build clusters

Statistics in the signal What statistical information is available? Lisker & Abramson, 1964 did a cross-language analysis of speech –Measured voice-onset time (VOT) from several speakers in different languages

Statistics in the signal The statistics are available in the signal Tamil Cantonese English

Sensitivity to statistics Are infants sensitive to statistics in speech? –Maye et al., 2002 asked this –Two groups of infants Infants are sensitive to within-category detail (McMurray & Aslin, 2005)

Learning phonetic categories Infants can obtain phoneme categories from exposure to tokens in the speech signal VOT frequency 0ms50ms +voice-voice

Statistical Learning Model Statistical learning in a computational model What do we need the model to do: –Show learnability. Are statistics sufficient? –Developmental timecourse. –Implications for speech in general. –Can model explain more than category learning?

Statistical Learning Model Clusters of VOTs are Gaussian distributions Tamil Cantonese English

Statistical Learning Model Gaussians defined by three parameters: Each phoneme category can be represented by these three parameters  VOT  Φ μ – the center of the distribution σ – the spread of the distribution Φ – the height of the distribution, reflected by the probability of a particular value

Statistical Learning Model Modeling approach: mixture of Gaussians /b//p/

Statistical Learning Model Gaussian distributions represent the probability of occurrence of a particular feature (e.g. VOT) Start with a large number of Gaussians to reflect many different values for the feature. /b//p/

Statistical Learning Model Learning occurs via gradient descent –Take a single data point as input –Adjust the location and width of the distribution by a certain amount, defined by a learning rule   Move the center of the dist closer to the data point Make the dist wider to accommodate the data point

Statistical Learning Model Learning rule: { Probability of a particular point Proportion of space under that Gaussian Equation of a Gaussian = x

Can the model learn? Can the model learn speech categories?

Can the model learn? The model in action Fails to learn correct number of categories –Too many distributions under each curve –Is this a problem? Maybe. Solution: Introduce competition Competition through winner-take-all strategy –Only the closest matching Gaussian is adjusted

Does learning need to be constrained? Can the model learn speech categories?Yes. Does learning need to be constrained?

Unconstrained feature space –Starting VOTs distributed from to ms –Model fails to learn –Similar to a situation in which the model has too few starting distributions

Does learning need to be constrained? Constrained feature space –Starting VOTs distributed from -100 to +100 ms –Within the range of actual voice onset times used in language.

Are constraints linguistic? Can the model learn speech categories?Yes. Does learning need to be constrained?Yes. Do constraints need to be linguistic?

Are constraints linguistic? Cross-linguistic constraints –Combined data from languages used in Lisker & Abramson, 1964, and several other languages

Are constraints linguistic? VOTs from: –English –Thai –Spanish –Cantonese –Korean –Navajo –Dutch –Hungarian –Tamil –Eastern Armenian –Hindi –Marathi –French

Test the model with two different sets of starting states: Cross-linguistic: based on distribution of VOTs across languages Random normally distributed: centered around 0ms, range ~ - 100ms to +100ms VOT

Test the model with two different sets of starting states: Cross-linguistic: based on distribution of VOTs across languages Random normally distributed: centered around 0ms, range ~ - 100ms to +100ms

Are linguistic constraints helpful? Can the model learn speech categories?Yes. Does learning need to be constrained?Yes. Do constraints need to be linguistic?No. Do cross-language constraints help?

Are linguistic constraints helpful? This is the part of the talk that I don’t have any slides for yet.

What do infants do? Can the model learn speech categories?Yes. Does learning need to be constrained?Yes. Do constraints need to be linguistic?No. Do cross-language constraints help? Sometimes. What do infants do?

As infants get older, their ability to discriminate different VOT contrasts decreases. –Initially able to discriminate many contrasts –Eventually discriminate only those of their native language

What do infants do? Each model’s discrimination over time –Random normal: decreases –Cross-linguistic: slight increase

What do infants do? Cross-linguistic starting states lead to faster category acquisition Why wouldn’t infants take advantage of this? –Too great a risk of over-generalization –Better to take more time to do the job right than to do it too quickly