Sound Categories.

Slides:

Advertisements

Similar presentations

Psych 156A/ Ling 150: Acquisition of Language II Lecture 5 Sounds of Words.

Advertisements

09/01/10 Kuhl et al. (1992) Presentation Kuhl, P. K., Williams, K. A., Lacerda, F., Stevens, K. N., & Lindblom, B. (1992) Linguistic experience alters.

Tone perception and production by Cantonese-speaking and English- speaking L2 learners of Mandarin Chinese Yen-Chen Hao Indiana University.

Plasticity, exemplars, and the perceptual equivalence of ‘defective’ and non-defective /r/ realisations Rachael-Anne Knight & Mark J. Jones.

Human Speech Recognition Julia Hirschberg CS4706 (thanks to John-Paul Hosum for some slides)

18 and 24-month-olds use syntactic knowledge of functional categories for determining meaning and reference Yarden Kedar Marianella Casasola Barbara Lust.

Psych 156A/ Ling 150: Acquisition of Language II Lecture 3 Sounds.

Infant sensitivity to distributional information can affect phonetic discrimination Jessica Maye, Janet F. Werker, LouAnn Gerken A brief article from Cognition.

Ling 240: Language and Mind Acquisition of Phonology.

Speech perception 2 Perceptual organization of speech.

Development of Speech Perception. Issues in the development of speech perception Are the mechanisms peculiar to speech perception evident in young infants?

Psych 156A/ Ling 150: Acquisition of Language II Lecture 4 Sounds.

Phonetic Detail in Developing Lexicon Daniel Swingley 2010/11/051Presented by T.Y. Chen in 599.

CSD 2230 HUMAN COMMUNICATION DISORDERS Topic 2 Normal Communication Development and Communication Across the Lifespan.

Psych 156A/ Ling 150: Acquisition of Language II

Psych 156A/ Ling 150: Acquisition of Language II Lecture 4 Sounds of Words.

Psych 56L/ Ling 51: Acquisition of Language Lecture 8 Phonological Development III.

Distributional Cues to Word Boundaries: Context Is Important Sharon Goldwater Stanford University Tom Griffiths UC Berkeley Mark Johnson Microsoft Research/

Language Acquisition Species-specific, species-universal accomplishment Central issue for cognitive science Important distinction between language comprehension.

Acoustic Continua and Phonetic Categories Frequency - Tones.

Chapter three Phonology

Adrienne Moore section COGS1

Psycholinguistics Lecture 7

CSD 2230 HUMAN COMMUNICATION DISORDERS

A Lecture about… Phonetic Acquisition Veronica Weiner May, 2006.

Psych 156A/ Ling 150: Psychology of Language Learning

Sebastián-Gallés, N. & Bosch, L. (2009) Developmental shift in the discrimination of vowel contrasts in bilingual infants: is the distributional account.

Background Infants and toddlers have detailed representations for their known vocabulary items Consonants (e.g., Swingley & Aslin, 2000; Fennel & Werker,

Speech Perception 4/6/00 Acoustic-Perceptual Invariance in Speech Perceptual Constancy or Perceptual Invariance: –Perpetual constancy is necessary, however,

Infant Speech Perception & Language Processing. Languages of the World Similar and Different on many features Similarities –Arbitrary mapping of sound.

Psych 156A/ Ling 150: Psychology of Language Learning Lecture 5 Sounds III.

Statistical learning, cross- constraints, and the acquisition of speech categories: a computational approach. Joseph Toscano & Bob McMurray Psychology.

1 Speech Perception 3/30/00. 2 Speech Perception How do we perceive speech? –Multifaceted process –Not fully understood –Models & theories attempt to.

A chicken-and-egg problem

Building a Lexicon Statistical learning & recognizing words.

Adaptive Design of Speech Sound Systems Randy Diehl In collaboration with Bjőrn Lindblom, Carl Creeger, Lori Holt, and Andrew Lotto.

Acoustic Continua and Phonetic Categories Frequency - Tones.

Acoustic Cues to Laryngeal Contrasts in Hindi Susan Jackson and Stephen Winters University of Calgary Acoustics Week in Canada October 14,

1. Background Evidence of phonetic perception during the first year of life: from language-universal listeners to native listeners: Consonants and vowels:

Ch 3 Slide 1 Is there a connection between phonemes and speakers’ perception of phonetic differences? (audibility of fine distinctions) Due to phonology,

SPEECH PERCEPTION DAY 16 – OCT 2, 2013 Brain & Language LING NSCI Harry Howard Tulane University.

Assessment of Phonology

Psych 156A/ Ling 150: Psychology of Language Learning Lecture 7 Sounds of Words II.

Psych 156A/ Ling 150: Psychology of Language Learning Lecture 6 Sounds of Words I.

Sensation & Perception

The long-term retention of fine- grained phonetic details: evidence from a second language voice identification training task Steve Winters CAA Presentation.

Psych 156A/ Ling 150: Psychology of Language Learning Lecture 3 Sounds II.

Sounds and speech perception Productivity of language Speech sounds Speech perception Integration of information.

Acoustic Continua and Phonetic Categories Frequency - Tones.

1 Cross-language evidence for three factors in speech perception Sandra Anacleto uOttawa.

CSD 2230 INTRODUCTION TO HUMAN COMMUNICATION DISORDERS Normal Sound Perception, Speech Perception, and Auditory Characteristics at the Boundaries of the.

Neurophysiologic correlates of cross-language phonetic perception LING 7912 Professor Nina Kazanina.

Psych 156A/ Ling 150: Psychology of Language Learning Lecture 2 Sounds I.

Source of change –Combination of feedback and explain- experimenter’s-reasoning led to greater learning than feedback alone Path of change –Children relied.

Infant Perception. William James, 1890 “The baby, assailed by eyes, ears, nose, skin and entrails all at once, feels it all as one great blooming, buzzing.

What infants bring to language acquisition Limitations of Motherese & First steps in Word Learning.

A Psycholinguistic Perspective on Child Phonology Sharon Peperkamp Emmanuel Dupoux Laboratoire de Sciences Cognitives et Psycholinguistique, EHESS-CNRS,

Sound Categories Frequency - Tones Frequency - Complex Sounds.

Psycholinguistics I LING 640 What is psycholinguistics about?

Psych 156A/ Ling 150: Psychology of Language Learning Lecture 3 Sounds I.

AUDITORY CORTEX 1 SEPT 11, 2015 – DAY 8 Brain & Language LING NSCI Fall 2015.

Speech Perception in Infants Peter D. Eimas, Einar R. Siqueland, Peter Jusczyk, and James Vigorito 1971.

Constraints on definite article alternation in speech production: To “thee” or not to “thee”? By M. GARETH GASKELL, HELEN COX, KATHERINE FOLEY, HELEN GRIEVE,

Sentence Durations and Accentedness Judgments

PSYC 206 Lifespan Development Bilge Yagmurlu.

Step 1: Memorize IPA - practice quiz today - real quiz on Tuesday (over consonants)! Phonology is about looking for patterns and arguing your assessment.

Theories of Language Development

Susan Geffen, Suzanne Curtin and Susan Graham

Quantifying Sensitivity

Job Google Job Title: Linguistic Project Manager

Presentation transcript:

Sound Categories

Frequency - Tones

Frequency - Tones

Frequency - Tones

Frequency - Tones

Frequency - Complex Sounds

Frequency - Complex Sounds

Frequency - Vowels Vowels combine acoustic energy at a number of different frequencies Different vowels ([a], [i], [u] etc.) contain acoustic energy at different frequencies Listeners must perform a ‘frequency analysis’ of vowels in order to identify them (Fourier Analysis)

Time --> Amplitude Frequency Any function can be decomposed in terms of sinusoidal (= sine wave) functions (‘basis functions’) of different frequencies that can be recombined to obtain the original function. [Wikipedia entry on Fourier Analysis] Time --> Joseph Fourier (1768-1830) Amplitude Frequency

Frequency - Male Vowels

Frequency - Male Vowels

Frequency - Female Vowels

Frequency - Female Vowels

Schedule Lab #1A – Classic speech perception tasks individual data: collect by Weds Sept 5th due Monday Sept 12th Lab #1B - New speech perception tasks Task 1: rapid sequence recall (Dupoux et al. 2008) Task 2: implicit discrimination (Navarra et al. 2005) collect individual data by Monday Sept 17th– email to lalithab@umd.edu group data files available shortly thereafter – team analysis welcome/encouraged due Monday Sept 24th

Timing - Voicing

Voice Onset Time (VOT) 60 msec

English VOT production Not uniform 2 categories

Perceiving VOT ‘Categorical Perception’

Discrimination Same/Different

Discrimination Same/Different 0ms 60ms

Discrimination Same/Different 0ms 60ms Same/Different

Discrimination Same/Different 0ms 60ms Same/Different 0ms 10ms

Discrimination Same/Different 0ms 60ms Same/Different 0ms 10ms

Discrimination Same/Different 0ms 60ms Same/Different 0ms 10ms

Discrimination Same/Different 0ms 60ms Same/Different Why is this pair difficult? 0ms 10ms Same/Different 40ms 40ms

Discrimination Same/Different 0ms 60ms Same/Different Why is this pair difficult? 0ms 10ms (i) Acoustically similar? (ii) Same Category? Same/Different 40ms 40ms

Discrimination A More Systematic Test Same/Different 0ms 60ms Why is this pair difficult? 0ms 10ms (i) Acoustically similar? (ii) Same Category? Same/Different 40ms 40ms

Discrimination A More Systematic Test Same/Different 0ms 60ms 0ms 20ms

Discrimination A More Systematic Test D D D T T T Same/Different 0ms 60ms 0ms 20ms D 20ms 40ms T Same/Different 0ms 10ms T T 40ms 60ms Same/Different Within-Category Discrimination is Hard 40ms 40ms

Cross-language Differences

Cross-language Differences

Cross-Language Differences English vs. Japanese R-L

Cross-Language Differences English vs. Hindi alveolar [d] retroflex [D] ?

Russian -40ms -30ms -20ms -10ms 0ms 10ms

Kazanina et al., 2006 Proceedings of the National Academy of Sciences, 103, 11381-6

Discrimination A More Systematic Test D D D T T T Same/Different 0ms 60ms 0ms 20ms D 20ms 40ms T Same/Different 0ms 10ms T T 40ms 60ms Same/Different Within-Category Discrimination is Hard 40ms 40ms

Quantifying Sensitivity

Quantifying Sensitivity Response bias Two measures of discrimination Accuracy: how often is the judge correct? Sensitivity: how well does the judge distinguish the categories? Quantifying sensitivity Hits Misses False Alarms Correct Rejections Compare p(H) against p(FA)

Quantifying Sensitivity Is one of these more impressive? Harder to obtain by chance? p(H) = 0.75, p(FA) = 0.25 p(H) = 0.99, p(FA) = 0.49 A measure that amplifies small percentage differences at extremes z-scores Both yield the same difference between p(H) and p(FA). But the second one is harder to obtain by chance.

√( ) Normal Distribution Dispersion around mean Standard Deviation A measure of dispersion around the mean. Mean (µ) √( ) ∑(x - µ)2 n Carl Friederich Gauss (1777-1855)

The Empirical Rule 1 s.d. from mean: 68% of data

Normal Distribution Standard deviation Heights of American  = 2.5 inches Heights of American Females, aged 18-24 Mean (µ) 65.5 inches

Quantifying Sensitivity A z-score is a reexpression of a data point in units of standard deviations. (Sometimes also known as standard score) In z-score data, µ = 0,  = 1 Sensitivity score d’ = z(H) - z(FA)

see sensitivity worksheet sensitivity.xls

Quantifying Differences

(Näätänen et al. 1997) (Aoshima et al. 2004) (Maye et al. 2002)

√( ) Normal Distribution Dispersion around mean Standard Deviation A measure of dispersion around the mean. Mean (µ) √( ) ∑(x - µ)2 n

The Empirical Rule 1 s.d. from mean: 68% of data

If we observe 1 individual, how likely is it that his score is at least 2 s.d. from the mean? Put differently, if we observe somebody whose score is 2 s.d. or more from the population mean, how likely is it that the person is drawn from that population?

If we observe 2 people, how likely is it that they both fall 2 s. d If we observe 2 people, how likely is it that they both fall 2 s.d. or more from the mean? …and if we observe 10 people, how likely is it that their mean score is 2 s.d. from the group mean? If we do find such a group, they’re probably from a different population

Standard Error is the Standard Deviation of sample means.

If we observe a group whose mean differs from the population mean by 2 s.e., how likely is it that this group was drawn from the same population?

Development of Speech Perception in Infancy

Voice Onset Time (VOT) 60 msec

Perceiving VOT ‘Categorical Perception’

Discrimination A More Systematic Test D D D T T T Same/Different 0ms 60ms 0ms 20ms D 20ms 40ms T Same/Different 0ms 10ms T T 40ms 60ms Same/Different Within-Category Discrimination is Hard 40ms 40ms

Abstraction Representations Behaviors Sound encodings - clearly non-symbolic, but otherwise unclear Phonetic categories Memorized symbols: /k/ /æ/ /t/ Behaviors Successful discrimination Unsuccessful discrimination ‘Step-like’ identification functions Grouping different sounds

Let’s Learn Inuktitut! Video: Nunavik: Building on the Knowledge of Ancestors

Vowels Consonants

Three Classics

Development of Speech Perception Unusually well described in past 30 years Learning theories exist, and can be tested… Jakobson’s suggestion: children add feature contrasts to their phonological inventory during development Roman Jakobson, 1896-1982 Kindersprache, Aphasie und allgemeine Lautgesetze, 1941

Developmental Differentiation Universal Phonetics Native Lg. Phonetics Native Lg. Phonology 0 months 6 months 12 months 18 months

#1 - Infant Categorical Perception Eimas, Siqueland, Jusczyk & Vigorito, 1971

Discrimination A More Systematic Test D D D T T T Same/Different 0ms 60ms 0ms 20ms D 20ms 40ms T Same/Different 0ms 10ms T T 40ms 60ms Same/Different Within-Category Discrimination is Hard 40ms 40ms

high amplitude sucking non-nutritive sucking

English VOT Perception To Test 2-month olds High Amplitude Sucking Eimas et al. 1971

General Infant Abilities Infants’ show Categorical Perception of speech sounds - at 2 months and earlier Discriminate a wide range of speech contrasts (voicing, place, manner, etc.) Discriminate Non-Native speech contrasts e.g., Japanese babies discriminate r-l e.g., Canadian babies discriminate d-D [these findings based mostly on looking/headturn studies w/ 6 month olds]

Universal Listeners Infants may be able to discriminate all speech contrasts from the languages of the world!

How can they do this? Innate speech-processing capacity? General properties of auditory system?

What About Non-Humans? Chinchillas show categorical perception of voicing contrasts! PK Kuhl & JD Miller, Science, 190, 69-72 (1975)

Suitability of Animal Models More recent findings… Auditory perceptual abilities in macaque monkeys and humans differ in various ways Discrimination sensitivity for b-p continua is more fine-grained in (adult) humans (Sinnott & Adams, JASA, 1987) Sensitivity to cues to r-l distinctions is different, although trading relations are observed in humans and macaques alike (Sinnott & Brown, JASA, 1997) Some differences in vowel sensitivity… Joan Sinnott, U. of S. Alabama

#2 - Becoming a Native Listener Werker & Tees, 1984

When does Change Occur? About 10 months Janet Werker U. of British Columbia Conditioned Headturn Procedure

When does Change Occur? Hindi and Salish contrasts tested on English kids Janet Werker U. of British Columbia Conditioned Headturn Procedure

What do Werker’s results show? Is this the beginning of efficient memory representations (phonological categories)? Are the infants learning words? Or something else?

Korean has [l] & [r] [rupi] “ruby” [kiri] “road” [saram] “person” [irumi] “name” [ratio] “radio” [mul] “water” [pal] “big” [s\ul] “Seoul” [ilkop] “seven” [ipalsa] “barber”

#3 - What, no minimal pairs? Stager & Werker, 1997

A Learning Theory… How do we find out the contrastive phonemes of a language? Minimal Pairs

Word Learning Stager & Werker 1997 ‘bih’ vs. ‘dih’ and ‘lif’ vs. ‘neem’

PRETEST

HABITUATION TEST SAME SWITCH

Word learning results Exp 2 vs 4

Why Yearlings Fail on Minimal Pairs They fail specifically when the task requires word-learning They do know the sounds But they fail to use the detail needed for minimal pairs to store words in memory !!??

One-Year Olds Again One-year olds know the surface sound patterns of the language One-year olds do not yet know which sounds are used contrastively in the language… …and which sounds simply reflect allophonic variation One-year olds need to learn contrasts

Maybe not so bad after all... Children learn the feature contrasts of their language Children may learn gradually, adding features over the course of development Phonetic knowledge does not entail phonological knowledge Roman Jakobson, 1896-1982

Werker et al. 2002 14 months 17 months 20 months 14 17 20 60 300 600

Swingley & Aslin, 2002 14-month olds did recognize mispronunciations of familiar words Dan Swingley, UPenn

Alternatives to Reviving Jakobson Word-learning is very hard for younger children, so detail is initially missed when they first learn words Many exposures are needed to learn detailed word forms at early stages of word-learning Success on the Werker/Stager task seems to be related to the vocabulary spurt, rapid growth in vocabulary after ~50 words

So how do infants learn…? Some possibilities: ‘Use it or lose it’ – they stop paying attention to contrasts that they don’t need for the ambient language Minimal pairs (e.g., rock vs. lock) – requires word meanings Acoustic distributions of sounds, requires no word knowledge Seeking contextually conditioned variation, e.g., Korean r/l contrast

(Dietrich, Swingley, & Werker 2007)

Exp 1: tam - ta:m Exp 2: tæm - tæ:m Exp 3: ta/æm - tem Length factor ~1.8-2.0

(Dietrich, Swingley, & Werker 2007)

Slides: Swingley 2006, ICIS

Slides: Swingley 2006, ICIS

Slides: Swingley 2006, ICIS

5 hours’ exposure to Mandarin ± human interaction [2003, Proceedings of the National Academy of Sciences]

Alveo-palatals affricate fricative

Jessica Maye, Northwestern U.

Infants at age 6-8 months are still ‘universal listeners’, cf Infants at age 6-8 months are still ‘universal listeners’, cf. Pegg & Werker (1997) Infants trained on bi-modal distribution show ‘novelty preference’ for test sequence with fully alternating sequence How could the proposal scale up?

p(a) = p(b) p(a) = 2 x p(b)

1.0 .5 .25 .1

Slides: Swingley 2006, ICIS

Slides: Swingley 2006, ICIS

Fenson et al. 2000

toast hat ants tooth table television blanket outside plant wait today fast hurt soft out stroller kitty water babysitter pretty patty cake bottle kitchen don’t night (night) bird dog duck doll bread candy head dish radio outside feed today dark MacArthur Short CDI - 89 items

Fei Xu, Berkeley

Xu & Carey 1996 10 mo.: no surprise 12 mo.: surprise --> “10 month olds do not represent basic sortal/kind concepts” Xu 2002 Add words! 9 mo.:

Fulkerson & Waxman 2007 12 months 6 months Categorization measured by novelty preference score: % looks to novel / total; categorization should imply novelty preference 12 months 6 months Words µ = .59, p = .007 µ = .63, p < .001 Tones µ = .53, p = .2 µ = .54, p = .2

Yeung & Werker 2009 Naturally produced Hindi syllables Dental vs. retroflex Familiarize sound-object links Test sound discrimination only Exp1: consistent links Exp2: inconsistent links Effect of Type (±alternating) Exp1: F(1,18) = 5.74, p < .05 Exp2: F(1,18) = 0.53, p = .47

(Feldman, Griffiths, & Morgan, 2009)

“Simulations demonstrate that using information from segmented words to constrain phonetic category acquisition allows more robust category learning from fewer data points, due to the inter- active learner’s ability to use information about which words contain particular speech sounds to disambiguate overlapping categories.” (Feldman et al. 2009)

Analysis Hypothesis testing (null HT vs. Bayesian) Linking probabilities to hypotheses Weighted binomial distributions

Questions Combining words and sound distributions: what do learners need to know? Too-many-colors problem: how to combine across words? Why does interaction matter (Kuhl et al. 2003 on Mandarin)? What does this predict about 1-year olds’ knowledge of phonological contrast? What changes between 12 & 18 months?

Invariance (Jusczyk 1997)

Training on [g-k] or [d-t], generalization across place of articulation. (Dis-)habituation paradigm. [Maye & Weiss, 2003]

So how do infants learn…? Phoneme categories and alternations Perhaps more like a phonologist than like a LING101 student - look directly for systematic relations among phones Gradual articulation of contrastive information encoded in lexical entries Much remains to be understood

Abstraction in Infant Speech Encoding From a very early age infants show great sensitivity to speech sounds, possibly already with some ‘category-like’ structure Although native-like sensitivity develops early (< 1 year), this should be distinguished from adult-like knowledge of the sound system of the language Children still need to learn how to efficiently encode words (phoneme inventory) Children presumably still need to learn how to map stored word forms onto pronunciations (phonological system of the language) Popular distributional approaches to learning the sound system address rather non-abstract encodings of sounds, at best

More Issues… Is there distributional evidence for contrasts in the input? Maye et al.: children can learn Werker et al.: demonstration from Japanese/English maternal speech How well does this scale beyond duration (1-dimensional)? Child needs to store all exemplars Child needs to know all relevant dimensions This could yield at most phones, not phonemes Why do children fail on minimal pair learning? Inaccurate representations, qualitatively different representations Hard tasks Fennell et al.: context helps

Questions about Development Change from 6-12 months What changes? Structure changing vs. structure adding What causes change to occur? Statistical distributions of sounds Reliably separable distributions? Storing and organizing tokens for analysis Knowing appropriate acoustic dimensions Allophony, e.g., k-palatalization in English Why does it take so long? Change from 12-20 months (Skepticism about the effect)

6-12 Months: What Changes? (clunky diagram, from Phillips, 2001)

Structure Changing Patricia Kuhl U. of Washington

Structure Adding Evidence for Structure Adding (i) Some discrimination retained when sounds presented close together (e.g. Hindi d-D contrast) (ii) Discrimination abilities better when people hear sounds as non-speech (iii) Adults do better than 1-year olds on some sound contrasts Evidence for Structure Changing (i) No evidence of preserved non-native category boundaries in vowel perception

Sources of Evidence Structure-changing: mostly from vowels Structure-adding: mostly from consonants Conjecture: structure-adding is correct in domains where there are natural articulatory (or acoustic) boundaries [cf. Phillips 2001, Cogn. Sci., 25, 711-731]