Modeling infant word segmentation: Another example of discovery fueled by CHILDES Alejandrina Cristia Laboratoire de Sciences Cognitives et Psycholinguistique.

Slides:



Advertisements
Similar presentations
Tone perception and production by Cantonese-speaking and English- speaking L2 learners of Mandarin Chinese Yen-Chen Hao Indiana University.
Advertisements

Markpong Jongtaveesataporn † Chai Wutiwiwatchai ‡ Koji Iwano † Sadaoki Furui † † Tokyo Institute of Technology, Japan ‡ NECTEC, Thailand.
Fast Algorithms For Hierarchical Range Histogram Constructions
Infant sensitivity to distributional information can affect phonetic discrimination Jessica Maye, Janet F. Werker, LouAnn Gerken A brief article from Cognition.
Method Participants Fifty-six undergraduate students (age range 19-37), 14 in each of the four language groups (monolingual, Spanish-English bilingual,
Phonetic Detail in Developing Lexicon Daniel Swingley 2010/11/051Presented by T.Y. Chen in 599.
Why do bilinguals gesture more than monolinguals? Elena Nicoladis University of Alberta.
Distributional Cues to Word Boundaries: Context Is Important Sharon Goldwater Stanford University Tom Griffiths UC Berkeley Mark Johnson Microsoft Research/
Learning the passive in natural(istic) settings Katie Alcock, Ken Rimba, Manizha Tellaie, and Charles Newton Thanks to Kamil ud Deen.
McEnery, T., Xiao, R. and Y.Tono Corpus-based language studies. Routledge. Unit A 2. Representativeness, balance and sampling (pp13-21)
Sebastián-Gallés, N. & Bosch, L. (2009) Developmental shift in the discrimination of vowel contrasts in bilingual infants: is the distributional account.
Infant Speech Perception & Language Processing. Languages of the World Similar and Different on many features Similarities –Arbitrary mapping of sound.
Learning of Word Boundaries in Continuous Speech using Time Delay Neural Networks Colin Tan School of Computing, National University of Singapore.
Active Learning for Statistical Phrase-based Machine Translation Gholamreza Haffari Joint work with: Maxim Roy, Anoop Sarkar Simon Fraser University NAACL.
Statistical Learning in Infants (and bigger folks)
A Psycholinguistic Perspective on Child Phonology Sharon Peperkamp Emmanuel Dupoux Laboratoire de Sciences Cognitives et Psycholinguistique, EHESS-CNRS,
1 Module One: Measurements and Uncertainties No measurement can perfectly determine the value of the quantity being measured. The uncertainty of a measurement.
1 Minimum Error Rate Training in Statistical Machine Translation Franz Josef Och Information Sciences Institute University of Southern California ACL 2003.
Phone-Level Pronunciation Scoring and Assessment for Interactive Language Learning Speech Communication, 2000 Authors: S. M. Witt, S. J. Young Presenter:
Pattern Associators, Generalization, Processing Psych /719 Feb 6, 2001.
Paul van Mulbregt Sheera Knecht Jon Yamron Dragon Systems Detection at Dragon Systems.
Second Language Acquisition To Think About: Think about a baby acquiring his/her first language. Think about a person acquiring a second language. What.
The Effect of Music Tempo on Memory Matthew Le Rockdale Magnet School for Science and Technology Students commonly listen to music while studying for tests.
1 Consider the k studies as coming from a population of effects we want to understand. One way to model effects in meta-analysis is using random effects.
In response… How QCA can be combined with other methods in evaluation? How can various ideas that are part of the whole QCA package be useful in themselves?
Effects of Word Concreteness and Spacing on EFL Vocabulary Acquisition 吴翼飞 (南京工业大学,外国语言文学学院,江苏 南京211816) Introduction Vocabulary acquisition is of great.
17th International Conference on Infant Studies Baltimore, Maryland, March 2010 Language Discrimination by Infants: Discriminating Within the Native.
Language Identification and Part-of-Speech Tagging
Analysis of AP Exam Scores
Effects of Reading on Word Learning
Lexical and Semantic Development: Part 1
Virtual University of Pakistan
Quality of Electronic Emergency Department Data: How Good Are They?
Measuring Monolinguality
Probability and the Normal Curve
DOUBLE JEOPARDY.
Thought as the basis of speech comprehension
Hunter Johnson1, Jack Darrow1, Oliver Sawi1,2, Kenneth Paap1;
Anik Wulyani, PhD candidate
Dean Luo, Wentao Gu, Ruxin Luo and Lixin Wang
Copyright © American Speech-Language-Hearing Association
Copyright © American Speech-Language-Hearing Association
Words in puddles of sound
Consonant variegations in first words: Infants’ actual productions of
Presentation by Hanh Dinh and Beverly Beaudette
What is development? Domains of development
Chapter 5 Sampling Distributions
Module 02 Research Strategies.
Introduction to Summary Statistics
Lesson 5. Lesson 5 Extraneous variables Extraneous variable (EV) is a general term for any variable, other than the IV, that might affect the results.
Who Taught YOU How to READ??????
Statistical Analysis Error Bars
Geology Geomath Chapter 7 - Statistics tom.h.wilson
Eiji Aramaki* Sadao Kurohashi* * University of Tokyo
10701 / Machine Learning Today: - Cross validation,
Contemporary Study: Sebastian and Hernandez-Gil (2012)
Chapter 5 Sampling Distributions
Elements of a statistical test Statistical null hypotheses
What Is Good Clustering?
Neural Speech Synthesis with Transformer Network
Key idea: Science is a process of inquiry.
Hindsight Bias Tendency to believe, after learning an outcome, that one would have foreseen it. “I knew.
Sampling and Power Slides by Jishnu Das.
False discovery rate estimation
Psych 231: Research Methods in Psychology
Exploring Probability Through Yahtzee Extensions
Thought as the basis of speech comprehension
Michael C. Frank Stanford University
Peer Review: How scientists get their homework marked
Lesson Overview 1.1 What Is Science?.
Presentation transcript:

Modeling infant word segmentation: Another example of discovery fueled by CHILDES Alejandrina Cristia Laboratoire de Sciences Cognitives et Psycholinguistique @Language Emergence: Competition, Usage, and Analyses, 2019-06-06

No overt & unambiguous word/morpheme boundaries in the input… The segmentation Problem ! There are no silences between words when we speech. Unlike written language, there are no spaces between words when we speak. This means that children, to learn the phonological forms of words (which is crucial to learn word meaning) they have to be able to find word boundaries. “no silences” Kuhl 2004

… yet by the end of the first year, infants know some words/morphemes ‘Feet’ ‘mommy’ ‘baby’ ‘alldone’ ‘tobed’ The segmentation Problem ! There are no silences between words when we speech. Unlike written language, there are no spaces between words when we speak. This means that children, to learn the phonological forms of words (which is crucial to learn word meaning) they have to be able to find word boundaries. “no silences” Kuhl 2004 Tincoff & Jusczyk 2012; Bergelson & Swingley 2012; Ngon et al. 2014

How to study segmentability? mommy talking … cute … something shiny go by? Let’s just get to the facts.

Today’s menu A methodology for studying word form segmentation using models Segmentability differences for child-directed versus adult-directed register (in French) … bilingual versus monolingual settings (English, Spanish, & Catalan) Implications for infant studies

Today’s menu A methodology for studying word form segmentation using models Segmentability differences for child-directed versus adult-directed register (in French) … bilingual versus monolingual settings (English, Spanish, & Catalan) Implications for infant studies

Input representation Acoustic Symbolic (‘Phonological text’) + realistic… … provided representations match babies’ few appropriate corpora (natural discourse & good quality audio) only one (reproducible) algorithm + lots of corpora can be used + lots of algorithms proposed + algorithms represent a wide range of strategies assumes babies represent input abstract, with zero errors

Example Phonologize Remove word boundaries & unitize *MOT: look at the doggie lUk At D2 dOgi Phonologize l U k A t D 2 d O g i Remove word boundaries & unitize Evaluate Precision = 1 of the 5 words found were words in the input = .2 Recall = 1 of the 4 words in the input was recovered = .25 lU kAt D2 dO gi Segment with some algorithm Token F-score = 2* (Precision * Recall) Precision + Recall Note -- one can also unitize at the syllable level: lUk At D2 dO gi (input) lUk At D2 dO gi (output)

Bernard et al. 2019 Beh Res Meth Example algorithms Every sentence is a word (SentBase) Every syllable is a word (SyllBase) Simplest strategies 1. Baseline Lignos 2012 Goal is to “cut” using local cues Transitional Probabilities (TP) TP_abs TP_rel 2. Sub-lexical x Absolute/Relative threshold Diphone-Based Segmentation (DiBS) Daland + 2009; Saksida + 2016 Goal is to learn a set of “minimal recombinable units” 3. Lexical Adaptor Grammar (AG) Phonotactics from Utterances Determine Distributional Lexical Elements (Puddle) Johnson + 2007; Monaghan + 2010 Package: wordseg.readthedocs.io Preprint: https://osf.io/nx49h/ Bernard et al. 2019 Beh Res Meth

Bernard et al. 2019 Beh Res Meth The process in WordSeg Package: wordseg.readthedocs.io Preprint: https://osf.io/nx49h/ Bernard et al. 2019 Beh Res Meth

Sample results: precision, recall, & F-score are correlated Providence corpus (Demuth, Culbertson, & Alter, 2006) on CHILDES

Sample results: Effects of algorithm and input represent-ation Naima, in Providence corpus (Demuth, Culbertson, & Alter, 2006) on CHILDES Sample results: Effects of algorithm and input represent-ation

Today’s menu A methodology for studying word form segmentation using models Segmentability differences for child-directed versus adult-directed register (in French) … bilingual versus monolingual settings (English, Spanish, & Catalan) Implications for infant studies

Why look at register? In child-directed speech, probably… More utterances consist of a single word (+ all models) Utterances are overall shorter in length (+ all models) *MOT: Attends! *MOT: Ouaistuvastemettreausoleilpourtesecherlescheveux!

Why look at register? In child-directed speech, probably… More utterances consist of a single word (+ all models) Utterances are overall shorter in length (+ all models) Utterances are more repetitious (+? lexical models) *MOT: coucoucoucousitufaisaisdespetitssourirestoi. *MOT: tumefaisdespetitssouriresXXXcoucoumongrand. *MOT: coucoutumefaisdessouriresoupas.

(Ask me about crosslinguistic extensions if curious!) French English Japanese LENA-Lyon corpus (LeNormand et al. HomeBank) Collected with child-worn device worn whole day  adult-directed speech is among caregivers Winnipeg corpus Collected with child-worn device worn whole day  adult-directed speech is among caregivers Riken corpus Collected in the lab  adult-directed speech is with experimenter Bogdan Ludusan Georgia Loukatou

on Le Normand, Canault, & Van Thai’s French “wild” ADS on Le Normand, Canault, & Van Thai’s LENA-Lyon corpus Loukatou + 2019 Proc Cog Sci

CDS-ADS: Conclusions Overall trend for better performance for child- than adult-directed speech But: reversed for some algorithms effect of register < 15% (in the best controlled cases, 2%)

Today’s menu A methodology for studying word form segmentation using models Segmentability differences for child-directed versus adult-directed register (in French) … bilingual versus monolingual settings (English, Spanish, & Catalan) Implications for infant studies

Why study word segmentation in a bilingual setting? ‘pié’ ‘mamá’ ‘bebé’ … Bilinguals need to: Learn words, like monolinguals do, but in two languages Overall less input in each language ‘Feet’ ‘mommy’ ‘baby’ ‘alldone’ ‘tobed’ The segmentation Problem ! There are no silences between words when we speech. Unlike written language, there are no spaces between words when we speak. This means that children, to learn the phonological forms of words (which is crucial to learn word meaning) they have to be able to find word boundaries. Hoff + 2012 Fibla & Cristia (submitted very soon, I hope)

Questions & predictions Are segmentation strategies equally successful when applied to bilingual and monolingual corpora? → Measure the performance of previously studied segmentation algorithms in a controlled monolingual versus bilingual corpus. Possible outcomes: The confusion hypothesis: variable and inconsistent input → Poorer performance for the bilingual than for the monolingual The resistant hypothesis: (if switching only at utterance edges) local statistical and lexical are still reliable → Similar performance for the bilingual and the monolingual The resistant hypothesis: our algorithms use transitional information to segment and we only mix languages at sentence boundaries = the segmentation strategies could still be robust Fibla & Cristia (submitted very soon, I hope)

Creating bilingual corpora

We had predicted that, if there were a bilingual disadvantage, then scores for the bilingual corpus should be below both of the matching monolingual corpora. In fact, this nearly never happens, as each bilingual corpus yields scores that are in between those of the corresponding monolingual corpora. There are only three exceptions to this general pattern. While DiBS for Catalan-Spanish followed that same pattern, the scores for the English-Spanish bilingual corpus were significantly lower than those for both English and Spanish monolingual. In TP-rel, the score for the two bilingual corpora overlapped with that for Spanish, and all three yielded lower Precision than Catalan and English. Nonetheless, the scores for DiBS and TP-rel vary by little, and thus a different measure of noise may come to reveal that even these differences are non-significant. Moreover, these 3 cases are exceptional, with all other 11 combinations (i.e., 7 algorithms times 2 bilingual corpora) fitting with the generalization that the bilingual score lies in between those found for the matching monolingual corpora

Three cases of bilingual < monolingual We had predicted that, if there were a bilingual disadvantage, then scores for the bilingual corpus should be below both of the matching monolingual corpora. In fact, this nearly never happens, as each bilingual corpus yields scores that are in between those of the corresponding monolingual corpora. There are only three exceptions to this general pattern. While DiBS for Catalan-Spanish followed that same pattern, the scores for the English-Spanish bilingual corpus were significantly lower than those for both English and Spanish monolingual. In TP-rel, the score for the two bilingual corpora overlapped with that for Spanish, and all three yielded lower Precision than Catalan and English. Nonetheless, the scores for DiBS and TP-rel vary by little, and thus a different measure of noise may come to reveal that even these differences are non-significant. Moreover, these 3 cases are exceptional, with all other 11 combinations (i.e., 7 algorithms times 2 bilingual corpora) fitting with the generalization that the bilingual score lies in between those found for the matching monolingual corpora

Three cases of bilingual < monolingual We had predicted that, if there were a bilingual disadvantage, then scores for the bilingual corpus should be below both of the matching monolingual corpora. In fact, this nearly never happens, as each bilingual corpus yields scores that are in between those of the corresponding monolingual corpora. There are only three exceptions to this general pattern. While DiBS for Catalan-Spanish followed that same pattern, the scores for the English-Spanish bilingual corpus were significantly lower than those for both English and Spanish monolingual. In TP-rel, the score for the two bilingual corpora overlapped with that for Spanish, and all three yielded lower Precision than Catalan and English. Nonetheless, the scores for DiBS and TP-rel vary by little, and thus a different measure of noise may come to reveal that even these differences are non-significant. Moreover, these 3 cases are exceptional, with all other 11 combinations (i.e., 7 algorithms times 2 bilingual corpora) fitting with the generalization that the bilingual score lies in between those found for the matching monolingual corpora 11 cases of bilingual ‘in between’ monolingual

Today’s menu A methodology for studying word form segmentation using models Segmentability differences for child-directed versus adult-directed register (in French) … bilingual versus monolingual settings (English, Spanish, & Catalan) Implications for infant studies

Effects of algorithm and input represent-ation size of algorithm x level effect = 40-60%? Effects of algorithm and input represent-ation Cristia + 2019 Open Mind

Effect of register Size of register effect < 10%? on LENA-Lyon corpus Loukatou + 2019 Proc Cog Sci

Effect of bilingualism We had predicted that, if there were a bilingual disadvantage, then scores for the bilingual corpus should be below both of the matching monolingual corpora. In fact, this nearly never happens, as each bilingual corpus yields scores that are in between those of the corresponding monolingual corpora. There are only three exceptions to this general pattern. While DiBS for Catalan-Spanish followed that same pattern, the scores for the English-Spanish bilingual corpus were significantly lower than those for both English and Spanish monolingual. In TP-rel, the score for the two bilingual corpora overlapped with that for Spanish, and all three yielded lower Precision than Catalan and English. Nonetheless, the scores for DiBS and TP-rel vary by little, and thus a different measure of noise may come to reveal that even these differences are non-significant. Moreover, these 3 cases are exceptional, with all other 11 combinations (i.e., 7 algorithms times 2 bilingual corpora) fitting with the generalization that the bilingual score lies in between those found for the matching monolingual corpora Size of bilingualism effect ~ 0%? Fibla & Cristia (submitted very soon, I hope)

Today’s menu A methodology for studying word form segmentation using models Segmentability differences as a function of language properties … child-directed versus adult-directed register (in Japanese, English, & French) … bilingual versus monolingual settings (English, Spanish, & Catalan) Implications for infant studies

What may babies be doing? Using CDI results & frequency effects Larsen + 2017 Interspeech & in prep

What may babies be doing? Using CDI results & frequency effects Coefficient of determination R2=.1 Larsen + 2017 Interspeech & in prep

phoneme-based models

syllable-based models phoneme-based models

Cut only at utterance edges  frequency of words in isolation

To be continued…

Thanks to... Families who agree to be recorded & for their data to be shared Researchers who record them and share on TalkBank TalkBank ~ Brian MacWhinney & you!

on Reiko Mazuka’s RIKEN corpus Japanese “lab” ADS on Reiko Mazuka’s RIKEN corpus much of this is in Ludusan et al. 2017 ACL (now working on journal paper with more material)

on Melanie Soderstrom’s Winnipeg corpus English “wild” ADS on Melanie Soderstrom’s Winnipeg corpus Cristia + 2019 Open Mind