Constraining Generalisation in Artificial Language Learning: Children are Rational Too Elizabeth Wonnacott, 1 Amy Perfors..2 University of Oxford 1,University.

Slides:



Advertisements
Similar presentations
Chapter 2 The Process of Experimentation
Advertisements

Animal, Plant & Soil Science
Cross Cultural Research
ENGLISH LEARNING FOR NON- NATIVE CHILDREN AROUND THE WORLD: SHOULD IT BE “SINK OR SWIM” APPROACH? By Majida Mehana, Ph.D.
Second Language Acquisition and the Critical Period Grant Goodall Dept. of Linguistics.
Chapter 4 Key Concepts.
Psycholinguistic what is psycholinguistic? 1-pyscholinguistic is the study of the cognitive process of language acquisition and use. 2-The scope of psycholinguistic.
Contrastive Analysis, Error Analysis, Interlanguage
18 and 24-month-olds use syntactic knowledge of functional categories for determining meaning and reference Yarden Kedar Marianella Casasola Barbara Lust.
Infant sensitivity to distributional information can affect phonetic discrimination Jessica Maye, Janet F. Werker, LouAnn Gerken A brief article from Cognition.
Using prosody to avoid ambiguity: Effects of speaker awareness and referential context Snedeker and Trueswell (2003) Psych 526 Eun-Kyung Lee.
Putting Together the Pieces: Meaning Matters in Children’s Plural Comprehension Craig Van Pay, Areanna Lakowske & Jennifer Zapf.
Psycholinguistic methodology Psycholinguistics: Questions and methods.
Casenhiser and Goldberg (2005) Ability to learn to pair novel constructional meaning with novel form Known nouns and nonsense verb arranged in non- English.
1 Attention and Inhibition in Bilingual Children: evidence from the dimensional change card sort Task By: Ellen Bialystok and Michelle M.Martin.
Word Retrieval in a Stem Completion Task: Influence of Number of Potential Responses Christine Chiarello 1, Laura K. Halderman 1, Cathy S. Robinson 1 &
Language, Mind, and Brain by Ewa Dabrowska Chapter 2: Language processing: speed and flexibility.
Second Language Proficiency Places Cognitive Constraints on Sentence Processing Noriko Hoshino Department of Psychology The Pennsylvania State University.
1 Human simulations of vocabulary learning Présentation Interface Syntaxe-Psycholinguistique Y-Lan BOUREAU Gillette, Gleitman, Gleitman, Lederer.
PSY 369: Psycholinguistics Language Acquisition: Bilinugalism.
Building Background F9: Vocabulary Bell Work: Write 3 strategies you can use in the classroom to help students link past learning with new concepts Opening.
1. Introduction Which rules to describe Form and Function Type versus Token 2 Discourse Grammar Appreciation.
Experimental Research
Growing Up Bilingual. Outline Genesee (1989) –Different approaches to explaining bilingual mixing in children. Au & Glusman (1990) –Mutual exclusivity.
McEnery, T., Xiao, R. and Y.Tono Corpus-based language studies. Routledge. Unit A 2. Representativeness, balance and sampling (pp13-21)
 For the IB Diploma Programme psychology course, the experimental method is defined as requiring: 1. The manipulation of one independent variable while.
Experimental study of morphological priming: evidence from Russian verbal inflection Tatiana Svistunova Elizaveta Gazeeva Tatiana Chernigovskaya St. Petersburg.
Jelena Mirković and Maryellen C. MacDonald Language and Cognitive Neuroscience Lab, University of Wisconsin-Madison Introduction How to Study Subject-Verb.
Psych 156A/ Ling 150: Acquisition of Language II Lecture 16 Language Structure II.
Impact of professional development training in indirect language stimulation techniques on language development in Head Start ESL preschool children Research.
Single-Factor Experimental Designs
TEMPLATE DESIGN © Learning Words and Rules Abstract Knowledge of Word Order in Early Sentence Comprehension Yael Gertner.
Experimental Research Methods in Language Learning Chapter 1 Introduction and Overview.
Educational Psychology, 7 th edition Jeanne E. Ormrod © 2011 Pearson Education, Inc. All rights reserved. 1-1 Understanding research.
The Research Design. Experimental Design Definition A description of what a researcher would like to find out and how to find it out. Pre-requisites 1.Identification.
Adele E. Goldberg. How argument structure constructions are learned.
Quantitative and Qualitative Approaches
EDN:204– Learning Process 30th August, 2010 B.Ed II(S) Sci Topics: Cognitive views of Learning.
First Language Acquisition
Research Strategies. Why is Research Important? Answer in complete sentences in your bell work spiral. Discuss the consequences of good or poor research.
1 Cross-language evidence for three factors in speech perception Sandra Anacleto uOttawa.
Cognitive Processes PSY 334 Chapter 11 – Language Structure June 2, 2003.
Generic Tasks by Ihab M. Amer Graduate Student Computer Science Dept. AUC, Cairo, Egypt.
© 2011 Cengage Learning. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part, except for use as permitted in a license.
Lexical and morphosyntactic minimal pairs. Evidence for different processing Luca Cilibrasi, Vesna Stojanovik, Patricia Riddell, School of Psychology,
Experimental Research Methods in Language Learning Chapter 5 Validity in Experimental Research.
What infants bring to language acquisition Limitations of Motherese & First steps in Word Learning.
The Critical Period for Language Acquisition: Evidence from Second Language Learning CATHERINE E. SNOW AND MARIAN HOEFNAGEL-HÖHLE UNIVERSITY OF AMSTERDAM.
Second Language Acquisition
EXPERIMENTS AND EXPERIMENTAL DESIGN
ONLINE USAGE OF THEORY OF MIND CONTINUES TO DEVELOP IN LATE ADOLESCENCE Iroise Dumontheil, Ian A. Apperly, and Sarah-Jayne Blakemore.
The Emergent Structure of Semantic Knowledge
Method. Input to Learning Two groups of learners each learn one of two new Semi-Artificial Languages. Both Languages: Example sentences: glim lion bee.
The Computational Nature of Language Learning and Evolution 10. Variations and Case Studies Summarized by In-Hee Lee
A. Baker, J. de Jong, A. Orgassa & F. Weerman Collaborators: VARIFLEX project: Elma Blom & Daniela Polišenská (NWO-research grant : Disentangling.
1 Vocabulary acquisition from extensive reading: A case study Maria Pigada and Norbert Schmitt ( 2006)
Chapter 3 Language Acquisition: A Linguistic Treatment Jang, HaYoung Biointelligence Laborotary Seoul National University.
Chapter 10 Language acquisition Language acquisition----refers to the child’s acquisition of his mother tongue, i.e. how the child comes to understand.
VISUAL WORD RECOGNITION. What is Word Recognition? Features, letters & word interactions Interactive Activation Model Lexical and Sublexical Approach.
Effects of Reading on Word Learning
LANE 622 APPLIED LINGUISTICS
PSYC 206 Lifespan Development Bilge Yagmurlu.
Investigating the combined effects of word frequency and contextual predictability on eye movements during reading Christopher J. Hand Glasgow Language.
Cognitive Processes in SLL and Bilinguals:
Selin Gulgoz Susan A. Gelman University of Michigan Introduction
Reading and Frequency Lists
CHAPTER 4 Designing Studies
Saidna Zulfiqar bin Tahir STATE UNIVERSITY OF MAKASSAR
Noriko Hoshino Department of Psychology
Biological Science Applications in Agriculture
Presentation transcript:

Constraining Generalisation in Artificial Language Learning: Children are Rational Too Elizabeth Wonnacott, 1 Amy Perfors..2 University of Oxford 1,University of Adeleaide 2 1. Abstract 2. Background Successful language acquisition involves generalization, but learners must balance this against the acquisition of lexical constraints. Examples occur throughout language. For example, English native speakers know that certain noun-adjective combinations are impermissible (e.g. strong winds, high winds, strong breezes, *high breezes). Another example is the restrictions imposed by verb sub- categorization, (e.g. I gave/sent/threw the ball to him; I gave/sent/threw him the ball; I donated/carried/pushed the ball to him; * I donated/carried/pushed him the ball). Such lexical exceptions have been considered problematic for acquisition: if learners generalize abstract patterns to new words, how do they learn that certain specific combinations are restricted? (Baker, 1979). Certain researchers have proposed domain-specific procedures (e.g. Pinker, 1989 resolves verb sub- categorization in terms of subtle semantic distinctions). An alternative approach is that learners are sensitive to distributional statistics and use this information to make inferences about when generalization is appropriate (Braine, 1971). A series of Artificial Language Learning experiments have demonstrated that adult learners can utilize statistical information in a rational manner when determining constraints on verb argument-structure generalization (Wonnacott, Newport & Tanenhaus, 2008). The current work extends these findings to children in a different linguistic domain (learning relationships between nouns and particles). We also demonstrate computationally that these results are consistent with the predictions of domain-general hierarchical Bayesian model (cf. Kemp, Perfors & Tenebaum, 2007). 3. Experiments Inference is performed on multiple levels simultaneously: Level 1 knowledge about the construction distribution of specific nouns (represented by the  s); Level 2 knowledge about the nature of constructions in the language as a whole (represented by  and  ); and Level 3 priors about the nature of that knowledge (represented by and  ). 6. Conclusions Like adults in previous studies, children in these experiments show rational statistical learning when determining the extent of generalization. The results are captured by a hierarchical Bayesian model capable of learning about structure variability on several levels simultaneously. Both humans and the model make inferences about the extent to which particle usage is lexically conditioned. This statistic interacts with lexical frequency. In current work: the model was given the equivalent input to human participants except that minimal exposure nouns are heard only once. Summary: Model qualitatively replicates critical aspects of human performance. (N.b slightly more influence of ‘lexical’ constraints but model not subject to memory limitations – to be followed up!). Training Procedure (in 2 * 15 minute sessions over 2 consecutive days) Each session began with picture labelling: e.g. see say : “cat” “giraffe” Then sentence practice: e.g. seehear: “moop giraffe tay” Experimenter encourages children to repeat aloud. No other instruction. Participants: 44 children recruited from Year 1 classrooms (mean age 6 years). All monolingual native English speakers. 11 children assigned to learn each of 4 input languages (below) Pseudo-random assignment matching ages across conditions. Language Paradigm Vocabulary 8 nouns cat, giraffe, pig, dog, cow, (“borrowed” from English) crocodile, mouse (4 in input, 4 reserved for testing) 1 verb moop “THERE ARE TWO….” 2 particles dow, tay NO SEMANTICS BUT OBLIGTORY IN NP Sentences moop + noun + particlee.g. moop giraffe dow moop giraffe tay “THERE ARE TWO GIRAFFES particle” Generalist Language - 4 ‘alternating’ nouns 75% dow; 25% tay Lexicalist Language -3 dow-only nouns -1 tay-only nouns Mixed language ‘alternating’ nouns 50% dow; 50% tay -1 dow-only noun -1 tay-only noun constrained nouns 3x as frequent Input languages differ in extent to which usage of particles is lexically determined Familiar nouns (from the input) New nouns 3 types: 2 Entirely novel. 1 Minimal-exposure -dow not in input but presented in four sentences just before test, always dow in each sentence 1 Minimal-exposure –tay not in input but presented in four sentences just before test, always tay in each sentence Input Languages and Test Types follow WNT. Note: new nouns explore generalization of noun- general patterns (entirely novel nouns) and how learners deal with very small amount of language specific input (minimal exposure nouns) Testing Procedure : Production Test e.g. See hear.: “moop” Children asked to say the whole sentence. Score – use of dow versus tay NB – break mid testing for more sentence practice which provided the ‘minimal exposure’ 4 Input languages (one for each of four groups) noun-particle co-occurrences can be manipulated. References : Baker, C. L. (1979). Syntactic theory and the projection problem. Linguistic Inquiry 10, Hudson-Kam, C.L., Newport, E.L. (2005) Regularizing unpredictable variation: The roles of adult and child learners in language formation and change. Language Learning and Development, 1, Kemp, C., Perfors, A., Tenenbaum, J.B. (2007) Learning overhypotheses with hierarchical Bayesian models. Developmental Science 10, Newport, E. (1990) Maturational constraints on language learning. Cognitive Science, 14, Snow, C.E., & Hoefnagel-Hohle, M. (1978). The critical period for language acquisition: Evidence from second language learning. Child Development, 49, Wonnacott, E., Newport, E.L., & Tanenhaus, M.K. (2008). Acquiring and processing verb argument Structure: Distributional learning in a miniature language. Cognitive Psychology 56, Test items for Production Test (identical for all groups) dow-only and tay-only nouns are: matched in frequency in Lexicalist and Mixed Language 1 *3 more frequent in Mixed Language 2. Mixed language ‘alternating’ nouns 50% dow; 50% tay -1 dow-only noun -1 tay-only noun all nouns equally frequent Noun A Noun B Noun C General inferences about the nature of particle distributions across the language e.g., Nouns may tend to occur with one particle only. Each occurs equally often across the language. HBMs learn on multiple levels simultaneously Hierarchical Bayesian Models (HBMs) can explain the computational principles that allow structure variability to be learned Level 2 Level 1 Knowledge about particle distributions of individual nouns (Different colors represent the different proportion of time each noun occurs in a different construction) Wonnacott, Newport & Tanenhaus (2008) (henceforth WNT) conducted a series of Artificial Language Learning experiments in which adult participants were exposed to miniature languages with two competing synonymous transitive constructions. Verbs in these languages were arbitrarily constrained as to whether they occurred in just one or both structures, with no semantic or phonological cues to verb-type. WNT Central questions: Do learners acquire verb-specific and verb-general statistical patterns? What factors affect the tendency to generalize a verb to a new construction not encountered in the input? WNT Central Findings: Learners acquire both verb-specific and verb-general statistical patterns (i.e. learned the likelihood of encountering a particular structure both with a given verb and with verbs in general). The tendency to use a verb in a new structure was affected by: - Verb frequency (less likelihood of generalizing a more frequent verb - Extent to which construction usage was lexically determined across the language as a whole The last was particularly obvious when comparing the treatment of very low frequency ‘minimal exposure’ verbs by learners of different languages. WNT argued that learners were utilizing statistical information in accordance with its utility/ relevance in the past – i.e. showing rational statistical learning. The need for Artificial Language Learning experiments with children -First language acquisition primarily occurs in early childhood. - Language learning (first and second) is generally more successful when it begins in early childhood _(Newport 1990) - Adults may use conscious learning strategies unavailable to children. The difficulty of Artificial Language Learning experiments with children -Generally fewer and shorter sessions learning sessions are practical. -Children are slower than adults in early stages of second language learning (Snow & Hoefnagle-Hohle, 1978) -Pilot work suggests WNT video paradigm inappropriate for learning mappings between event structure and word-order. (Ongoing work explores alternative methodologies - e.g. live act out. Watch this space!) NB - problem of determining generalization not limited to verb argument structures. 4. Computational Model 5. Results Significant effect of noun type in all languages. alternating nouns: Production probabilities match input statistics (aprox. 75% dow in Generalist language 50 % dow in Mixed Language 1 and 2) dow-only and tay-only nouns: Production probabilities reflect lexical constraints but Significantly more lexical learning in Lexical Language than in Mixed Language 1  influence of presence of alternating nouns Significantly more lexical learning in Mixed Language 2 than in Mixed Language 1  influence of lexicial (noun) frequency Experimental Data: Productions probabilities with familiar nouns alternating dow-only tay-only Note: Sentences with incorrect nouns or no particle are excluded (these make up approx 10% of the data) so 0% = 100% tay. Productions probabilities with novel nouns novel min.exp. dow min.exp tay Significant effect of noun-type in lexical language, marginal effect in other languages Entirely novel nouns: Production probabilities match input statistics – note not associated with these particular nouns (noun-general statistic). Minimal exposure nouns: In Generalist and Mixed languages usage of particles primarily influenced by noun-general statistic (little influence of 4 exposures). In Lexicalist language primarily influenced by 4 noun-specific exposures (little influence of noun-general statistic) Modeling data: “Production” probabilities with familiar and novel verbs. Acknowledgements: Many thanks to Jennifer Thomson for collecting the experimental data and to Prof. Kate Nation for helpful advice and discussion. This work was supported by a grant from the Economic and Social Research Council awarded to the first author, British Academy Fellowship awarded to the first author and a NSF Graduate Fellowship awarded to the second author. Aim of current work: To explore factors affecting balance between generalization and lexically-specific learning with children in a new linguistic domain (similar to that used in previous Artificial Language Learning experiments with children – see Hudson-Kam & Newport, 2005). Questions: Are the rational statistical learning procedures in WNT also relevant to child learning? Critically, is the tendency to generalize affected by: (a) lexical frequency (b) the extent to which the language as a whole exhibits lexically based patterns Note – initial aim is to explore purely distributional learning - will use languages with no relevant semantic/phonological cues Noun-general usage of two particles matched in lexical and generalist languages (75% dow bias) % nouns produced with dow % nouns produced with dow No semantic or phonological cues to noun type.