Morphology in the Decomposing Brain: Correlational Analyses of Single Trial MEG Data Alec Marantz, Olla Somolyak, Ehren Reilly Bill Badecker, Asaf Bachrach.

Slides:



Advertisements
Similar presentations
Accessing spoken words: the importance of word onsets
Advertisements

Summer 2011 Tuesday, 8/ No supposition seems to me more natural than that there is no process in the brain correlated with associating or with.
CS Morphological Parsing CS Parsing Taking a surface input and analyzing its components and underlying structure Morphological parsing:
Timing of the brain events underlying access to consciousness during the attentional blink Claire Sergent, Sylvain Baillet, & Stanislas Dehaene.
Knowing More than One Language: The Psycholinguistics of Bilingualism Marina Blekher Department of Linguistics.
Detecting Conflict-Related Changes in the ACC Judy Savitskaya 1, Jack Grinband 1,3, Tor Wager 2, Vincent P. Ferrera 3, Joy Hirsch 1,3 1.Program for Imaging.
A few pilots to drive this research Well-trained subjects: 15 hours, including 5 of practice. Stimuli: Holistic: Separable: Task: "Same"-"Different" task.
Morphology and Meaning in the English Mental Lexicon By William Marlsen-Wilson, Lorraine Komisarjevsky Tyler, Rachelle Waksler, and Lianne Older Presented.
Using prosody to avoid ambiguity: Effects of speaker awareness and referential context Snedeker and Trueswell (2003) Psych 526 Eun-Kyung Lee.
Processing Multiple Unrelated Meanings versus Multiple Related Senses Ekaterini Klepousniotou McGill University.
Using Evoked Magnetoencephalographic Responses for the Cognitive Neuroscience of Language Alec Marantz MIT KIT/MIT MEG Joint Research Lab Department of.
Introduction Complex words may be either (a) stored as full forms in the mental lexicon, or (b) undergo decomposition into their constituent morphemes.
Sentence Memory: A Constructive Versus Interpretive Approach Bransford, J.D., Barclay, J.R., & Franks, J.J.
 Previous studies have found facilitatory combinability effect in transparent characters, which have semantic radicals with clear meaning. Our results.
Experiment 2: MEG Study Materials and Methods: 11 right-handed subjects with 20:20 vision were run. 3 subjects’ data was discarded because of poor performance.
Language (and Decomposition). Linguistics provides… a highly articulated “computational” (generative) theory of the mental representations of language.
The Timecourse of Morphological Processing: Base and surface frequency effects in speed-accuracy tradeoff designs Jennifer Vannest University of Michigan.
Introduction to Linguistics and Basic Terms
Writing with APA style (cont.) & Experiment Basics: Variables Psych 231: Research Methods in Psychology.
Word Retrieval in a Stem Completion Task: Influence of Number of Potential Responses Christine Chiarello 1, Laura K. Halderman 1, Cathy S. Robinson 1 &
Features and Object in Visual Processing. The Waterfall Illusion.
Language, Mind, and Brain by Ewa Dabrowska Chapter 2: Language processing: speed and flexibility.
Let remember from the previous lesson what is Knowledge representation
Influence of Word Class Proportion on Cerebral Asymmetries for High and Low Imagery Words Christine Chiarello 1, Connie Shears 2, Stella Liu 3, and Natalie.
Early effects of morphological complexity on visual evoked fields in MEG Eytan Zweig & Liina Pylkkänen New York University 80 th Annual LSA meeting, January.
Liina Pylkkänen (NYU) and Alec Marantz (MIT) Morphological families and phonological neighborhoods – who competes when? MEG evidence.
Psycholinguistics 05 Internal Lexicon.
Models of Generative Grammar Smriti Singh. Generative Grammar  A Generative Grammar is a set of formal rules that can generate an infinite set of sentences.
Lecture 1 Introduction: Linguistic Theory and Theories
Language and Culture Prof. R. Hickey SoSe 2006 How language works
CAS LX 502 Semantics 3a. A formalism for meaning (cont ’ d) 3.2, 3.6.
Studying Visual Attention with the Visual Search Paradigm Marc Pomplun Department of Computer Science University of Massachusetts at Boston
What “Mice Trap” tells us about the mental lexicon Carolyn J. Buck-Gengler 1,3, Lise Menn 2,3, and Alice F. Healy 1,3 University of Colorado at Boulder.
Lemmatization Tagging LELA /20 Lemmatization Basic form of annotation involving identification of underlying lemmas (lexemes) of the words in.
Experimental study of morphological priming: evidence from Russian verbal inflection Tatiana Svistunova Elizaveta Gazeeva Tatiana Chernigovskaya St. Petersburg.
Word category and verb-argument structure information in the dynamics of parsing Frisch, Hahne, and Friedericie (2004) Cognition.
The Limits of the Left Hemisphere Interpreter in a Split Brain patient Rami H. Gabriel University of California, Santa Barbara Department of Psychology.
Introduction Pinker and colleagues (Pinker & Ullman, 2002) have argued that morphologically irregular verbs must be stored as full forms in the mental.
1 by Catherine-Marie Longtin, Juan Segui, and Pierre A. Halle´ Laboratoire de Psychologie Expe´rimentale, CNRS, Universite´ Rene´ Descartes, Boulogne-
Visual Word Form Recognition: An MEG study using Masked Priming Heejeong Ko 1, Michael Wagner 1, Linnaea Stockall 1, Sid Kouider 2, Alec Marantz 1 1 Department.
Age of acquisition and frequency of occurrence: Implications for experience based models of word processing and sentence parsing Marc Brysbaert.
Adele E. Goldberg. How argument structure constructions are learned.
323 Morphology The Structure of Words 3. Lexicon and Rules 3.1 Productivity and the Lexicon The lexicon is in theory infinite, but in practice it is limited.
Learning Automata and Grammars Peter Černo.  The problem of learning or inferring automata and grammars has been studied for decades and has connections.
Studying Memory Encoding with fMRI Event-related vs. Blocked Designs Aneta Kielar.
1 Compiler Construction (CS-636) Muhammad Bilal Bashir UIIT, Rawalpindi.
The Influence of Feature Type, Feature Structure and Psycholinguistic Parameters on the Naming Performance of Semantic Dementia and Alzheimer’s Patients.
Capturing patterns of linguistic interaction in a parsed corpus A methodological case study Sean Wallis Survey of English Usage University College London.
Stem Homograph Inhibition and Stem Allomorphy: Representing and Processing Inflected Forms in a Multilevel Lexical System, 1999 & Morphological Parsing.
1 Cross-language evidence for three factors in speech perception Sandra Anacleto uOttawa.
Tracking the able in stable: Toward an understanding of morphological decomposition in processing and representation Alec Marantz, Olla Somolyak, Ehren.
The effects of working memory load on negative priming in an N-back task Ewald Neumann Brain-Inspired Cognitive Systems (BICS) July, 2010.
Lexical and morphosyntactic minimal pairs. Evidence for different processing Luca Cilibrasi, Vesna Stojanovik, Patricia Riddell, School of Psychology,
Natural Language Processing Chapter 2 : Morphology.
Adam Houston 1, Chris Westbury 1 & Morton Gernsbacher 2 1 Department of Psychology, University of Alberta, Canada, 2 Department of Psychology, University.
COGNITIVE MORPHOLOGY Laura Westmaas November 24, 2009.
3 Phonology: Speech Sounds as a System No language has all the speech sounds possible in human languages; each language contains a selection of the possible.
Neural correlates of morphological decomposition in a morphologically rich language : An fMRI study Lehtonen, M., Vorobyev, V.A., Hugdahl, K., Tuokkola.
October 2004CSA3050 NLP Algorithms1 CSA3050: Natural Language Algorithms Morphological Parsing.
Models of Production and Comprehension [1] Ling4-437.
Chapter 3 Word Formation I This chapter aims to analyze the morphological structures of words and gain a working knowledge of the different word forming.
Morphology 1 : the Morpheme
VISUAL WORD RECOGNITION. What is Word Recognition? Features, letters & word interactions Interactive Activation Model Lexical and Sublexical Approach.
Argument Structure violation
Language, Mind, and Brain by Ewa Dabrowska
Cognitive Processes in SLL and Bilinguals:
Phonological Priming and Lexical Access in Spoken Word Recognition
The Nature of Learner Language
Phonological Priming and Lexical Access in Spoken Word Recognition
Decision Making as a Window on Cognition
Presentation transcript:

Morphology in the Decomposing Brain: Correlational Analyses of Single Trial MEG Data Alec Marantz, Olla Somolyak, Ehren Reilly Bill Badecker, Asaf Bachrach KIT/NYU MEG Joint Research Lab Departments of Psychology and Linguistics NYU

Map Simple Models of Lexical Access Rejected  simple dual route model (Pinker)  simple obligatory decomposition model  simple whole word access model Real Issues Identified  modality specific access “lexicon”?  what properties modulate affix-stripping/decomposition, stem access and recombination on an obligatory decomposition model?

Completed Experiment  evidence for, at least, full decomposition or interactive dual route model  no support for whole word access route  support for early effects of surface frequency of affixed forms relative to stem frequency at decomposition stage Experiment Underway  tracking the -able in amiable and stable

Competing Models of Lexical Access – according to Pinker’s Words and Rules Cartoon Version

Saturday Morning Models of Lexical Access Full storage model: all complex words (walked, taught) stored and accessed as wholes  only surface frequency effects predicted Full decomposition model: no complex words stored and accessed as wholes  only stem frequency effects predicted

Dual Route Model: irregular complex forms are stored and accessed as wholes; regular complex forms are not:  surface frequency effects for irregulars (and high frequency regulars)  stem frequency but no surface frequency effects on access for regulars

Facts Support Dual Route Model, if Alternatives are these Cartoon Versions Fact: Stem Frequency effects in access for complex words Fact: These effects are not attributable to post-access decomposition  masked priming studies showing morphological (stem) priming when neither form nor semantic priming are found

But, fact: Surface frequency effects in lexical access are found in wide variety of cases, including completely regular morphology (e.g., for most inflected words in Finnish)

Problems for Dual Route Model The representation of irregular derived or inflected forms must be complex  from the grammatical point of view, felt is as complex as walked  e.g., behavior with respect to do support, impossibility of further inflection or derivation, rigidity of meaning…  from the psycho and neurolinguistic point of view, irregulars contain the stem in the same way that regulars do  taught-teach identity priming in long-lag priming and for M350 brain response

Whole Word “Representations” for Regulars Surface frequency effects on access are seen for a variety of completely regular derivations and inflections. Obligatory decomposition:  surface frequency effects could be tied to decomposition (the more you’ve decomposed a particular letter/sound sequence into stem and affix, the faster you are at it) and/or  recombination (the more often you’ve put together a particular stem and affix, the faster you are at it) Against Pinker’s dual route model, such effects imply representation of whole word as complex structure, regardless of regularity.

Comparing surface and base frequency effects: Level 1 vs Level 2 Morphology: surface frequency effects even for transparent productive regular morphology, and for same words that yield base frequency effects

Surface Frequency Effect: even for productive transparent “-less” type

Base Frequency Effect: here only for productive transparent “-less” type

“Representations” Saying that every combination of morphemes in perception or production, no matter how regular, leaves a trace in the language system of the speaker is saying that frequency information is part of the grammar and that all combinations of morphemes are stored in some sense.

walked may “stored” as a complex form with a certain frequency in the same way that a saying like, And now for something completely different, is. Both must be composed with the grammar when heard or produced, but both may have frequency and special meaning information associated with them that have no implications for the grammar whatsoever.

Beyond the Cartoons

Realistic Full Decomposition Models Must… Recognize that complex words, both regular and irregular, are stored in some sense, leading to surface frequency effects (but this is true of phrases and sentences as well as words and holds no implications for the grammar) Investigate the role of surface frequency in  decomposition  stem access  recombination

Realistic Dual Route Models Must… Recognize that all complex forms must be representationally complex, containing structures of morphemes and contrasting with monomorphemic constituents Explore the possible existence of stored “whole word” word form representations at modality- dependent “access lexicons” to distinguish themselves from obligatory decomposition models which deny such representations

u n r e a l un unreal (??) real [un[real]] “not”REAL form code modality specific access lexicon(??) lemma (lexical entry) Encyclopedia Stored info about encountered items And now for something completely different UN+REAL (??) interactive dual route models and obligatory decomposition models differ on the possible presence of complex word forms in modality specific access lexicons, and perhaps on whether derived forms have “lexical entries”

Effect of “Dominance” on Lexical Access: view from interactive dual route model Jen Hay: importance of the relative frequency of a morphologically complex form with respect to the frequency of its stem. Complex words with high frequencies relative to their stems are “affix-” or “surface dominant”; those with low frequencies are “stem” or “base dominant” Hay: affix dominance leads to difficulty in parsing/decomposition, thus reliance on whole-word recognition and suppression of decomposition in favor of whole-word route.

Simplistic Prediction of Hay Model Affix dominant words should show surface frequency effects but no stem frequency effects since they are accessed via the whole word route. Stem dominant words should show stem (cumulative) frequency effects since they are accessed via the decomposition route.

matched for stem frequency (9), difference in surface dominant (mere(ly)) or stem dominant (sane(ly)) meremerely merely sanesanely sane

Taft (2004): “Morphological Decomposition and the Reverse Base Frequency Effect” Obligatory decomposition makes same predictions as Hay for RT Base frequency effects…  RT to complex word correlates with freq of stem …reflect accessing the stem of morphological complex forms whereas Surface frequency effects…  RT to complex word correlates with freq of complex word …reflect the stage of checking the recombination of stem and stripped affix for existence and/or well- formedness.

How can we distinguish these accounts of RT differences? PL-Dominant PL (High Surface Freq) SG-Dominant PL (Low Surface Freq) Post-Access processing (until response)Latency of Lexical Access Post-Access processing (until response)Latency of Lexical Access Full Parsing* Latency of Lexical AccessPost-Access processing (until response) Latency of Lexical AccessPost-Access processing (until response) Full Listing / Parallel Dual Route PL-Dominant PL (High Surface Freq) SG-Dominant PL (Low Surface Freq) Reilly, Badecker & Marantz 2006 (Mental Lexicon): by determining the point of lexical access via brain monitoring

Sequential processing of words

Pylkkänen and Marantz, 2003, Trends in Cognitive Sciences

(Pylkkänen, Stringfellow, Flagg, Marantz, Biomag2000 Proceedings, 2000) Repetition Frequency (Embick, Hackl, Shaeffer, Kelepir, Marantz, Cognitive Brain Research, 2001) Latency of M350 sensitive to lexical factors such as lexical frequency and repetition: reflects stage of lexical access

Experiment: parallel behavioral and MEG processing measures Lexical Manipulation (Baayen, Dijkstra & Schreuder, 1997, JML)  Lemma/stem frequency (CELEX database)  Stem vs. affix dominance Stem Frequency: Stem DominantAffix Dominant Highdesk – deskscrop – crops Middeck – deckscliff – cliffs Lowchef – chefschord – chords

Stimuli: 3 Lexical Categories Nouns: singular/plural  bone  bones Verbs:stem/progressive  chop  chopping Adjectives:adjective/-ly adverb  clear  clearly

Experiment: behavioral measures Reliable effect of stem frequency in RT

Experiment: behavioral measures Interacting effects on RT of affixation (base vs. affixed) and dominance (base-dominant vs. affix- dominant)

Analysis of M350 peak latency (brain index of lexical access) Reliable effect of Stem frequency for unaffixed words and for affixed words Unaffixed WordsAffixed Words

Analysis of M350 peak latency Reliable effect of Affixation (base vs. affixed)

Analysis of M350 peak latency No effect of Dominance (base-dominant vs. affix- dominant) on M350 peak latency

Analysis of M350 peak latency No interaction between Dominance (base-dominant vs. affix- dominant) and Affixation (base vs. affixed) M350 peak latencyCumulative Response Time

Analysis of M350 peak latency Evidence that early stages of access for affixed words is based on full parsing: Stem frequency affects M350/lexical access while whole word frequency affects post-access (recombination) stage of word recognition.

Problem with this Conclusion No acknowledgement of the effects of dominance and/or surface frequency on parsing stage of decomposition

Possible effects of dominance at different stages in word recognition parsing affixstem accessrecombin- ation and checking RT affix dominant merely harder, since tighter connection (possibly only at high surface freq values) based on stem frequency, possibly speeded if high conditional probability of stem given affix faster, should correlate with surface frequency at high freq values might correlate (-) with surface frequency, given speed up in recom-bination stem dominant sanely easier than for affix dominant (lower transition probability) based on stem frequency at lower surface freq values, no effect of surface freq at lower surface freq value, should correlate (-) with stem freq

Let’s examine these possibilities with correlational analyses

RMS Correlations Across Subjects For some set of sensors, calculate at each time point in each experimental “epoch” the root mean square (RMS) = the square root of the mean of the squares of the values at each sensor (after normalization of values) So, for each subject, for each item, an RMS “wave” can be provided for the correlational analysis At each time point, the RMS value for each stimulus is correlated with a stimulus variable

Grand Average All Stimuli All Subjects (11)

M170 Sensors Chosen on the basis of field pattern, subject by subject

M350 sensors chosen subject by subject

M170 Correlation with Dominance: Significant “parsing” effect

No M170 word frequency effect for unaffixed words

No M170 Word/Non-Word “lexicality” effect

Although surface frequency correlates with affix dominance in these words, no surface frequency M170 effect

Recombination Effect?: Correlation with Conditional Probability of Stem, Given Affix, for Affixed Words

Summary of Correlations M170 amplitude:  Dominance o For affixed words: r = 0.07; p < 0.05 o Affixed surface dominant words: r = 0.11; p < 0.01 M350 Latency:  Stem Frequency o For all words: r = -0.06; p < 0.05 o For affixed words: not significant (neg. cor; p = 0.21) o For unaffixed words: r = -0.07; p < 0.05 Post M350 activity and affix frequency:  Window 1: 380ms – 430ms, W2: 390ms – 440ms, W3: 400ms – 450ms  Left Sensors: o W1: r = 0.07; p < 0.05 o W2: r = 0.08; p < 0.01 o W3: r = 0.08; p < 0.01

Evidence for an orthographic word form lexicon Frequency of stem relative to full affixed form – affix dominance – correlates with M170 amplitude; implies access to some kind of stem representation Zweig & Pylkkänen (2006) show M170 effect of decomposition in the contrast between farmer (complex) and winter (simple), where the contrast implies access to a representation of farm at the M170

Zweig & Pylkkänen (2006) Bimorphemic: refill, Monomorphemic Orth: resume

Zweig & Pylkkänen (2006) Bimorphemic: farmer, Monomorphemic Orth: winter

Modality-Specific Access Lexicon? For: “Parsing” at the M170 may require access to “lexicalized” word forms (or to high-n n-grams, or what)? For: Dominance effects at the M170 suggest frequency information associated with word-forms  dominance reflects the conditional probability of the affix given the stem Against: At the M170 it’s difficult to find word-form frequency effects or lexicality effects

Sophisticated Interactive Dual-Route Models predict Whole Word Word Form Effects However, there is no evidence from the current study that whole word word forms are available for complex words To the contrary, complex words with high affix dominance – the very words that should have whole word word forms according to Hay – show the biggest effect of complexity at the M170, where word form effects are expected

New Experiment: Investigate Factors Influencing Stages of Processing for Morphologically Complex Words AND Provide Neural Indices of Morphological Decomposition for Controversial Cases Initial parsing and decomposition  affix frequency?  transition probabilities, computed over morphemes or over strings? stem activation  frequency  “family” structure (family size, frequency, semantic transparency, etc.)  conditional probability from affix recognition? re-merger of pieces  surface frequency and conditional probabilities? evaluation of combined structure  semantic transparency?

status of bound stems durable  same root in duration  predicts durability amiable  no other uses of root  but, predicts amiability (stability) cable  false parse, compared to stable

tracking the -able in amiable If words like tolerable with a recurring root and amiable with a unique root nevertheless are parsed and computed as is workable with a word root, then  M170 “parsing” effects should be visible for these “opaque” words, since effects are strong for affix-dominant words  M350 effects should be modulated by morphological family and conditional probability of word given the affix  correlational contrasts with “matched” unaffixed words should be seen at all stages of access

Categories of Affixed Words for New Experiment 1. Word-Affix  taxable 2. Root-Affix  tolerable 3. Pseudo-Affix (mono-morphemes)  capable Morphological parsing as from English Lexicon Project

Nine Affixes able ary ant ity ate ic er al ion

More Examples Word-AffixRoot-AffixPseudo-Affix rationalityquantityvicinity destroyersorcerercharacter classicspecificempiric

The 27 subgroups… All groups are equivalent and well-distributed over:  length  mean bigram count  frequency All word-affix groups are well-distributed over:  surface vs stem frequency dominance

Matched Words Matched on frequency of ending:  Calculated affix frequencies from CELEX database  For each affix, found a list of mono-morphemes with roughly the same ending frequency. (log frequency of +/- 0.3 from affix log frequency)  Chose 18 matched words for each affix that had same parameter distributions as affixed words.

Examples er  evening, lightning, mediocre ity  caricature, terrain, pertain able  sentence, thought, profound

Parsability Affixed and Matched words are equally distributed over “parsability” - transitional probability between the last 2 letters of the stem and the first two letters of the affix (or affix-match). Example - for “ability” this would be: given that you see “il” 5 letters from the end of a word, what is the probability that “it” will follow it?

Analysis in brain space A distributed source model for each subject is constructed based on the structural MRI of the S’s brain Brain regions and time windows of interest (ROIs) are identified based on the grand average of responses over all stimuli Correlations are computed with stimulus variables within each ROI

ROIs (green) for M170, left and right hemispheres for one subject

Preliminary results (N=2) word+affix words yield more/later M170 activity than root+affix words and, M170 responses correlate with affix dominance, as in previous experiment lemma to suffix transitional probability correlations with M170 activity for root+affix words, not word+affix words

representative single subject correlations M170 left hemisphere: root+affix:  Latency by TransProbLemma:r = p = M170 right hemisphere: root+affix:  Latency by TransProbLemma:r = p = word+affix:  Amp by Surface Dominance:r = p =  Amp by Stem Word Freq:r = p = 0.058

leading interpretation: For word+affix words, the most important variable is the relative frequency of the stem as an independent word compared to the frequency of the word+affix; here affix dominance inhibits parsing For root+affix, the predictability of the affix given the stem (like affix dominance) seems to affect parsing More evidence in favor of full decomposition along with some sort of orthographic access lexicon for morphemes

That’s all, fo..(oh, you know the rest…)