A new corpus for Spanish Second Language Acquisition Research L. Dominguez, R. Mitchell, M. J. Arche (U. of Southampton), E. Marsden (U. of York), F. Myles.

Presentation transcript:

A new corpus for Spanish Second Language Acquisition Research L. Dominguez, R. Mitchell, M. J. Arche (U. of Southampton), E. Marsden (U. of York), F. Myles (Newcastle U.)

A corpus for L2 Acquisition
SLA theory aims to understand the complex mechanisms and conditions behind learner grammars
Access to good quality data is crucial: learner production data + focused comprehension tasks
Increasing interest in the creation of electronic learner corpora:
–sharing data more easily
–automatising some aspects of data analysis through the use of software such as concordancers, part of speech taggers, etc.

Some Existing Learner Corpora
–CHILDES:
–TALKBANK:
–English Corpus Linguistics:
–L2 FRENCH (FLLOC):
–L2 (Written) SPANISH (CEDEL 2):

SPLLOC “Spanish Language Learner Oral Corpus”
2 year ESRC funded corpus project investigating the development of L2 Spanish
Aims:
–a small scale, high quality cross-sectional database of spoken learner Spanish
–topics being investigated lie at the syntax/discourse interface
Data collected:
- c. 40 hours of audio recordings (native/non-native)
- 80 written focused tests on word order
- 60 computer based tests on clitic comprehension
95% transcribed to date!

Immediate Research Agenda
Syntax/discourse interface as conceptualised in generative linguistics, including:
–The acquisition of Spanish word order
–Clitic pronouns
Verbal morphology
Development of the L2 lexicon

Corpus Design
Balance of spontaneous and focused data (semi-spontaneous oral tasks are complemented by focused judgement and production tasks)
Balance of genres (semi-spontaneous oral tasks include interview, narrative and discussion)
Balance of participants (20 L2 speakers from each of beginner, intermediate and advanced levels + NS speakers)
Flexibility of computer-aided analysis (use of the CHILDES system, plus an XML version)
Free web access to all materials (anonymised sound files, transcripts, analysis files) for all bona fide research users.

Summary of tasks by type, elicitation method and genre
Tasks: Modern Times, Loch Ness, Photos, Paired Discussion, Clitic Production, Clitic Comprehension, Word Order
SEMI-SPONTANEOUS (Oral): Narrative √√√; Interview √; Discussion √
FOCUSED TASKS: Oral, Production √; Computer based, Comprehension √; Paper, Written √

Some task samples

Loch Ness
Illustrations by Alex Brychta for “A Monster Mistake” by Roderick Hunt (Oxford Reading Tree, 2003) used by permission of Oxford University Press.

Modern Times

Photos task
Description of states
and
Description of events

Clitic Comprehension (computer based)
The learner hears a sentence with a clitic pronoun and has to click on the object it refers to.
32 screens: combination of number and gender (canonical and non-canonical) plus syntactic collocation.
Canonical feminine: -a ending (e.g. calculadora ‘calculator’)
Canonical masculine: -o ending (e.g. teléfono ‘phone’)
Non-canonical: no -a/-o ending (e.g. lápiz ‘pencil’)
Collocation: proclitic (as in conjugated verbs) vs. enclitic (as in infinitives).

Clitic Production (computer based) The learner is asked a question referring to an object based on the sequence of pictures shown. 32 slides; combination of number and gender (canonical and non-canonical) plus syntactic collocation.

Word Order Task (paper & pencil)
Context-dependent word order preference test
The learner is presented with 28 situations, each followed by a question
Two types of questions: What happened? (Broad focus); Who did x? (Narrow focus)
4 items by 7 syntactic contexts: 4xSVO, 4xVOS, 4xCLLD, 4xUnerg/Narrow, 4xUnerg/Broad, 4xUnacc/Narrow and 4xUnacc/Broad
Three options: Inverted (VS), non-inverted (SV) and both
1. You get home and your brother just tells you that he has got an [...] from your friend Sue and that he has very good news to tell you. You ask your brother “¿Qué ha pasado?” (What happened?) What could he say?
a. Se ha comprado un coche Sue (Sue has bought a car)
b. Sue se ha comprado un coche (Sue has bought a car)
c. Both sentences
2. Your brother is having some friends over for a get together at home. When your mother comes she sees some smoke coming out of the bathroom and she asks your brother: “¿Quién está fumando?” (Who’s smoking?) What could your brother say?
a. Oscar está fumando (Oscar is smoking)
b. Está fumando Oscar (Oscar is smoking)
c. Both sentences
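The 4 x 7 design described on this slide can be sketched as a small item grid. This is an illustrative sketch only, not the project's own materials or scoring procedure: the context labels and the three answer options come from the slide, while `build_items`, `tally` and the sample responses are hypothetical names and data introduced here.

```python
from collections import Counter, defaultdict
from itertools import product

# Context labels and answer options as listed on the slide.
CONTEXTS = ["SVO", "VOS", "CLLD", "Unerg/Narrow", "Unerg/Broad",
            "Unacc/Narrow", "Unacc/Broad"]
ITEMS_PER_CONTEXT = 4
OPTIONS = ("VS", "SV", "both")


def build_items():
    """Return one record per test item: 4 items for each of the 7 contexts."""
    return [{"context": context, "item": number}
            for context, number in product(CONTEXTS,
                                           range(1, ITEMS_PER_CONTEXT + 1))]


def tally(responses):
    """Count chosen options per context.

    `responses` is an iterable of (context, option) pairs (hypothetical
    input format, assumed for this sketch).
    """
    counts = defaultdict(Counter)
    for context, option in responses:
        if option not in OPTIONS:
            raise ValueError(f"unknown option: {option!r}")
        counts[context][option] += 1
    return counts


items = build_items()
print(len(items))  # 4 items x 7 contexts

# Invented responses, for illustration only.
sample = [("Unacc/Broad", "VS"), ("Unacc/Broad", "VS"), ("Unacc/Broad", "both")]
print(dict(tally(sample)["Unacc/Broad"]))
```

Keeping the context label on every response makes it straightforward to compare preferences for inverted and non-inverted orders across broad and narrow focus conditions.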

Summary of subjects by task (to date)
Columns: Task Type; Task Name; University (Final Year); Sixth Form College (Year 13); Lower Secondary School (Year 9); Natives (all ages)
Open-ended: Modern Times 205; Loch Ness 20 15; Photos 20 15; Paired Discussion 20 5
Focused: Clitic Comprehension 20 3; Picture Sequence; Word Order

Tools for Data Analysis
CHILDES (The Child Language Data Exchange System)
–CLAN = Computerised Language Analysis: computer program suite for transcribing, searching and analysing language data
–CHAT = Codes for the Human Analysis of Transcripts: a format for notation and transcription
Types of Analyses:
–FREQ, MLU, COMBO, KWAL
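To give a rough idea of what analyses of this kind compute, here is a minimal Python sketch over a CHAT-like transcript. It is not CLAN and does not reproduce its commands or options. The transcript is invented (not SPLLOC data), the speaker codes `P01` and `INV` are hypothetical, and utterance length is counted in words as a simplification, whereas CLAN's MLU is normally based on morphemes.

```python
from collections import Counter

# Invented transcript in a CHAT-like layout: "@" header lines,
# "*SPEAKER:" main tiers and "%" dependent tiers.
SAMPLE = """\
@Begin
@Participants:\tP01 Learner, INV Investigator
*INV:\tqué pasa en la foto ?
*P01:\tla niña come una manzana .
%com:\tpoints at the picture
*P01:\tel monstruo nada en el lago .
@End
"""

PUNCTUATION = {".", "?", "!", ","}


def utterances(chat_text, speaker):
    """Return one list of lower-cased word tokens per main-tier line of `speaker`."""
    prefix = f"*{speaker}:"
    result = []
    for line in chat_text.splitlines():
        if line.startswith(prefix):
            tokens = line[len(prefix):].split()
            result.append([t.lower() for t in tokens if t not in PUNCTUATION])
    return result


def freq(chat_text, speaker):
    """Word frequency count for one speaker (in the spirit of FREQ)."""
    return Counter(tok for utt in utterances(chat_text, speaker) for tok in utt)


def mlu_words(chat_text, speaker):
    """Mean length of utterance in words (a simplification of MLU)."""
    utts = utterances(chat_text, speaker)
    return sum(len(u) for u in utts) / len(utts) if utts else 0.0


def kwal(chat_text, speaker, keyword):
    """Utterances of `speaker` containing `keyword` (in the spirit of KWAL)."""
    return [" ".join(u) for u in utterances(chat_text, speaker) if keyword in u]


print(freq(SAMPLE, "P01").most_common(1))
print(mlu_words(SAMPLE, "P01"))
print(kwal(SAMPLE, "P01", "lago"))
```

Filtering on the `*SPEAKER:` prefix keeps investigator turns and dependent tiers out of the learner's counts, which is the main reason a tier-based transcription format is convenient for this kind of analysis.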

Next Steps
Database will be available for use by the research community via (in spring 2008)
Articles & conference papers (in 2007):
–BAAL LLT SIG
–GALA
–BUCLD
–HLS
–SLRF
CHILDES training workshop:
–25 January 2008, University of Southampton.

Acknowledgments
The SPLLOC project is supported by an ESRC research grant (RES ).
We would like to thank all the participants in the project, including subjects, transcribers and fieldworkers.

References
Domínguez, L., Arche, M.J. 2007a. “Deviant optional forms in L2 Spanish: the case of word order variation”. Poster presentation at GALA, Barcelona, 6-8 September.
Domínguez, L., Arche, M.J. 2007b. “Optionality in L2 grammars: the acquisition of SV/VS contrast in Spanish”. To be presented at BUCLD 32, Boston, 1-4 November.
Domínguez, L., Arche, M.J. 2007c. “The L2 Acquisition of SV/VS contrast in Spanish”. To be presented at the Hispanic Linguistic Symposium, Texas, 1-4 November.
Domínguez, L., Arche, M.J., Mitchell, R., Marsden, E. and Myles, F. “Innovations in Spanish SLA research methodology: introducing the ‘Spanish Learner Language Oral Corpus’”. To be presented at the Hispanic Linguistic Symposium, Texas, 1-4 November.
Granger, S., Hung, J. and Petch-Tyson, S. (eds.) Computer Learner Corpora, second language acquisition and foreign language teaching. Amsterdam: John Benjamins.
Lozano, C. & Mendikoetxea, A. (in press). Verb-Subject order in L2 English: new evidence from the ICLE corpus. In: Actas del XXV Congreso Internacional de AESLA. Universidad de Murcia.
Lozano, C. & Mendikoetxea, A. (forthcoming 2007). Postverbal subjects at the interfaces in Spanish and Italian learners of L2 English: a corpus analysis. In: Papp, S., Díez, B. and Gilquin, G. (eds). Linking up contrastive and corpus learner research. Rodopi.
Mitchell, R., Marsden, E., Domínguez, L., Arche, M.J. and Myles, F. “Creation and analysis of a Spanish language learner oral corpus (SPLLOC)”. Poster presentation at BAAL LLT SIG Conference “Towards a Researched Pedagogy”, University of Lancaster, 2-3 July.
Mitchell, R., Domínguez, L., Arche, M.J., Myles, F. and Marsden, E. “Developing a CHILDES-based corpus of L2 oral Spanish”. To be presented at Second Language Research Forum, Urbana-Champaign, October.
Myles, F. Linguistic development in classroom learners of French: a cross-sectional study (No. End of ESRC award report R ). Southampton: University of Southampton.
Myles, F. Interlanguage corpora and second language acquisition research. Second Language Research, 21,4: