The Sketch Engine www.sketchengine.co.uk. -What is The Sketch Engine? -What is a corpus? -Looking at the BASE and the BAWE corpora. -How can this help you with ELT?

Presentation transcript:

The Sketch Engine

Today -What is The Sketch Engine? -What is a corpus? -Looking at the BASE and the BAWE corpora. -How can this help you with ELT?

What is The Sketch Engine? “A language tool for anyone interested in how words behave” -A corpus query tool -Can reveal grammar patterns in language -Web-based corpora in 60+ languages -Easy to use

What is The Sketch Engine? - Used for lexicography at: OUP, CUP, Collins, Macmillan…

What is a corpus? -Corpus: Latin for ‘body’ (plural: corpora) -Large, computer-searchable bodies of language. -BASE: British Academic Spoken English -BAWE: British Academic Written English -Target varieties -Advanced learners -Hilary Nesi and Paul Thompson

How can this help you with ELT? -Sketch Engine in action… -Concordancer, Word Sketch, Sketch Diff -Thesaurus “Automatic collocations dictionary” forbetterenglish.com -WebBootCaT

Let’s investigate… - How does the English word enjoy behave? –What meaning(s)? –Grammar? –Collocations?

Findings… - 3 million hits –Plenty of evidence - Lemmatised –enjoy enjoys enjoying enjoyed - Start with a sample

Getting more out of our corpus - Sort –on right context
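
To make the idea of sorting on the right context concrete, here is a minimal Python sketch of a keyword-in-context (KWIC) concordance. It is only an illustration, not how The Sketch Engine works internally: the text is invented, the function name `kwic` is hypothetical, and the word forms of enjoy are listed by hand rather than produced by a lemmatiser.

```python
import re

def kwic(text, forms, width=3):
    """Return (left, node, right) tuples for every token found in `forms`."""
    tokens = re.findall(r"[a-z']+", text.lower())
    lines = []
    for i, tok in enumerate(tokens):
        if tok in forms:
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            lines.append((left, tok, right))
    return lines

# Invented toy text; a real corpus would hold millions of words.
TEXT = ("We enjoyed the view. She enjoys a good meal. "
        "They enjoy breakfast on the terrace. I enjoy a good meal too.")

# Hand-listed forms of the lemma "enjoy" (an assumption standing in for lemmatisation).
ENJOY_FORMS = {"enjoy", "enjoys", "enjoying", "enjoyed"}

concordance = kwic(TEXT, ENJOY_FORMS)

# Sort on the right context so that lines with the same following words sit together.
by_right = sorted(concordance, key=lambda line: line[2])

for left, node, right in by_right:
    print(f"{left:>20}  {node:<8}  {right}")
```

After sorting, the two lines followed by "a good meal" end up next to each other, which is what makes recurring patterns easier to spot by eye.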

Any patterns? - Recurring themes –Do you spot any? –Same concordance again:

What do we enjoy? - breakfast, dining, snack - Object is often ‘meal’ - Scroll on down, looking for more patterns - To see more context, click on word –What is a furry friend?

Too much data? - So much data to look through - Can the computer help? –yes

Grammar - We can cluster the objects

Enjoy - Views, benefits, meals, rides –Good summary of the things we enjoy - Scrolling down …

Patterns - ing_comp –Strong pattern for enjoy –click on any item in Word Sketch to get more information ing_comp: hike
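
As a rough illustration of what an -ing complement pattern captures, the sketch below uses a regular expression to find a form of enjoy followed directly by a word ending in -ing. This is an assumption-laden toy, not the grammar The Sketch Engine uses: the sentences are invented, and a plain regex will also match non-verbs such as "morning", which a tagged corpus would avoid.

```python
import re
from collections import Counter

# A form of "enjoy" followed by whitespace and a word ending in -ing.
ING_COMP = re.compile(r"\benjoy(?:s|ed|ing)?\s+(\w+ing)\b", re.IGNORECASE)

# Invented example sentences.
TEXT = ("We enjoy hiking in the hills. She enjoyed reading the report. "
        "They enjoy hiking too. He enjoys a good meal.")

# Count how often each -ing word follows "enjoy".
counts = Counter(word.lower() for word in ING_COMP.findall(TEXT))

for word, freq in counts.most_common():
    print(word, freq)
```

On this toy text, "hiking" is counted twice and "reading" once, while "enjoys a good meal" is correctly ignored.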

Thesaurus - ‘Distributional Thesaurus’ - Which word shares most collocates with target word? - What will come top for grandfather?
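
The question "which word shares most collocates with the target word?" can be sketched in a few lines of Python. Everything here is hypothetical: the collocate sets are invented for illustration, the function name `most_similar` is made up, and the score is a plain count of shared collocates. Real tools typically weight collocates statistically rather than just counting them.

```python
# Invented collocate sets, for illustration only.
COLLOCATES = {
    "grandfather": {"late", "paternal", "maternal", "die", "tell", "old"},
    "grandmother": {"late", "paternal", "maternal", "die", "tell", "knit"},
    "uncle": {"late", "maternal", "rich", "visit"},
    "clock": {"old", "tick", "wall"},
}

def most_similar(target, collocates):
    """Rank every other word by how many collocates it shares with `target`."""
    scores = []
    for word, colls in collocates.items():
        if word == target:
            continue
        shared = len(colls & collocates[target])
        scores.append((word, shared))
    # Highest overlap first; ties broken alphabetically.
    return sorted(scores, key=lambda pair: (-pair[1], pair[0]))

ranking = most_similar("grandfather", COLLOCATES)
for word, shared in ranking:
    print(word, shared)
```

With these made-up sets the ranking is grandmother, then uncle, then clock, but that result is built into the toy data and says nothing about what a real corpus would show.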

“Sketch Diff” Verbs they are subject of (middle column) –What grandmothers do –What they both do –What grandfathers do
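
The three-way layout of a Sketch Diff (only one word, both, only the other) maps naturally onto set operations. The following is a minimal sketch under stated assumptions: the verb lists are invented, the function name `sketch_diff` is hypothetical, and a real comparison would be based on corpus frequencies rather than simple set membership.

```python
# Invented lists of verbs each noun is the subject of, for illustration only.
GRANDMOTHER_VERBS = {"knit", "bake", "die", "tell", "live"}
GRANDFATHER_VERBS = {"fight", "build", "die", "tell", "live"}

def sketch_diff(a, b):
    """Split two collocate sets into (only in a, shared, only in b)."""
    return sorted(a - b), sorted(a & b), sorted(b - a)

only_gm, both, only_gf = sketch_diff(GRANDMOTHER_VERBS, GRANDFATHER_VERBS)

print("grandmother only:", only_gm)
print("both:            ", both)
print("grandfather only:", only_gf)
```

The middle list corresponds to the shared column on the slide, with the two outer lists holding the verbs that distinguish each word.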

What corpora? - Public ones - Your own

Public corpora - Many languages –Show # more corpora button – 220+ corpora –60+ languages

Your own corpora -WebBootCaT: a corpus-building tool -Students can work on a topic close to their hearts -interests, an assignment

Thank you day free trial LLAS_0113 Facebook.com/SketchEngine Twitter.com/SketchEngine YouTube.com/TheSketchEngine