Labels: automation Adam Kilgarriff. Kivik 2013Kilgarriff / Labels: automation2 Which words are:  Most distinctive of business English? Keywords, already.

Slides:



Advertisements
Similar presentations
Terminology-finding in the Sketch Engine Miloš Jakubíček, Adam Kilgarriff, Vojtěch Kovář, Pavel Rychlý, Vit Suchomel Lexical Computing Ltd., Brighton,
Advertisements

Finding multiwords of more than two words Adam Kilgarriff, Pavel Rychly, Vojtech Kovar, Vıt Baisa Lexical Computing Ltd; Masaryk Univ., Cz.
Corpus Processing and NLP
WebBootCaT usage Adam Kilgarriff Lexical Computing Ltd.
Using Corpus Tools in Discourse Analysis Discourse and Pragmatics Week 12.
1 Corpora for all Adam Kilgarriff Lexical Computing Ltd Lexicography MasterClass Ltd Universities of Leeds and Sussex.
Sorting CMSC 201. Sorting In computer science, there is often more than one way to do something. Sorting is a good example of this!
Measuring Distance between Language Varieties Adam Kilgarriff, Jan Pomikalek, Pavel Rychly, Vit Suchomel Supported by EU Project PRESEMT.
Linking Dictionary and Corpus Adam Kilgarriff Lexicography MasterClass Ltd Lexical Computing Ltd University of Sussex UK.
1 Corpora for the coming decade Adam Kilgarriff. Dublin June 2009 Kilgarriff: Corpora for the coming decade2 How should they be different?  Bigger 
HOW TO USE A FRENCH DICTIONARY
Using Corpora for Teaching Chinese Dr. Adam Kilgarriff Lexical Computing Ltd Leeds University UK.
The Sketch Engine -What is The Sketch Engine? -What is a corpus? -Looking at the BASE and the BAWE corpora. -How can this help.
Making useful wordlists for ELT Topical vocabulary from the WWW Simon Smith & Scott Sommers Ming Chuan University, Taipei Adam Kilgarriff, Lexical Computing.
Morris LeBlanc.  Why Image Retrieval is Hard?  Problems with Image Retrieval  Support Vector Machines  Active Learning  Image Processing ◦ Texture.
Today Listening test Corpus linguistics talk, Part 3 News task NEOs Life on Mars.
Talking about your homework News story? –What made you choose…? One of your words? –What made you choose…? (Give your vocabulary books to another student.
1 Corpora for the coming decade Adam Kilgarriff Lexical Computing Ltd.
Stages of Learning To learn a paired-associate list, 3 tasks must be accomplished: 1. Stimulus Discrimination: There must be something distinctive about.
Today Writing: using the comma –Writing task Corpus linguistics talk, Part 2 Re-organize groups –Group news discussion.
The Project AH Computing. Functional Requirements  What the product must do!  Examples attractive welcome screen all options available as clickable.
Simple Maths for Keywords Adam Kilgarriff Lexical Computing Ltd.
Albert Gatt LIN 3098 Corpus Linguistics. In this lecture Some more on corpora and grammar Construction Grammar as a theoretical framework Collostructional.
Labels: automation Adam Kilgarriff. Auckland 2012Kilgarriff / Labels: automation2 Which words are:  Most distinctive of business English?  Most often.
Using Corpora for Teaching Chinese Dr. Adam Kilgarriff Lexical Computing Ltd Leeds University UK.
1 Corpora, Language Technology and Maltese Adam Kilgarriff Lexical Computing Ltd Lexicography MasterClass Ltd University of Sussex.
Terminology, translation, and PRESEMT; word frequency lists and KELLY 1 Adam Kilgarriff Lexical Computing Ltd SKEW-2, March 2011Kilgarriff: PRESEMT and.
GDEX: Automatically finding good dictionary examples in a corpus Adam Kilgarriff, Miloš Husák, Katy McAdam, Michael Rundell, Pavel Rychlý Lexical Computing.
Finite State Automata and Tries Sambhav Jain IIIT Hyderabad.
1 Corpora, Language Technology and Maltese Adam Kilgarriff Lexical Computing Ltd Lexicography MasterClass Ltd University of Sussex.
Why We Need Corpora and the Sketch Engine Adam Kilgarriff Lexical Computing Ltd, UK Universities of Leeds and Sussex.
Corpora by Web Services Adam Kilgarriff Lexical Computing Ltd Lexicography MasterClass Ltd Universities of Leeds and Sussex.
Test 2 Review The test will consist of 3 sections. Section 1 is vocabulary matching. Section 2 is a Rate Per 100 problem. Section 3 is a Unit Rate Problem.
Using a Lemmatizer to Support the Development and Validation of the Greek WordNet Harry Kornilakis 1, Maria Grigoriadou 1, Eleni Galiotou 1,2, Evangelos.
1 Using Corpora in Language Research -also Introduction to the Sketch Engine (WS15) part 1 Adam Kilgarriff Lexical Computing Ltd Universities of Leeds.
Exploring Text: Zipf’s Law and Heaps’ Law. (a) (b) (a) Distribution of sorted word frequencies (Zipf’s law) (b) Distribution of size of the vocabulary.
1 Evaluating word sketches and corpora Adam Kilgarriff Lexical Computing Ltd Lexicography MasterClass Ltd Universities of Leeds and Sussex.
Corpus Evaluation Adam Kilgarriff Lexical Computing Ltd Corpus evaluationPortsmouth Nov
Upgrading to SQL Server 2000 Kashef Mughal. Multiple Versions SQL Server 2000 supports multiple versions of SQL Server on the same machine It does that.
Using Corpora in Language Research Adam Kilgarriff Lexical Computing Ltd Universities of Leeds January 2013Adam Kilgarriff.
Malta, May 2010Kilgarriff: Corpora by Web Services1 Corpora by Web Services Adam Kilgarriff Lexical Computing Ltd Lexicography MasterClass Ltd Universities.
Terminology-finding in the Sketch Engine Miloš Jakubíček, Adam Kilgarriff, Vojtěch Kovář, Pavel Rychlý, Vit Suchomel Lexical Computing Ltd., Brighton,
CL 2005, Birmingham Web as Corpus Workshop Intro: Adam Kilgarriff 1 Web as Corpus Workshop Co-chairs: Marco Baroni Adam Kilgarriff Sebastian Hoffman.
ACTIVE V PASSIVE VERBS Tutorial.
Subcorpus configuration Adam Kilgarriff. Feb 2010Kilgarriff: IWSG: Subcorpora2 “you can’t get away from genre” Bonnie Weber, Keynote Lecture ICON (Indian.
Auckland 2012Kilgarriff: NLP and Corpus Processing1 The contribution of NLP: corpus processing.
Before anything can be built, constructed, or manufactured with any degree of accuracy, it must be drawn first. ©Emil Decker, 2009.
Corpus Linguistics MOHAMMAD ALIPOUR ISLAMIC AZAD UNIVERSITY, AHVAZ BRANCH.
GDEX: Automatically finding good dictionary examples in a corpus Auckland 2012Kilgarriff: GDEX1.
Exploring Variation in Lexis and Genre in the Sketch Engine Adam Kilgarriff Lexical Computing Ltd., UK Supported by EU Project PRESEMT.
1 Word senses: a computational response Adam Kilgarriff.
How to copy and paste. BY Zachary Hamer. Step One First find the document or writing passage you want to copy and paste. Then RIGHT click on the item.
1 SQL Chapter 9 – 8 th edition With help from Chapter 2 – 10 th edition.
GDEX: Automatically finding good dictionary examples in a corpus Kivik 2013Kilgarriff: GDEX1.
Making trouble-free corpus tasks in 10 minutes Jennie Wright.
Use of Concordancers A corpus (plural corpora) – a large collection of texts, written or spoken, stored on a computer. A concordancer – a computer programme.
Setting up a remote office connection September 2011 Nick Maxwell.
Word 2013 REVIEW AND LOOK AT RIBBONS USING MORE TEMPLATES FREE TRAINING INFORMATION FROM MICROSOFT.
GDEX: Automatically finding good dictionary examples in a corpus.
Using the Correct Order Template Copy the presentation to your hard drive. Open the slides using slide sorter and copy slides #3, 4, and 5 for each question.
Evaluating word sketches and corpora
Japanese・にほんご・nihongo
WHAT IS VERB??!.
SPARC’s innovation DIARIES
Verbs.
  30 A 30 B 30 C 30 D 30 E 77 TOTALS ORIGINAL COUNT CURRENT COUNT
Label Name Label Name Label Name Label Name Label Name Label Name
Multiplication Grids.
Hosted by Type your name here
(Type Answer Here) (Type Answer Here) (Type Answer Here)
Presentation transcript:

Labels: automation Adam Kilgarriff

Kivik 2013Kilgarriff / Labels: automation2 Which words are:  Most distinctive of business English? Keywords, already shown  Most often passive?

Kivik 2013Kilgarriff / Labels: automation3 Common issue for lexicographers  Ordinary cases No need to say anything in dictionary  Extreme cases (“most X”) Needs saying

Kivik 2013Kilgarriff / Labels: automation4 Not hard in principle  Given the right corpus For each word  Count, under condition 1 Eg plural instances  Count, under condition 2 Eg all instances  Compute ratio Sort all words according to ratio Words at top of list are most X

Kivik 2013Kilgarriff / Labels: automation5 In practice  Programming task  Big corpora: big and slow  Slightly different each time  Very rarely done (except Keywords in WordSmith)  Now: in Sketch Engine

Kivik 2013Kilgarriff / Labels: automation6 FindX specification file =passive name HR passives human-readable name (optional) WS passive use the word sketch relation ‘passive’ RE -v$ only for items matching this RegExp (here: only verbs) (optional)

Kivik 2013Kilgarriff / Labels: automation7 PercentilRatioLemposFreq base-v station-v destine-v poise-v doom-v situate-v schedule-v associate-v entitle-v embed-v couple-v jail-v deem-v arm-v design-v clothe-v flank-v confine-v dedicate-v compose-v convict-v ally-v age-v attach-v gear-v levy-v elect-v found-v award-v3103 Find X