Introduction to Corpus Linguistics ENG 331

Slides:



Advertisements
Similar presentations
School of something FACULTY OF OTHER School of Computing FACULTY OF ENGINEERING A comparative study of the tagging of adverbs in modern English corpora.
Advertisements

Biostatistics Unit 3 Graphs 1. Grouped data Data can be grouped into a set of non- overlapping, contiguous intervals called class intervals (Excel calls.
Using Corpus Tools in Discourse Analysis Discourse and Pragmatics Week 12.
Creating a Similarity Graph from WordNet
Adverbs An adverb modifies a verb, an adjective, or another adverb.
Corpus Linguistics: session 2 Corpus Linguistics (2): The Tools of the Trade 669o4zt
Corpus Linguistics Case study 2 Grammatical studies based on morphemes or words. G Kennedy (1998) An introduction to corpus linguistics, London: Longman,
TITLE: Multiplying/Dividing by Negative Numbers Mr. P. Collins X Copy & Complete this multiplication table...
The Eight Parts of Speech With Baseball Evan Gilman and Clayton Wilms.
U3A Computing Beginners Class Leader – Brian Moore Week 9 of 10 weeks. Mondays 4:15 to 5:45 pm **** Last Class on 2/12/2013***
7 th CRCT Language Arts Sentence Construction. Which sentence contains a compound subject? A. Sandy handed in her science project. B. Janice and Betty.
PHRASES & CLAUSES AND WHY COMMAS ARE IMPORTANT!. WORD CLASSES Every word in the English language belongs to a “class”. It will be one of the following:
Mining the Web to Create Minority Language Corpora Rayid Ghani Accenture Technology Labs - Research Rosie Jones Carnegie Mellon University Dunja Mladenic.
An adverb is a …. part of speech. They are used to describe how, …, where,…., …., and …. Something happens. Adverbs.
Guessing Meaning from Context More practice by Jonathan Smith.
Introduction to Linguistics Ms. Suha Jawabreh Lecture # 2.
Today we will be learning: Subject pronouns in English.
Phrases Week 3 Grammar.
Corpus search What are the most common words in English
Prepositional Phrases (Adjective & Adverb Phrases) Learning Target: I can identify prepositional, adjectival, and adverbial phrases and diagram sentences.
ACADEMIC ENGLISH WRITING SKILLS Mdm Siti Aisyah binti Akiah Faculty of Science, Technology & Human Development.
Subordinate Adverbial Clauses. Subordinate Clauses  A clause is a group of words with a subject and a verb.  A subordinate clause cannot stand alone.
ENGLISH is a language Learning mode of ENGLISH Subject Language(Spoken) Literature Competition.
Making trouble-free corpus tasks in 10 minutes Jennie Wright.
XAIRA is an XML Aware Indexing and Retrieval Architecture ● Developed from the British National Corpus Sara program, it provides: – platform-independent.
TRUE or FALSE? Syntax= the order of words in a sentence.
Using language corpora in developing Arabic lessons & syllabuses
INTRODUCTION TO DATABASES (MICROSOFT ACCESS)
The Simple Corpus Tool Martin Weisser Research Center for Linguistics & Applied Linguistics Guangdong University of Foreign Studies
Prepositions: Day 1 1/20.
Introduction to Corpus Linguistics
Statistical NLP: Lecture 7
ALE161 國際行銷英文簡報技巧 International Marketing Presentation Techniques
Access Tutorial 3 Maintaining and Querying a Database
AN EXAMPLE OF A VERY MEDIOCRE TO POOR ASSIGNMENT
Searching corpora.
AntConc is a freeware, multiplatform of application suitable for all types of users
ALE161 國際行銷英文簡報技巧 International Marketing Presentation Techniques
Exploring the BNC Corpus
Corpus Linguistics I ENG 617
Introduction to Access 2007
Introduction to Corpus Linguistics: Exploring Collocation
Introduction to Corpus Linguistics: Applications Lexicography
Topics in Linguistics ENG 331
Corpus Linguistics I ENG 617
Introduction to Corpus Linguistics: Key Word Analysis
Corpus Linguistics I ENG 617
Corpus Linguistics I ENG 617
Introduction to Corpus Linguistics: Colligation
Language, Mind, and Brain by Ewa Dabrowska
Bite-size TD: using wordandphrase.info/academic (with students)
Nouns Nouns not noun noun noun not not
11B word order of phrasal verbs
A CORPUS-BASED STUDY OF COLLOCATIONS OF HIGH-FREQUENCY VERB —— MAKE
Topics in Linguistics ENG 331
Corpus-Based ELT CEL Symposium Creating Learning Designers
Corpus Linguistics I ENG 617
Topics in Linguistics ENG 331
Business English January 22, 2018
FIRST SEMESTER GRAMMAR
Topics in Linguistics ENG 331
What part of speech is that word?
Introduction to Text Analysis
An Investigation into the Developmental Features of Chinese EFL Learners’ Use of Amplifier Collocations Wang Haihua School of Foreign Languages Dalian.
Corpus processing tools
A Vocabulary Review Activity
BYU COCA: CORPUS OF CONTEMPORARY AMERICAN ENGLISH
Assignment 3 Querying and Maintaining a Database
New Perspectives on Microsoft
Problem Statement An experiment is conducted by an Agricultural engineer to test the effect of soil type and nutrient on the yield of Strawberry plants.
Presentation transcript:

Introduction to Corpus Linguistics ENG 331 Rania Al-Sabbagh Department of English Faculty of Al-Alsun (Languages) rsabbagh@alsun.asu.edu.eg Week 4

COCA: Comparing Words 1 To differentiate near synonyms, COCA uses the ‘compare’ function. It displays the collocations of each word sorted by frequency. It also displays the raw frequency of the first word to the second word. For example, comparing ‘steady’ as an adjective to ‘stable’ as an adjective yields the following table. The table shows the difference that ‘stable’ means unchanged, while ‘steady’ is used Week 4

Comparing Words in BYU Corpora 2 Week 4

Comparing Words in BYU Corpora 3 The first line in the ‘steady’ table reads as follows: The raw frequency of word 1 – ‘steady’ – with ‘pace’ is 218. Yet, the raw frequency of word 2 – ‘stable’ – with ‘pace’ is 0. The third column is the ratio of word 1 to word 2 and it reads as follows: there are 436.0 times as many cases of steady pace as there are stable pace. Remember: you can always guarantee more accurate results by adding the part of speech to each of your queries. Week 4

COCA: Finding Collocations 1 To find the collocations of a given word, you can use the ‘collocate’ function. For example, the top collocations of remind_v* are: Week 4

COCA: Finding Collocations 2 You can refine your collocation search by looking for collocations in a specific part of speech. Suppose that we want to find the adverbial collocations of the verb remind, then your query should look like: The top 3 adverbs collocating with remind as a verb are: just, how, and also. Week 4

COCA: Finding Collocations 3 In the ‘collocates’ functions there is the ribbon below. What does it stand for? This is the window size – i.e. the search space in which the engine tries to find collocations. It is meant to find both adjacent and non-adjacent collocations. Adjacent collocations are the ones that immediately precede or follow your query word. They are usually inseparable. In this case, you need to set the window size to ±1. An example of adjacent collocations is at hand and kick the bucket. Non-adjacent collocations are the ones that can be separated by one or more words such as give up. Week 4

COCA: Finding Collocations 4 Looking for the adjacent left-hand collocation of remind_v* yields: Looking for the adjacent right-hand collocation of remind_v* yields: Week 4