Download presentation
Presentation is loading. Please wait.
Published byRudolf Sparks Modified over 8 years ago
1
Using Corpora in TEFL By Terri Yueh
2
WhyWhy Work With Corpora? Why From Vocabulary to Corpus Choosing a Corpus Choosing a Corpus Examples of Word Lists and Concordances Examples of Word Lists and Concordances Collocations & Concordance Collocations Concordance Corpus-informed approach Corpus-informed approach
3
The Ages of Corpus Lexicography The four ages The four ages http://www.macmillandictionary.com/MED- Magazine/February2006/35-Corpora-Tips.htm Word sketching http://www.sketchengine.co.uk/ Word sketching http://www.sketchengine.co.uk/
4
The Brown Corpus of Standard American English was the first of the modern, computer readable, general corpora. It was compiled by W.N. Francis and H. Kucera, Brown University, Providence, RI. The corpus consists of 1,000,000 words of American English texts printed in 1961. The texts for the corpus were sampled from 15 different text categories to make the corpus a good standard reference. Much of its usefulness lies in the fact that the Brown corpus lay-out has been copied by other corpus compilers. The LOB corpus (British English) and the Kolhapur Corpus (Indian English) are two examples of corpora made to match the Brown corpus. The availability of corpora which are so similar in structure is a valuable resourse for researchers interested in comparing different language varieties, for example.
5
URL: http://www.natcorp.ox.ac.uk/ http://www.natcorp.ox.ac.uk/ The British National Corpus (BNC) is a 100 million word collection of samples of written and spoken language from a wide range of sources, designed to represent a wide cross-section of British English from the later part of the 20th century. The BNC World Edition contains 4,054 texts and occupies 1,508,392 Kbytes, or about 1.5 Gb. The written part of the BNC (90%) includes, for example, extracts from regional and national newspapers, specialist periodicals and journals for all ages and interests, academic books and popular fiction, published and unpublished letters and memoranda, school and university essays. The spoken part (10%) includes a large amount of unscripted informal conversation, recorded by volunteers selected from different age, region and social classes in a demographically balanced way, together with spoken language collected in all kinds of different contexts, ranging from formal business or government meetings to radio shows and phone-ins.
6
More Free Corpus Websites Virtual Language Centre (Hong Kong) Virtual Language Centre Web Concordancer & Net Dictionary http://vlc.polyu.edu.hk/Web ConcordancerNet Dictionary http://vlc.polyu.edu.hk/ VIEW: VARIATION IN ENGLISH WORDS AND PHRASES http://view.byu.edu/ VOA's Special English Program Scripts VOA's Special English Program Scripts http://www.manythings.org/voa/sentences.htm Corpora Collections http://devoted.to/corpora
7
References 黃希敏,“ 語料語言學概述,” 敦煌英語教學電子雜誌, Nov. 2004, http://cet.cavesbooks.com.tw/htm/m0131100.htm 語料語言學概述 http://cet.cavesbooks.com.tw/htm/m0131100.htm 黃希敏,“ 語料語言學研究面面觀,” 敦煌英語教學電子雜誌, March 2005, http://cet.cavesbooks.com.tw/htm/m050307.htm 語料語言學研究面面觀 http://cet.cavesbooks.com.tw/htm/m050307.htm The Use of Corpora in Language Studies The Use of Corpora in Language Studies NTNU Virtual Language Center NTNU Virtual Language Center 師大英語搭配線上檢索系統 http://140.122.83.196/~hjchen/colocation2/default.htm http://140.122.83.196/~hjchen/colocation2/default.htm NTNU Concordancer and Collocation Retrieval System http://140.122.83.246/cwb/ http://140.122.83.246/cwb2/ http://140.122.83.246/cwb/ http://140.122.83.246/cwb2/
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.