Priya Mathew, Hilary Nesi & Benet Vincent

Corpus from scratch: collecting and processing a sizeable EAP corpus in a (relatively) resource-poor context

Types of DIY corpus:
- Expert writing collected by students: fairly quick and easy; corpus compilation helps students learn more about their own disciplines; can provide good examples for data-driven learning.
- Student writing collected by lecturers: fairly slow and laborious; may contain errors; corpus compilation helps lecturers learn more about disciplinary requirements.
- Student writing compared with expert writing (collected by students or lecturers).

The Middle East College DIY corpus
Created for needs analysis:
- What types of assignments do subject lecturers set?
- What genres of writing do the students produce?
- What do the best students do well, and where are they still having problems?
Created for learning activities:
- Using discipline-specific key words and phrases
- Noticing similarities and differences between their own and expert usage

Context: MEC, Oman
- Largest private college (6000 students)
- Electronics, Civil Engineering, Mechanical Engineering, Computing and Business
- Student population: 90% Omani, 10% international
- Arabic background (8 years of English)
- 1-year foundation before undergraduate course (IELTS 5.5)

Need for writing support post-Foundation
Many students are not able to meet disciplinary writing requirements (feedback from subject lecturers, students and external examiners; student performance)

Centre for Academic Writing at MEC
Supports UG and PG students through:
- workshops
- consultations
- WID (Writing in Disciplines) courses

Initial questions
How to design courses if we don’t know:
- what genres students from different disciplines write (texts need to be categorized into genres)
- the lexicogrammatical features of the different stages of the texts (stages of the texts need to be marked up)
- what subject lecturers value in their students’ written assignments

Creating the Corpus
- Civil Engineering (coursework from 26 modules represented)
- Obtained student consent (Consent Form on Moodle)

Creating the Corpus
- Subject lecturers chose some proficient assignments per module
- Converted texts to XML format
- Texts annotated during the conversion process
<Oxygen/>
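
The conversion step can be pictured with a minimal Python sketch. It assumes each assignment is available as a list of plain-text paragraphs. The element and attribute names (assignment, p, id, module, genre, n) and the sample text are hypothetical and are not taken from the MEC corpus; the slides do not describe the actual schema or annotation scheme.

```python
import xml.etree.ElementTree as ET


def text_to_xml(text_id, module, genre, paragraphs):
    """Wrap plain-text paragraphs in a small XML document.

    All element and attribute names here are hypothetical placeholders,
    not the schema used for the MEC corpus.
    """
    root = ET.Element(
        "assignment", {"id": text_id, "module": module, "genre": genre}
    )
    for number, paragraph in enumerate(paragraphs, start=1):
        # One <p> element per paragraph, numbered from 1.
        p = ET.SubElement(root, "p", {"n": str(number)})
        p.text = paragraph.strip()
    return ET.tostring(root, encoding="unicode")


# Invented sample data, for illustration only.
sample = text_to_xml(
    "CE001",
    "Hypothetical Module",
    "Lab Report",
    ["The aim of this experiment was to ...", "The results show ..."],
)
print(sample)
```

Header attributes such as genre make it possible to build sub-corpora later (for example, all lab reports) without re-reading the texts.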

The MEC Civ. Eng. Corpus
MEC Undergraduate Civil Engineering Programme consists of 8 semesters
Semester: 1, 2, 3, 4, 5, 6, 7
Number of assignments: 10, 12, 22, 41, 15, 23
Number of words: 30200, 23700, 35000, 33600, 68100, 58000, 70000

Genre Analysis
Categorized texts in corpus into genres based on:
- analysis of stages in texts (Nesi and Gardner 2012)
- interviews with subject lecturers
- assignment briefs
- module information guide

MEC Civil Engineering Corpus, by genre
Genre                       No. of assignments   No. of words
Case Study                  34                   13800
Explanation                 27                   88600
Exercise                    14                   18000
Lab Report                  62                   48700
Manual                      2                    11200
Site Investigation Report   5                    14400

Exploiting the corpus: some initial analyses
Data-driven analysis (involving e.g. key words, key terms and n-grams) can be used to suggest pedagogical interventions

Sketch Engine keywords
- Wordforms that are significantly more frequent in the corpus than in a reference corpus
- MEC CE Corpus vs. enTenTen13 (parameter: 1)
- Suggests items / categories that may be worth teaching
- NB: includes some that definitely aren’t!
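
The keyword comparison can be sketched in Python. This assumes that "parameter: 1" refers to the smoothing constant N in the "simple maths" keyness score that Sketch Engine documents: (frequency per million in the focus corpus + N) / (frequency per million in the reference corpus + N). The function names, counts and corpus sizes below are invented for illustration and are not figures from the MEC CE Corpus or enTenTen13.

```python
from collections import Counter


def per_million(count, corpus_size):
    """Normalise a raw count to a frequency per million tokens."""
    return count * 1_000_000 / corpus_size


def keyness(focus_counts, focus_size, ref_counts, ref_size, n=1.0):
    """Rank words by (fpm_focus + n) / (fpm_ref + n), highest first.

    n is the smoothing constant; it is assumed here to be the
    "parameter: 1" mentioned on the slide.
    """
    scores = {}
    for word, count in focus_counts.items():
        focus_fpm = per_million(count, focus_size)
        ref_fpm = per_million(ref_counts.get(word, 0), ref_size)
        scores[word] = (focus_fpm + n) / (ref_fpm + n)
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Toy counts, invented purely for illustration.
focus = Counter({"concrete": 50, "the": 600, "beam": 30})
reference = Counter({"concrete": 20, "the": 60000, "beam": 10})
ranked = keyness(focus, 10_000, reference, 1_000_000)

for word, score in ranked:
    print(f"{word}\t{score:.2f}")
```

A low value of N favours rarer, more specialised words, which is one reason a keyword list can surface items that are not worth teaching alongside those that are.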

Key terms
- Keyword procedure applied to MWIs
- MEC CE Corpus vs. enTenTen13 (parameter: 1)
- Almost all N + N / Adj + N
- Measurement-related terms

4-grams (aka 4-word lexical bundles)
Useful starting point to look at categories such as:
- reference to measurement / location
- reference to visuals
This can reveal common issues
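
Counting 4-grams needs very little code. The Python sketch below uses a deliberately simple tokenizer (lowercased letter and digit runs, with optional apostrophe endings) and lets n-grams run across sentence boundaries; both are simplifying assumptions, and the sample sentences are invented rather than taken from the corpus.

```python
import re
from collections import Counter


def ngrams(text, n=4):
    """Count contiguous n-word sequences in a text."""
    # Simplified tokenizer: lowercased alphanumeric runs, e.g. "don't".
    tokens = re.findall(r"[a-z0-9]+(?:'[a-z]+)?", text.lower())
    return Counter(
        " ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)
    )


# Invented sample text, for illustration only.
sample = (
    "As shown in the figure, the load increases. "
    "As shown in the table, the load decreases."
)
counts = ngrams(sample)

for bundle, frequency in counts.most_common(3):
    print(frequency, bundle)
```

On a real corpus, a minimum frequency and a minimum number of texts per bundle would normally be applied before grouping bundles into categories such as reference to visuals.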

Referring to visuals: teaching material
Lines retrieved using CQL
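
The slides do not show the CQL query itself. As a rough stand-in, the Python sketch below retrieves keyword-in-context lines for references to visuals with a regular expression. It is not CQL: the pattern, the function name and the sample sentences are all assumptions made for illustration.

```python
import re

# Hypothetical pattern: "Figure 2", "Fig. 3", "Table 1.1", any letter case.
VISUAL_REF = re.compile(
    r"\b(?:figure|fig\.|table)\s+\d+(?:\.\d+)?", re.IGNORECASE
)


def concordance(lines, width=30):
    """Return (left context, match, right context) for each hit."""
    hits = []
    for line in lines:
        for match in VISUAL_REF.finditer(line):
            left = line[max(0, match.start() - width):match.start()]
            right = line[match.end():match.end() + width]
            hits.append((left, match.group(0), right))
    return hits


# Invented sample lines, for illustration only.
lines = [
    "The results are shown in Figure 2 below.",
    "Table 1.1 lists the mix proportions.",
    "No visual is mentioned here.",
]
hits = concordance(lines)

for left, node, right in hits:
    print(f"{left:>30} [{node}] {right}")
```

Lines retrieved this way can be sorted by left context so that students see the verbs and prepositions that typically introduce a figure or table.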

Further work to include…
- Keywords of genres (e.g. case study) compared to rest of corpus; probably retrieves different types of keywords
- Comparisons of usage seen in corpus with more expert writing (BAWE Engineering writing, journal writing, textbook writing?) in terms of typical collocates and other phraseological features
- Sharing results with teachers and students