James Christie Automated Marking of Essays for Content.

How long have essays been used for assessment? The earliest recorded use was some 2,000 years BC, by the Chinese, for their administrators!

Essay Definition 1 of 2 … requires a response composed by the examinee, usually in the form of one or more sentences, of a nature that no single response or pattern of responses can be listed as correct, and the accuracy and quality of which can be judged subjectively only by one skilled or informed in the subject, …

Essay Definition 2 of 2 … but even an expert cannot usually classify a response as categorically right or wrong. Rather, there are different degrees of quality or merit which can be recognized. … attributed to Stalnaker, 1951

Possible criteria for automated essay marking
– Ease of creating a scoring schema
– Ability to score on various mark regimes
– Ease of identification of non-scoring elements
– Ease of modification should scoring error(s) occur
– Consistent and reproducible scoring
– Acceptability of results to human markers, essayists, …
– Defensibility
– Accuracy and precision
– Coachability avoidance
– Cost

Model Essay The cat sat on the mat.

Marking Schema
Mark  Item
3     cat sat mat
[max: 3]

Content Data Structure
1 z a a   * 3 0 n 3
2 z f cat * 0 1 n 0
3 z f sat * 0 1 n 0
4 z f mat * 0 1 n 0

Essay Set
ALPHA
  – The cat sat on the mat.
BRAVO
  – The cat sat on the floor.
CHARLIE
  – The dog lay on the floor.
MODEL
  – The cat sat on the mat.

Process interface
What LEVEL of Diagnostics to use [0... 3] : 0
What ESSAY SET to use : catmat
Enter SCHEMA to use : catmat
ALPHA.EXT. BRAVO.EXT. CHARLIE.EXT. MODEL.EXT.
Started on Thursday, February at 15:59:05
Finished on Thursday, February at 15:59:06

Schema Report
Essay set catmat
Schema Report using... catmat
Entities :
Entity ID :
Entity Type : a f f f
Part ID's : z
Essay Name :
ALPHA.EXT:   y y y y
BRAVO.EXT:   _ y y _
CHARLIE.EXT: _ _ _ _
MODEL.EXT:   y y y y
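The y/_ pattern in the Schema Report can be illustrated with a minimal Python sketch. This is not SEAR's implementation: the names `FEATURES`, `schema_row`, `format_row` and `mark` are hypothetical, and the sketch assumes that the type "a" entity is an aggregate that matches only when all three type "f" entities (cat, sat, mat) are found, which is what the BRAVO row suggests. The all-or-nothing `mark` function is likewise an assumption.

```python
import re

# Hypothetical schema: three feature entities ("f") plus one aggregate
# entity ("a") that is ASSUMED to match only when every feature is present.
FEATURES = ["cat", "sat", "mat"]
MAX_MARK = 3

ESSAYS = {
    "ALPHA": "The cat sat on the mat.",
    "BRAVO": "The cat sat on the floor.",
    "CHARLIE": "The dog lay on the floor.",
    "MODEL": "The cat sat on the mat.",
}


def tokenize(text):
    """Return the set of lower-case alphabetic words in the text."""
    return set(re.findall(r"[a-z]+", text.lower()))


def schema_row(text, features=FEATURES):
    """Return [aggregate_hit, feature_hit, ...] as booleans for one essay."""
    words = tokenize(text)
    hits = [feature in words for feature in features]
    return [all(hits)] + hits


def format_row(row):
    """Render a row of booleans in the report's y/_ notation."""
    return " ".join("y" if hit else "_" for hit in row)


def mark(text, features=FEATURES, max_mark=MAX_MARK):
    """ASSUMPTION: award the full mark only when the aggregate entity matches."""
    return max_mark if schema_row(text, features)[0] else 0


if __name__ == "__main__":
    for name, text in ESSAYS.items():
        print(f"{name}.EXT: {format_row(schema_row(text))}")
```

Exact word matching of this kind would not cope with misspellings or synonyms, which is one reason the later slides list spelling errors among the future work.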

Content Report
Essay set catmat
Content Report using... catmat
Essay Name :  Words  Sentences  Usage[%]  Coverage[%]  Part: z[ 3]  Mark[ 3]  %[100]
ALPHA.EXT:
BRAVO.EXT:
CHARLIE.EXT:
MODEL.EXT:
Started on Thursday, February at 15:59:05
Finished on Thursday, February at 15:59:06
Marked 4 file(s): scanned 4 file(s)

Marking Performance
Essay Set   First v Second Markers   Human v SEAR
A           0.704** / 0.700**        0.594** / 0.596**
B           0.810** / 0.740**        0.404** / 0.376**
C           / * / 0.394**
D           N/A                      0.238* / 0.336**
Values are Pearson / Spearman. Significance: ** = 0.01, * = 0.05
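For readers unfamiliar with the two statistics in this table, the following Python sketch shows how a Pearson and a Spearman coefficient can be computed for two sets of marks. It is an illustration only: the function names and the sample marks `human` and `sear` are made up and are not data from the study, and no significance test is included.

```python
import math


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)


def average_ranks(values):
    """Rank values from 1 upward, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the whole run of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average = (i + j) / 2 + 1  # positions are 0-based, ranks 1-based
        for k in range(i, j + 1):
            ranks[order[k]] = average
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman rank correlation: Pearson applied to the average ranks."""
    return pearson(average_ranks(x), average_ranks(y))


if __name__ == "__main__":
    # Made-up marks for six essays, for illustration only.
    human = [12, 15, 9, 18, 14, 7]
    sear = [10, 14, 11, 17, 12, 8]
    print(f"Pearson  : {pearson(human, sear):.3f}")
    print(f"Spearman : {spearman(human, sear):.3f}")
```

Pearson measures linear agreement between the raw marks, while Spearman measures agreement in rank order only, so reporting both shows whether two markers agree on the ordering of essays even where their absolute marks differ.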

Future work [style]
– obtain marked essays for style marking
  – plain ASCII essays using a common set of metrics
  – word-processed essays using a common set of metrics augmented with word-processing based metrics

Future work [content]
– maximise use of active and passive voices
– cope with spelling (and grammar) errors
– increased coverage of Bloom’s Taxonomy
– include non-textual feature(s)
– develop
  – better feedback to the essayist
  – better feedback to the examiner
  – plagiarism detection mechanism(s)

If manual marking equals Da Vinci’s Helical Screw,

then does SEAR equal the first powered flight?

Is the future equal to the ISS?

Is this the future [for style and content]?
English: The cat sat on the mat.
Italian: Il gatto era seduto sullo zerbino.
Greek: I gata ekatse ston kanape.
Russian: Koshka sidit na matrase.
French: Le chat s’est assis sur le tapis.
German: Die Katze sass auf dem Teppich.
Dutch: De kat zat op de mat.
Spanish: El gato se sentó en la alfombra.
etc.

James R Christie
The Robert Gordon University
Faculty of Design and Technology
School of Computing
Room B23b
St Andrew Street, Aberdeen
‘Phone +44 [0] Fax +44 [0] URL