Automated Essay Scoring The IntelliMetric® Way

Slides:



Advertisements
Similar presentations
Chapter 09 AI techniques in different game genres (Puzzle/Card/Shooting)
Advertisements

Introduction to: Automated Essay Scoring (AES) Anat Ben-Simon Introduction to: Automated Essay Scoring (AES) Anat Ben-Simon National Institute for Testing.
SEARCHING QUESTION AND ANSWER ARCHIVES Dr. Jiwoon Jeon Presented by CHARANYA VENKATESH KUMAR.
Australian Document Computing Conference Dec Information Retrieval in Large Organisations Simon Kravis Copyright 2010 Fujitsu Limited.
Confidential and Proprietary. Copyright © 2010 Educational Testing Service. All rights reserved. Catherine Trapani Educational Testing Service ECOLT: October.
Requirements Specification
Technical Writing II Acknowledgement: –This lecture notes are based on many on-line documents. –I would like to thank these authors who make the documents.
Chapter 6 Building Vocabulary This multimedia product and its contents are protected under copyright law. The following are prohibited by law: –any public.
National 5 History Final Exam Outline.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Drew DeHaas.
Artificial Intelligence
Artificial Intelligence. Agenda StartEnd Introduction AI Future Recent Developments Turing Test Turing Test Evaluation.
INTELLIGENT SYSTEMS Artificial Intelligence Applications in Business.
Introduction.  Classification based on function role in classroom instruction  Placement assessment: administered at the beginning of instruction 
Second Language Writers and The Machine Scoring of Essays Deborah Crusan Wright State University.
Automated Essay Evaluation Martin Angert Rachel Drossman.
Assessment Tools. Contents Overview Objectives What makes for good assessment? Assessment methods/Tools Conclusions.
Term Paper English III Prepared by Jenny MacDonald.
An English Proficiency Test for Today’s Student Using Today’s Technology Marcie Mealia,
Search Engines and Information Retrieval Chapter 1.
Artificial Intelligence: Its Roots and Scope
Chapter 6 Supplement Knowledge Engineering and Acquisition Chapter 6 Supplement.
Artificial Intelligence Introductory Lecture Jennifer J. Burg Department of Mathematics and Computer Science.
Knowledge Representation and Indexing Using the Unified Medical Language System Kenneth Baclawski* Joseph “Jay” Cigna* Mieczyslaw M. Kokar* Peter Major.
Intelligent Database Systems Lab Presenter : WU, MIN-CONG Authors : Jorge Villalon and Rafael A. Calvo 2011, EST Concept Maps as Cognitive Visualizations.
Fundamentals of Information Systems, Third Edition2 Principles and Learning Objectives Artificial intelligence systems form a broad and diverse set of.
1 Automatic Essay Scoring is Here and Now Online Welcome to CIT S234 Gary Greer University of Houston Downtown & Michelle Overstreet The College Board.
Assessment Tools.
I Robot.
Copyright © 2015 Pearson Education Ltd.
Source : The Problem Learning and innovation skills increasingly are being recognized as the skills that separate students who are.
4-6 Copyright © 2015 Pearson Education, Inc. 6-1 Employee Testing Employee Testing and Selection.
Performance Based Assessment. What is Performance Based Assessment? PBA is a form of assessment that requires students to perform a task rather than an.
Spring, 2005 CSE391 – Lecture 1 1 Introduction to Artificial Intelligence Martha Palmer CSE391 Spring, 2005.
Pastra and Saggion, EACL 2003 Colouring Summaries BLEU Katerina Pastra and Horacio Saggion Department of Computer Science, Natural Language Processing.
Copyright © 2007 Pearson Education, inc. or its affiliates. All rights reserved. 1 Evaluation of Distributed Scoring with the Texas Assessment of Knowledge.
Welcome Parents! FCAT Information Session. O Next Generation Sunshine State Standards O Released Test Items O Sample Test.
Assessment and the Institutional Environment Context Institutiona l Mission vision and values Intended learning and Educational Experiences Impact Educational.
Decision Support and Business Intelligence Systems (9 th Ed., Prentice Hall) Chapter 12: Artificial Intelligence and Expert Systems.
The Rocket Science of Score Points Holistic Scoring and the New Jersey HSPA Writing Assessment.
AQA Religious Philosophical & Ethical Studies
Automatic Writing Evaluation
SAT.
Welcome SNHS Work Session October 28, 2014 Facilitators:
Acknowledgement: Khem Gyawali
Computerizing a Test Using a Learning Management System
What is a CAT? What is a CAT?.
Friday September 9, 2016 Today you will need…. Pencils Novel
Accountability Solution
An assessment framework for Intrusion Prevention System (IPS)
Test Based on Response There are two kinds of tests based on response. They are subjective test and objective test. 1. Subjective Test Subjective test.
The Writing Test College Prep Alabama
Web Services and Application of Multi-Agent Paradigm for DL
ASSESSMENT OF STUDENT LEARNING
Contracting Officer Podcast Slides
Preparing for the Verbal Reasoning Measure
& Anglophone Writing Assessments
Created by Lori Sekhr 2016 South Lake High School
Presentation 王睿.
Gail E. Tompkins California State University, Fresno
Data Warehousing and Data Mining
Testing Writing Rio Darmasetiawan
Accountability Solution
Deputy Commissioner Jeff Wulfson Associate Commissioner Michol Stapel
SPOKEN LANGUAGE Higher English.
Designing Your Performance Task Assessment
Language in the Media Lesson 2.
Information Retrieval
AP U.S. History Exam Details
AP World History “The Exam”
Presentation transcript:

Automated Essay Scoring The IntelliMetric® Way Past, Present, Future The IntelliMetric® Way Paul Edelblut Vantage Learning Newtown, PA USA Copyright 2005-2008 Vantage Technologies Knowledge Assessment, L.L.C.

Brief History of Essay Scoring… AI Based Time Human level scoring 2000 Machine Learning NLP 1990 1980 Surface Features LSA 1970 1960 Complex Simple

NCLB – The Lightning Rod For Change Marketing Team

Why is IntelliMetric Important? Makes it feasible to use authentic open ended assessment Supports the basic premise that if you want to students to learn to write, have them write Provides immediate feedback Is more accurate than human scoring, particularly compared to local and single scorer models Eliminates reporting lag time Reduces assessment costs

Background of IntelliMetric™ Essay Scoring IntelliMetric has been used successfully for over 8 years to accurately score essays and, more recently, short-answer items as well Capable of marking papers written in English as well as many other languages Uses Artificial Intelligence to learn how to score essays Trained with a set of known papers Creates a unique solution for each prompt

IntelliMetric Approach Blend of proprietary Artificial Intelligence Techniques Learning Engine CogniSearch™;Quantum Reasoning™ Internalizing the pooled wisdom of many expert scorers Evaluates more than 450 Syntactic, semantic and discourse level features

Text Parser (Syntax analysis, Feature Extraction) Essay Files Prior Knowledge Base (16 million word Concept Net, 500,000 word vocabulary) Computational Analysis Text Preprocessor (Prepare text for processing) Judge 2 Judge 1 IntelliMetric ™ Final Score Judge N

Sample results of IntelliMetric™ scoring in global operations English Chinese (Mandarin) Bahasa Malay Hebrew Holistic D1 D2 D3 D4 Exact 86% 51% 46% 40% 44% 50% Adjacent 100% 43% 90% 92% 96% Discrepant 0% 6% 10% 8% 4% 14% Pearson .96 .65 .83 .81 .84 .86 .72

Conclusions and Discussion The multi-judge approach of IntelliMetric™ applies more accurate scoring than the Vantage Bayesian Statistical models “In part because individual judgment is not accurate enough or consistent enough, cognitive diversity is essential to good decision making.”—James Surowiecki, in The Wisdom of the Masses As found with expert human scoring, multiple people are more accurate than one Future research will include a comparison of IntelliMetric with other types of AES models

What Can’t IntelliMetric do? Score poetry, creative writing, humor, music Limited utility in a paper-based exam Replace humans entirely Create World Peace

Closing Thoughts What if the computerized scoring does not match the human scoring? Three hypotheses: a) the computer is wrong, b) the human is wrong or, c) they are both wrong. From our informal research reviewing mismatching papers, we have found that there is about an equal likelihood that the computer is wrong as the human is wrong, with a slight edge going in the computer’s favor.

“I write because I don’t know what I think until I read what I wrote.” Closing thought for the day “I write because I don’t know what I think until I read what I wrote.” --Flannery O’Connor American Essayist and Author

ã 2005 Vantage Technologies The information provided by Vantage Learning in this document/presentation is proprietary and confidential. This document or other information discussed in this meeting shall not be duplicated, used, disclosed or distributed in whole or in part to any third parties.