1 Business Proprietary © 2009 Oculus Info Inc. Everyone’s a Critic: Memory Models and Uses for an Artificial Turing Judge W. Joseph MacInnes, Blair C.

Slides:



Advertisements
Similar presentations
Critical Reading Strategies: Overview of Research Process
Advertisements

Sruti Akula (PhD ELE) EFL University Developing academic reading skills through strategy training.
Introduction to: Automated Essay Scoring (AES) Anat Ben-Simon Introduction to: Automated Essay Scoring (AES) Anat Ben-Simon National Institute for Testing.
Data Mining Methodology 1. Why have a Methodology  Don’t want to learn things that aren’t true May not represent any underlying reality ○ Spurious correlation.
2 Information Retrieval System IR System Query String Document corpus Ranked Documents 1. Doc1 2. Doc2 3. Doc3.
1 Latent Semantic Mapping: Dimensionality Reduction via Globally Optimal Continuous Parameter Modeling Jerome R. Bellegarda.
Probabilistic inference in human semantic memory Mark Steyvers, Tomas L. Griffiths, and Simon Dennis 소프트컴퓨팅연구실오근현 TRENDS in Cognitive Sciences vol. 10,
Katrin Erk Distributional models. Representing meaning through collections of words Doc 1: Abdullah boycotting challenger commission dangerous election.
Latent Semantic Analysis
IVITA Workshop Summary Session 1: interactive text analytics (Session chair: Professor Huamin Qu) a) HARVEST: An Intelligent Visual Analytic Tool for the.
DIMENSIONALITY REDUCTION BY RANDOM PROJECTION AND LATENT SEMANTIC INDEXING Jessica Lin and Dimitrios Gunopulos Ângelo Cardoso IST/UTL December
A shot at Netflix Challenge Hybrid Recommendation System Priyank Chodisetti.
IR Models: Latent Semantic Analysis. IR Model Taxonomy Non-Overlapping Lists Proximal Nodes Structured Models U s e r T a s k Set Theoretic Fuzzy Extended.
1 Today  Tools (Yves)  Efficient Web Browsing on Hand Held Devices (Shrenik)  Web Page Summarization using Click- through Data (Kathy)  On the Summarization.
Latent Semantic Analysis (LSA). Introduction to LSA Learning Model Uses Singular Value Decomposition (SVD) to simulate human learning of word and passage.
1 CS 430 / INFO 430 Information Retrieval Lecture 9 Latent Semantic Indexing.
Automatic Collection “Recruiter” Shuang Song. Project Goal Given a collection, automatically suggest other items to add to the collection  Design a process.
Evaluation of Utility of LSA for Word Sense Discrimination Esther Levin, Mehrbod Sharifi, Jerry Ball
A Pattern Matching Method for Finding Noun and Proper Noun Translations from Noisy Parallel Corpora Benjamin Arai Computer Science and Engineering Department.
Artificial Intelligence. Agenda StartEnd Introduction AI Future Recent Developments Turing Test Turing Test Evaluation.
Introduction.  Classification based on function role in classroom instruction  Placement assessment: administered at the beginning of instruction 
Longbiao Kang, Baotian Hu, Xiangping Wu, Qingcai Chen, and Yan He Intelligent Computing Research Center, School of Computer Science and Technology, Harbin.
Essay Writing in Philosophy
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
Automated Essay Grading Resources: Introduction to Information Retrieval, Manning, Raghavan, Schutze (Chapter 06 and 18) Automated Essay Scoring with e-rater.
Revising the comprehension paper Aim To know what you need to do in each section of Paper 2.
NL Question-Answering using Naïve Bayes and LSA By Kaushik Krishnasamy.
Intelligent Database Systems Lab Presenter : WU, MIN-CONG Authors : Jorge Villalon and Rafael A. Calvo 2011, EST Concept Maps as Cognitive Visualizations.
Indices Tomasz Bartoszewski. Inverted Index Search Construction Compression.
Learn to Comment Lance Lebanoff Mentor: Mahdi. Emotion classification of text  In our neural network, one feature is the emotion detected in the image.
CpSc 881: Information Retrieval. 2 Recall: Term-document matrix This matrix is the basis for computing the similarity between documents and queries. Today:
Katrin Erk Vector space models of word meaning. Geometric interpretation of lists of feature/value pairs In cognitive science: representation of a concept.
Shane T. Mueller, Ph.D. Indiana University Klein Associates/ARA Rich Shiffrin Indiana University and Memory, Attention & Perception Lab REM-II: A model.
A Bootstrapping Method for Building Subjectivity Lexicons for Languages with Scarce Resources Author: Carmen Banea, Rada Mihalcea, Janyce Wiebe Source:
Latent Semantic Indexing: A probabilistic Analysis Christos Papadimitriou Prabhakar Raghavan, Hisao Tamaki, Santosh Vempala.
June 5, 2006University of Trento1 Latent Semantic Indexing for the Routing Problem Doctorate course “Web Information Retrieval” PhD Student Irina Veredina.
CONCLUSION & FUTURE WORK Normally, users perform search tasks using multiple applications in concert: a search engine interface presents lists of potentially.
Comparing and Ranking Documents Once our search engine has retrieved a set of documents, we may want to Rank them by relevance –Which are the best fit.
SINGULAR VALUE DECOMPOSITION (SVD)
Collocations and Information Management Applications Gregor Erbach Saarland University Saarbrücken.
1 Sentence Extraction-based Presentation Summarization Techniques and Evaluation Metrics Makoto Hirohata, Yousuke Shinnaka, Koji Iwano and Sadaoki Furui.
Gene Clustering by Latent Semantic Indexing of MEDLINE Abstracts Ramin Homayouni, Kevin Heinrich, Lai Wei, and Michael W. Berry University of Tennessee.
1 CSC 594 Topics in AI – Text Mining and Analytics Fall 2015/16 6. Dimensionality Reduction.
Introduction to String Kernels Blaz Fortuna JSI, Slovenija.
Topic Modeling using Latent Dirichlet Allocation
1 Latent Concepts and the Number Orthogonal Factors in Latent Semantic Analysis Georges Dupret
Computer-assisted essay assessment zSimilarity scores by Latent Semantic Analysis zComparison material based on relevant passages from textbook zDefining.
Jeff Howbert Introduction to Machine Learning Winter Machine Learning Natural Language Processing.
V. Clustering 인공지능 연구실 이승희 Text: Text mining Page:82-93.
1 CS 430: Information Discovery Lecture 11 Latent Semantic Indexing.
UWMS Data Mining Workshop Content Analysis: Automated Summarizing Prof. Marti Hearst SIMS 202, Lecture 16.
Finding document topics for improving topic segmentation Source: ACL2007 Authors: Olivier Ferret (18 route du Panorama, BP6) Reporter:Yong-Xiang Chen.
Link Distribution on Wikipedia [0407]KwangHee Park.
10.0 Latent Semantic Analysis for Linguistic Processing References : 1. “Exploiting Latent Semantic Information in Statistical Language Modeling”, Proceedings.
Traditional vs. Alternative Assessment Assessment is the process of finding out how well students have mastered the curriculum.
1 ICASSP Paper Survey Presenter: Chen Yi-Ting. 2 Improved Spoken Document Retrieval With Dynamic Key Term Lexicon and Probabilistic Latent Semantic Analysis.
Antar Abdellah, PhD Necessary for Conducting Research.
CS791 - Technologies of Google Spring A Web­based Kernel Function for Measuring the Similarity of Short Text Snippets By Mehran Sahami, Timothy.
Vector Semantics Dense Vectors.
Multi-Class Sentiment Analysis with Clustering and Score Representation Yan Zhu.
MDD-Kurs / MDA Cortex Brainware Consulting & Training GmbH Copyright © 2007 Cortex Brainware GmbH Bild 1Ver.: 1.0 How does intelligent functionality implemented.
Automatic Writing Evaluation
Plan for Today’s Lecture(s)
Erasmus University Rotterdam
Entity- & Topic-Based Information Ordering
Vector-Space (Distributional) Lexical Semantics
HCC class lecture 13 comments
CSE 635 Multimedia Information Retrieval
The Winograd Schema Challenge Hector J. Levesque AAAI, 2011
Latent Semantic Analysis
Presentation transcript:

1 Business Proprietary © 2009 Oculus Info Inc. Everyone’s a Critic: Memory Models and Uses for an Artificial Turing Judge W. Joseph MacInnes, Blair C. Armstrong, Dwayne Pare, George S. Cree and Steve Joordens

2 Business Proprietary © 2009 Oculus Info Inc. Turing Test 3 parts to play –Computer Program (AI) –Human Confederate –Judge Is it a test for intelligence? –Likely necessary but not sufficient Unclaimed Loebner prize is evidence that this is still a hard problem for a conversational agent. C?H?

3 Business Proprietary © 2009 Oculus Info Inc. Here Comes the Judge Focus on third party as first step –Can a computer agent function as a Turing Judge? –Language recognition not generation –Can an automated judge tell the difference between human and computer generated conversations? –Human judges may use Grammar Meaningfulness of reply Relatedness of answer ‘Commonsense’ knowledge Task: determine which is human/computer

4 Business Proprietary © 2009 Oculus Info Inc. Rationale and Applications 1.Use of judge as internal critic for language creation/generation for AGI 2.Models of human language comprehension and generation likely include critic Garden path sentences suggest internal critic evaluates a sentence continuously while reading. the cotton clothing is made of… is grown in the South First reading interprets ‘cotton’ as adjective, but later adjust to noun 3.Applications in spam detection and detecting computer generated forum posts for narrow AI.

5 Business Proprietary © 2009 Oculus Info Inc. LSA A1A2A3A4B1B2B3B4 built111 cylons11111 humans111 a111 galactica111 survivors1111 viper111 A1. Humans built the Cylons to make their lives easier. A2. The Cylons did not like doing work for the humans. A3. In a surprise attack, the Cylons destroyed the humans that built them. A4. The Cylons were built by humans to do arduous work. B1. Some survivors escaped and fled on the Galactica. B2. The Galactica protected the survivors using its Viper attack ships. B3. The Cylons were no match for a Viper flown by one of the survivors. B4. A Viper flown by one of the survivors found Earth and led the Galactica there. 1.LSA is a corpus-based statistical method for generating representations that capture aspects of word meaning based on the contexts in which words co-occur. 2.The text corpus is converted into a word x passage matrix, where the passages can be any unit of text (e.g., sentence, paragraph, essay). 3.The elements of the matrix are the frequencies of each target word in each passage The entire matrix is then submitted to singular value decomposition (SVD), the purpose of which is to abstract a lower dimensional (e.g., 300 dimensions) meaning space in which each word is represented as a vector in this compressed space. 4.In addition to computational efficiency, this smaller matrix tends to better emphasize the similarities amongst words. 5.Following the generation of this compressed matrix, new passages can be compared by comparing the cosine of the word vectors the passage contains.

6 Business Proprietary © 2009 Oculus Info Inc. LSA: Example A1A2A3A4B1B2B3B4 built111 cylons11111 humans111 a111 galactica111 survivors1111 viper111 A1. Humans built the Cylons to make their lives easier. A2. The Cylons did not like doing work for the humans. A3. In a surprise attack, the Cylons destroyed the humans that built them. A4. The Cylons were built by humans to do arduous work. B1. Some survivors escaped and fled on the Galactica. B2. The Galactica protected the survivors using its Viper attack ships. B3. The Cylons were no match for a Viper flown by one of the survivors. B4. A Viper flown by one of the survivors found Earth and led the Galactica there. Given the question “The humans built what?” And the possible replies The humans built the Cylons” and “They built the Galactica and the vipers The question overlaps more in space with the first question and would have a higher rating by our judge The effect of SVD is to reduce the dimensionality of the matrix, and discover word similarities which weren’t explicit in the original text (hence the ‘Latent’ part)

7 Business Proprietary © 2009 Oculus Info Inc. Results Artificial Judge Human Judge 1 Human Judge 2 Human Judge 3 Human Judge 4 Hum (.20)Hum (77)Hum (75)Hum (79)Hum (88) Ali (.17)Jab (40)Eug (40)Eug (45)Eug (30) Eug (.12)Eug (35)Ton (30)Jab (20)Ali (10) Jab (.10)Ton (10)Ali (20)Ton (15)Jab (8) Ton (.07)Ali (9)Eug (10)Ali (5)Ton (2) “Humanness” was operationalized as the vector cosine similarity for all of the words in the question relative to all of the words in the answer. Error bars are the standard error of the mean. With the exception of ALICE, humans scored significantly higher than the artificial agents. Rank orderings and performance scores for all judges. Substantial variability in the rank orderings of the different agents by the different human judges There was a higher correlation of agent rankings within human judges (0.83) than with our artificial judge, though the artificial judge did reach a respectable (0.63).

8 Business Proprietary © 2009 Oculus Info Inc. Future work Fom Forbidden Planet content/uploads/2007/12/Adam%20 LevermoreRich%20how%20to%20s pot%20a%20cylon.jpg  Better training data  Live journal ‘conversations’  Better Judge  Topic based, LDA, to try extract the topic or ‘gist’ of a sentence  Pure semantic spaces (LSA) are unlikely sufficient  Better ambitions  Full conversational agent

9 Business Proprietary © 2009 Oculus Info Inc. Potential Fit Response Generation Comprehension Semantics Grammar Context Critic ??