Push Singh & Tim Chklovski. AI systems need data – lots of it! Natural language processing: Parsed & sense-tagged corpora, paraphrases, translations Commonsense.

Slides:



Advertisements
Similar presentations
Common Sense Reasoning for Interactive Applications MAS 964: Common Sense Reasoning for Interactive Application.
Advertisements

1.Data categorization 2.Information 3.Knowledge 4.Wisdom 5.Social understanding Which of the following requires a firm to expend resources to organize.
Chapter 2. Slide 1 CULTURAL SUBJECT GATEWAYS CULTURAL SUBJECT GATEWAYS Subject Gateways  Started as links of lists  Continued as Web directories  Culminated.
January 12, Statistical NLP: Lecture 2 Introduction to Statistical NLP.
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
Ontologies, Web 2.0 and Beyond Tom Gruber TagCommons.org tomgruber.org.
Machine Learning Case study. What is ML ?  The goal of machine learning is to build computer systems that can adapt and learn from their experience.”
Fungal Semantic Web Stephen Scott, Scott Henninger, Leen-Kiat Soh (CSE) Etsuko Moriyama, Ken Nickerson, Audrey Atkin (Biological Sciences) Steve Harris.
Artificial Intelligence
Visual Information Retrieval Chapter 1 Introduction Alberto Del Bimbo Dipartimento di Sistemi e Informatica Universita di Firenze Firenze, Italy.
Chapter 12: Intelligent Systems in Business
CS580: Building Web Based Information Systems Roger Alexander & Adele Howe The purpose of the course is to teach theory and practice underlying the construction.
Adding Common Sense into Artificial Intelligence Common Sense Computing Initiative Software Agents Group MIT Media Lab.
Personalized Medicine Research at the University of Rochester Henry Kautz Department of Computer Science.
Artificial Intelligence What’s Possible, What’s Not, How Do We Move Forward? Adam Cheyer Co-Founder, VP Engineering Siri Inc.
Common Sense Computing MIT Media Lab Interaction Challenges for Agents with Common Sense Henry Lieberman MIT Media Lab Cambridge, Mass. USA
Lecture #32 WWW Search. Review: Data Organization Kinds of things to organize –Menu items –Text –Images –Sound –Videos –Records (I.e. a person ’ s name,
CSCI 4410 Introduction to Artificial Intelligence.
RuleML-2007, Orlando, Florida1 Towards Knowledge Extraction from Weblogs and Rule-based Semantic Querying Xi Bai, Jigui Sun, Haiyan Che, Jin.
Steps Toward an AGI Roadmap Włodek Duch ( Google: W. Duch) AGI, Memphis, 1-2 March 2007 Roadmaps: A Ten Year Roadmap to Machines with Common Sense (Push.
How We’re Going to Solve the AI Problem Pedro Domingos Dept. Computer Science & Eng. University of Washington.
Author: William Tunstall-Pedoe Presenter: Bahareh Sarrafzadeh CS 886 Spring 2015.
Explanation Facility دكترمحسن كاهاني
Information Systems & Semantic Web University of Koblenz ▪ Landau, Germany Semantic Web - Multimedia Annotation – Steffen Staab
Artificial Intelligence
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
Semantic Learning Instructor: Professor Cercone Razieh Niazi.
WebMining Web Mining By- Pawan Singh Piyush Arora Pooja Mansharamani Pramod Singh Praveen Kumar 1.
1 Search Engines Emphasis on Google.com. 2 Discovery  Discovery is done by browsing & searching data on the Web.  There are 2 main types of search facilities.
Core Concepts of ACCOUNTING INFORMATION SYSTEMS Moscove, Simkin & Bagranoff John Wiley & Sons, Inc. Developed by: Marianne Bradford, Ph.D. Bryant College.
1 CS 2710, ISSP 2610 Foundations of Artificial Intelligence introduction.
1 CSI 5180: Topics in AI: Natural Language Processing, A Statistical Approach Instructor: Nathalie Japkowicz Objectives of.
卓越發展延續計畫分項三 User-Centric Interactive Media ~ 主 持 人 : 傅立成 共同主持人 : 李琳山,歐陽明,洪一平, 陳祝嵩 水美溫泉會館研討會
A Common Ground for Virtual Humans: Using an Ontology in a Natural Language Oriented Virtual Human Architecture Arno Hartholt (ICT), Thomas Russ (ISI),
Student Page Title Introduction Task Process Evaluation Conclusion Credits [Teacher Page]Teacher Page A WebQuest for 5th Grade Science Designed by Aaron.
Course Instructor: K ashif I hsan 1. Chapter # 1 Kashif Ihsan, Lecturer CS, MIHE2.
Intelligent Robot Architecture (1-3)  Background of research  Research objectives  By recognizing and analyzing user’s utterances and actions, an intelligent.
WEB MINING. In recent years the growth of the World Wide Web exceeded all expectations. Today there are several billions of HTML documents, pictures and.
For Monday Read chapter 24, sections 1-3 Homework: –Chapter 23, exercise 8.
For Monday Read chapter 26 Last Homework –Chapter 23, exercise 7.
Generating a Dynamic Gaming Environment Using OMCS David Gottlieb and Josh Juster
WEB 2.0 PATTERNS Carolina Marin. Content  Introduction  The Participation-Collaboration Pattern  The Collaborative Tagging Pattern.
Auckland 2012Kilgarriff: NLP and Corpus Processing1 The contribution of NLP: corpus processing.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
PowerPoint Presentation by Charlie Cook Copyright © 2004 South-Western. All rights reserved. Chapter 5 Business Intelligence and and Knowledge Management.
Commonsense Reasoning in and over Natural Language Hugo Liu, Push Singh Media Laboratory of MIT The 8 th International Conference on Knowledge- Based Intelligent.
1 An Introduction to Computational Linguistics Mohammad Bahrani.
MIT Artificial Intelligence Laboratory — Research Directions The START Information Access System Boris Katz
For Monday Read chapter 26 Homework: –Chapter 23, exercises 8 and 9.
Overview of Statistical NLP IR Group Meeting March 7, 2006.
Artificial Intelligence, simulation and modelling.
Basics of Natural Language Processing Introduction to Computational Linguistics.
Bridging the Generation Gap Through Stories Aro Muttilainen Oliphant Sammander Sen.
COMPUTER SYSTEM FUNDAMENTAL Genetic Computer School INTRODUCTION TO ARTIFICIAL INTELLIGENCE LESSON 11.
CHAPTER ELEVEN MANAGING KNOWLEGE. Objectives We have been through some of this already Role of knowledge management Types of knowledge management systems.
Decision Support and Business Intelligence Systems (9 th Ed., Prentice Hall) Chapter 12: Artificial Intelligence and Expert Systems.
NTT-MIT Collaboration Meeting, 2001Leslie Pack Kaelbling 1 Learning in Worlds with Objects Leslie Pack Kaelbling MIT Artificial Intelligence Laboratory.
Data mining in web applications
Digital Video Library - Jacky Ma.
MAS 964: Common Sense Reasoning for Interactive Applications
Visual Information Retrieval
Search Engines and Search techniques
GOOSE: A Goal-Oriented Search Engine with Commonsense
Chapter 11: Artificial Intelligence
Artificial Intelligence (CS 370D)
Artificial Intelligence and Lisp Lecture 13 Additional Topics in Artificial Intelligence LiU Course TDDC65 Autumn Semester,
Ontologies, Web 2.0 and Beyond
SECOND LANGUAGE LISTENING Comprehension: Process and Pedagogy
The Winograd Schema Challenge Hector J. Levesque AAAI, 2011
Information Retrieval
Presentation transcript:

Push Singh & Tim Chklovski

AI systems need data – lots of it! Natural language processing: Parsed & sense-tagged corpora, paraphrases, translations Commonsense reasoning: Facts, descriptions, scripts, rules, exceptions Computer vision and speech recognition: Segmented images, transcribed speech Robotics: Motion capture data, body configurations

Traditional Sources 1. Knowledge engineering, programming pro: can be high quality cons: - often brittle because lack of coverage - expensive! 2. Learning from raw data pro: there is sometimes a lot of raw data cons: - you have little control over the data - for many tasks the data is not available - hard to learn structured representations

Solution: turn to the general public! There are 500,000,000 people on-line (Nielsen/NetRatings) People can participate by: -Providing labeled training examples (i.e. for OCR) -Tagging corpora (with part-of-speech, word senses) -Verifying and cleaning data (validating assertions) -Supplying rules and examples (assertions, stories) -Evaluating performance of systems (i.e. of face recognizer) -Online supervised learning (i.e. Stork’s Animals) -Organizing and structuring information (i.e. the web)

Successful Distributed Human Projects The Open Directory Project ( -indexes 3,248,314 sites -46,846 editors FreeDB ( -543,786 CDs catalogued Others: The Internet Movie Database ( American Psychological Society (psych.hanover.edu) Distributed Proofreaders (charlz.dns2go.com/gutenberg) NASA Crater marking project (clickworkers.arc.nasa.gov/top)

Open Mind Common Sense Second-largest commonsense database after Cyc: - 410,000 assertions, stories, descriptions, rules, etc. - Built by 8600 users over 1 ½ years - Can extract relations and rules via shallow parsing Basis for several applications and experiments: - ARIA photo annotation and retrieval agent - GOOSE commonsense search engine - MAKEBELIEVE story generator - intelligent camera, analogical reasoner - word sense disambiguator (Henry Lieberman, Hugo Liu, Barbara Barry, Thomas Lin)

The Snowball Effect Two systems that leverage what they already know and give feedback to the contributors Word Sense Disambiguation: - Lets users select what sense a word is used in in a given sentence - Uses collected information to decide where more learning is needed. - Provides feedback on how much an automatic tagger has improved because of your contribution (learner.media.mit.edu/cgi/wsd-collect-tagging.cgi) Learner: - Gathers commonsense knowledge by asking questions that the system thinks may be true - Questions are formed by making analogies, based on existing knowledge (forthcoming, see

The Pyramid of Tasks Core write plug-ins contribute inference rules contribute and verify simple assertions This is because the prior experience required is an inverse pyramid: some lisp / scheme experience, knowledge representation / AI background knowledge rep / ai background or interest, some programming analytical skills, familiarity with reasoning possess common sense

Future Open Mind Projects - Using webcams, thousands of people help teach their computers to recognize the appearance and behavior of various kinds of objects. - A system that reads text on the web, but has people help it comprehend passages. - A dialogue system that people teach how to have a conversation. - Using future cell phones and wearable computers, we could all start to teach computers the patterns of our everyday lives by letting them see and hear us as we actually do things in the world.

Open Questions - How do we get users hooked? - How do we acquire more sophisticated knowledge? - Can we acquire hard-to-articulate knowledge? - How do we use knowledge that is easy to acquire?

Fear, Rejoice, … Fear You have to learn how to build a community of contributors. You have to make your research accessible. Rejoice Unlimited number of urops! Mom and Dad Come to our web site! You can help too. Discoveries If you build it they will come. Malevolent users not a big problem. Recent Built 2 nd largest commonsense database. Useful in prototype applications.