MIT Artificial Intelligence Laboratory — Research Directions The START Information Access System Boris Katz

Slides:



Advertisements
Similar presentations
START: Natural Language Access to Information Boris Katz, Gary Borchardt, Sue Felshin, Jimmy Lin, Jerome McFarland, Ali Ibrahim, Luciano Castagnola, Baris.
Advertisements

T Seminar on Multimedia Metadata Management Hannu Järvinen
Basic IR: Queries Query is statement of user’s information need. Index is designed to map queries to likely to be relevant documents. Query type, content,
1 CS 502: Computing Methods for Digital Libraries Lecture 20 Multimedia digital libraries.
Week 23 - Revision1 Week 23 Revision DSA. Week 23 - Revision2 Agenda Section A: Multiple choice Section B: Problem-oriented questions Topics for revision.
WebMiningResearch ASurvey Web Mining Research: A Survey By Raymond Kosala & Hendrik Blockeel, Katholieke Universitat Leuven, July 2000 Presented 4/18/2002.
World Wide Web1 Applications World Wide Web. 2 Introduction What is hypertext model? Use of hypertext in World Wide Web (WWW) – HTML. WWW client-server.
Web Mining Research: A Survey
CGI and Perl MSc Publishing on the WWW. What is CGI ? (1) User Buying and selling Playing games Customised web pages Developer Means to run external programs.
Internet – Part II. What is the World Wide Web? The World Wide Web is a collection of host machines, which deliver documents, graphics and multi-media.
WWW and Internet The Internet Creation of the Web Languages for document description Active web pages.
Computer Software By: Richard Tyler Walker. The Two Kinds of Software Are: Systems Software, which includes operating systems, utilities, and device drivers.
Lecture-8/ T. Nouf Almujally
The Internet & The World Wide Web Notes
New “Collaborate” Button Integrate UI directly into the browser. Preferred target: Firefox Easiest browser to extend in terms of UI.
1 Introduction to Web Development. Web Basics The Web consists of computers on the Internet connected to each other in a specific way Used in all levels.
Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization.
Information Retrieval – and projects we have done. Group Members: Aditya Tiwari ( ) Harshit Mittal ( ) Rohit Kumar Saraf ( ) Vinay.
Chapter 16 The World Wide Web Chapter Goals ( ) Compare and contrast the Internet and the World Wide Web Describe general Web processing.
Chapter 16 The World Wide Web Chapter Goals Compare and contrast the Internet and the World Wide Web Describe general Web processing Describe several.
Research paper: Web Mining Research: A survey SIGKDD Explorations, June Volume 2, Issue 1 Author: R. Kosala and H. Blockeel.
Universität Stuttgart Universitätsbibliothek Information Retrieval on the Grid? Results and suggestions from Project GRACE Werner Stephan Stuttgart University.
Chapter 16 The World Wide Web. 2 The Web An infrastructure of information combined and the network software used to access it Web page A document that.
16-1 The World Wide Web The Web An infrastructure of distributed information combined with software that uses networks as a vehicle to exchange that information.
CPS120: Introduction to Computer Science The World Wide Web Nell Dale John Lewis.
Web Search Created by Ejaj Ahamed. What is web?  The World Wide Web began in 1989 at the CERN Particle Physics Lab in Switzerland. The Web did not gain.
CS621 : Seminar-2008 DEEP WEB Shubhangi Agrawal ( )‏ Jayalekshmy S. Nair ( )‏
Integrating Educational Technology into the Curriculum
Chapter 6 The World Wide Web. Web Pages Each page is an interactive multimedia publication It can include: text, graphics, music and videos Pages are.
Information Need Question Understanding Selecting Sources Information Retrieval and Extraction Answer Determina tion Answer Presentation This work is supported.
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
Artificial intelligence project
Computing Fundamentals Module Lesson 19 — Using Technology to Solve Problems Computer Literacy BASICS.
TOPIC CENTRIC QUERY ROUTING Research Methods (CS689) 11/21/00 By Anupam Khanal.
2007. Software Engineering Laboratory, School of Computer Science S E Web-Harvest Web-Harvest: Open Source Web Data Extraction tool 이재정 Software Engineering.
Search Engine Architecture
Searching the web Enormous amount of information –In 1994, 100 thousand pages indexed –In 1997, 100 million pages indexed –In June, 2000, 500 million pages.
4 1 SEARCHING THE WEB Using Search Engines and Directories Effectively New Perspectives on THE INTERNET.
Database Management Systems.  Database management system (DBMS)  Store large collections of data  Organize the data  Becomes a data storage system.
4 Transforming Our Use of Images in ELT. MEXTESOL’s OBJECTIVES * Promote the professional development of English teachers * Assist teachers in updating.
Service Service metadata what Service is who responsible for service constraints service creation service maintenance service deployment rules rules processing.
Next Generation Search Engines Ehsun Daroodi 1 Feb, 2003.
WEB MINING. In recent years the growth of the World Wide Web exceeded all expectations. Today there are several billions of HTML documents, pictures and.
For Monday Read chapter 24, sections 1-3 Homework: –Chapter 23, exercise 8.
For Monday Read chapter 26 Last Homework –Chapter 23, exercise 7.
Introduction to Information Retrieval Aj. Khuanlux MitsophonsiriCS.426 INFORMATION RETRIEVAL.
Computing Fundamentals Module Lesson 6 — Using Technology to Solve Problems Computer Literacy BASICS.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Module: Software Engineering of Web Applications Chapter 2: Technologies 1.
Overviews of the Library of Texas & ZLOT Project Dr. William E. Moen Principal Investigator.
Study Guide Saul Mendiola EDTC 6343 Competency 001 G & H.
Toward Semantic Search: RDFa based facet browser Jin Guang Zheng Tetherless World Constellation.
11/23/00UNU/IAS/UNL Centre1 The Universal Networking Language United Nations University Institute of Advanced Studies United Networking Language ® UNU/IAS.
For Monday Read chapter 26 Homework: –Chapter 23, exercises 8 and 9.
Semantic Wiki: Automating the Read, Write, and Reporting functions Chuck Rehberg, Semantic Insights.
Internet Searching the World Wide Web. The Internet and the World Wide Web The Internet is a worldwide collection of networks that allows people to communicate.
Using Technology to Solve Problems Unit 2 Mod 2 SO 7.
September 2003, 7 th EDG Conference, Heidelberg – Roberta Faggian, CERN/IT CERN – European Organization for Nuclear Research The GRACE Project GRid enabled.
AQUAINT Mid-Year Workshop: Observations and Comments Jimmy Lin MIT Artificial Intelligence Laboratory.
(class #2) CLICK TO CONTINUE done by T Batchelor.
Lecture-6 Bscshelp.com. Todays Lecture  Which Kinds of Applications Are Targeted?  Business intelligence  Search engines.
Trends in NL Analysis Jim Critz University of New York in Prague EurOpen.CZ 12 December 2008.
Visual Information Retrieval
Information Retrieval and Web Search
Search Engine Architecture
Information Retrieval and Web Search
Search Engine Architecture
Computer Literacy BASICS
Website A website is a collection of web pages (documents that are accessed through the Internet) When someone gives you their web address, it generally.
Presentation transcript:

MIT Artificial Intelligence Laboratory — Research Directions The START Information Access System Boris Katz

MIT Artificial Intelligence Laboratory — Research Directions Finding information on line Two Approaches: 1. Keyword search (search engines, e.g., AltaVista) 2. Natural language processing The Problem:

MIT Artificial Intelligence Laboratory — Research Directions What’s Wrong with Keyword Search?

MIT Artificial Intelligence Laboratory — Research Directions What’s Right About Natural Language Processing?

MIT Artificial Intelligence Laboratory — Research Directions What’s Wrong with Natural Language Processing (today)? 1. Too hard Full-text NL understanding still beyond reach Intersentential reference Paraphrasing Summarization Common sense implication 2. Too slow 3. Not all information is language Most Web resources are not textual Maps and Images Sound and Video Multimedia Web resources are distributed across numerous non-traditional databases

MIT Artificial Intelligence Laboratory — Research Directions START (SynTactic Analysis using Reversible Transformations) provides multimedia information access using natural language. Natural language Natural language is human language. You don’t have to learn a special language to use START. Ask your questions in English; enter information using English. Multimedia access using natural language annotations START lets you use English to access any kind of information: text, pictures, movies, and more. “Just the right information” START gives you the answer you want without including a thousand others. Virtual collaboration START retrieves information from its own knowledge base and from databases all over the Web. What is START?

MIT Artificial Intelligence Laboratory — Research Directions Natural language is human language. You don’t have to learn a special language to use START. Ask your questions in English; enter information using English Natural Language

MIT Artificial Intelligence Laboratory — Research Directions START lets you use English to access any kind of information: text, pictures, movies, and more. Multimedia Access Using Natural Language Annotations

MIT Artificial Intelligence Laboratory — Research Directions START gives you the answer you want without including a thousand other answers. Just the Right Information

MIT Artificial Intelligence Laboratory — Research Directions START retrieves information from its own knowledge base and from databases all over the Web. Virtual Collaboration

MIT Artificial Intelligence Laboratory — Research Directions Bridge the gap between our ability to analyze natural language sentences and other information and our desire to access the huge amount of data now available on the Web. Annotations are collections of natural language sentences and phrases that describe the content of various information segments. START analyzes these annotations creates the necessary representational structures produces special pointers to the information segments summarized by the annotations. Natural Language Annotations

MIT Artificial Intelligence Laboratory — Research Directions START Server START Server START Server Document Natural Language Annotations Annotation Information Provider Information Seeker (negotiation) (submitted) (retrieved) Xxx xx xxxx xx xx xxxxx x xxx Xxx xx xx xxx “Neptune was discovered using mathematics.” + Document Xxx xx xxxx xx xx xxxxx x xxx Xxx xx xx xxx Question “How was Neptune discovered?” START Server

MIT Artificial Intelligence Laboratory — Research Directions HPKB POTUS Fortune500 Uniform Access START NL questions Multimedia responses Omnibase Queries Data Local knowledge base of ternary expressions Core vocabulary Uniform interface to multiple database formats (Web, text, etc.) Extended lexicon U.S. Census IMDb

MIT Artificial Intelligence Laboratory — Research Directions How START Works Web browser START Parser Matcher English Database of T-exps T-exps from KB Generator HTML English Annotations Scripts Native knowledge Omnibase (external knowledge) Scripts WWW Potus IMDb World Factbook U.S. Census Input T-exps

MIT Artificial Intelligence Laboratory — Research Directions