AQUAINT Mid-Year Workshop: Observations and Comments Jimmy Lin MIT Artificial Intelligence Laboratory.

Slides:



Advertisements
Similar presentations
Berliner XML Tage. Humboldt Universität zu Berlin, Oktober 2004 SWEB2004 – Intl Workshop on Semantic Web Technologies in Electronic Business Intelligent.
Advertisements

Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
GMD German National Research Center for Information Technology Darmstadt University of Technology Perspectives and Priorities for Digital Libraries Research.
Application Service Provision of Semantic Annotation, Aggregation, Indexing and Routing of Textual, Multimedia, and Multilingual Web Content Esperonto.
Knowledge Management and Engineering David Riaño.
Haystack: Per-User Information Environment 1999 Conference on Information and Knowledge Management Eytan Adar et al Presented by Xiao Hu CS491CXZ.
Natural Language Processing Group Department of Computer Science University of Sheffield, UK IR4QA: An Unhappy Marriage Mark A. Greenwood.
Slide 1 Web-Base Management Systems Aaron Brown and David Oppenheimer CS294-7 February 11, 1999.
The Web of data with meaning... By Michael Griffiths.
CS652 Spring 2004 Summary. Course Objectives  Learn how to extract, structure, and integrate Web information  Learn what the Semantic Web is  Learn.
Web Information Retrieval and Extraction Chia-Hui Chang, Associate Professor National Central University, Taiwan
Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
Question Answering using Language Modeling Some workshop-level thoughts.
Web Information Retrieval and Extraction Chia-Hui Chang, Associate Professor National Central University, Taiwan Sep. 16, 2005.
WebMiningResearch ASurvey Web Mining Research: A Survey By Raymond Kosala & Hendrik Blockeel, Katholieke Universitat Leuven, July 2000 Presented 4/18/2002.
IST NeOn-project.org The Semantic Web is growing… #SW Pages Lee, J., Goodwin, R. (2004) The Semantic.
SmartSQL AlfaTech Software Solutions Application Requirements Document  Radi Bekker  Vladimir Goldman  Marina Shaevich  Alexander Shapiro Team Members:
An Agent-Oriented Approach to the Integration of Information Sources Michael Christoffel Institute for Program Structures and Data Organization, University.
Extracting Semistructured Information from the Web J. Hammer, H. Garcia-Molina, J. Cho, R. Aranha, A. Crespo from Stanford University Presented by: Wei.
A Brief Survey of Web Data Extraction Tools (WDET) Laender et al.
1 Information Integration and Source Wrapping Jose Luis Ambite, USC/ISI.
AQUAINT Kickoff Meeting – December 2001 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.
Lowell 2003 Challenges Alon Y. Halevy University of Washington.
JAVELIN Project Briefing 1 AQUAINT Year I Mid-Year Review Language Technologies Institute Carnegie Mellon University Status Update for Mid-Year Program.
Challenges in Information Retrieval and Language Modeling Michael Shepherd Dalhousie University Halifax, NS Canada.
Tomek Strzalkowski & Sharon G. Small ILS Institute, SUNY Albany LAANCOR May 22, 2010 (Tacitly) Collaborative Question Answering Utilizing Web Trails 5/22/10.
A Tripartite Question Answering Architecture for Integrating Diverse Knowledge Resources Boris Katz, Gary Borchardt, Sue Felshin and Jimmy Lin MIT Computer.
WebMining Web Mining By- Pawan Singh Piyush Arora Pooja Mansharamani Pramod Singh Praveen Kumar 1.
XML & Mediators Thitima Sirikangwalkul Wai Sum Mong April 10, 2003.
Linked-data and the Internet of Things Payam Barnaghi Centre for Communication Systems Research University of Surrey March 2012.
Abstract Question answering is an important task of natural language processing. Unification-based grammars have emerged as formalisms for reasoning about.
Carnegie Mellon School of Computer Science Copyright © 2001, Carnegie Mellon. All Rights Reserved. JAVELIN Project Briefing 1 AQUAINT Phase I Kickoff December.
1 Lessons from the TSIMMIS Project Yannis Papakonstantinou Department of Computer Science & Engineering University of California, San Diego.
Chapter 3 DECISION SUPPORT SYSTEMS CONCEPTS, METHODOLOGIES, AND TECHNOLOGIES: AN OVERVIEW Study sub-sections: , 3.12(p )
AQUAINT 18-Month Workshop 1 Light Semantic Processing for QA Language Technologies Institute, Carnegie Mellon B. Van Durme, Y. Huang, A. Kupsc and E. Nyberg.
Developing “Geo” Ontology Layers for Web Query Faculty of Design & Technology Conference David George, Department of Computing.
Knowledge Representation of Statistic Domain For CBR Application Supervisor : Dr. Aslina Saad Dr. Mashitoh Hashim PM Dr. Nor Hasbiah Ubaidullah.
Next Generation Search Engines Ehsun Daroodi 1 Feb, 2003.
WEB MINING. In recent years the growth of the World Wide Web exceeded all expectations. Today there are several billions of HTML documents, pictures and.
Querying Web Data – The WebQA Approach Author: Sunny K.S.Lam and M.Tamer Özsu CSI5311 Presentation Dongmei Jiang and Zhiping Duan.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. How valuable is medical social media data? Content analysis of the medical web Presenter :Tsai Tzung.
For Monday Read chapter 24, sections 1-3 Homework: –Chapter 23, exercise 8.
For Monday Read chapter 26 Last Homework –Chapter 23, exercise 7.
Information Integration BIRN supports integration across complex data sources – Can process wide variety of structured & semi-structured sources (DBMS,
Trustworthy Semantic Webs Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #4 Vision for Semantic Web.
Summary Knowledge Bases from Web are Real, Big & Useful: Entities, Classes & Relations Key Asset for Intelligent Applications: Semantic Search, Question.
Shridhar Bhalerao CMSC 601 Finding Implicit Relations in the Semantic Web.
Data Integration Hanna Zhong Department of Computer Science University of Illinois, Urbana-Champaign 11/12/2009.
Scalable Hybrid Keyword Search on Distributed Database Jungkee Kim Florida State University Community Grids Laboratory, Indiana University Workshop on.
Faculty Faculty Richard Fikes Edward Feigenbaum (Director) (Emeritus) (Director) (Emeritus) Knowledge Systems Laboratory Stanford University “In the knowledge.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Automatic Metadata Discovery from Non-cooperative Digital Libraries By Ron Shi, Kurt Maly, Mohammad Zubair IADIS International Conference May 2003.
For Friday Finish chapter 23 Homework –Chapter 23, exercise 15.
MIT Artificial Intelligence Laboratory — Research Directions The START Information Access System Boris Katz
Relevance Models and Answer Granularity for Question Answering W. Bruce Croft and James Allan CIIR University of Massachusetts, Amherst.
Semantic Data Extraction for B2B Integration Syntactic-to-Semantic Middleware Bruno Silva 1, Jorge Cardoso 2 1 2
1 Question Answering and Logistics. 2 Class Logistics  Comments on proposals will be returned next week and may be available as early as Monday  Look.
For Monday Read chapter 26 Homework: –Chapter 23, exercises 8 and 9.
Jon Juett April 21,  Selected very recent papers  Includes some student level event / conference papers  UM Health Counseling Program  Correctly.
AQUAINT Mid-Year PI Meeting – June 2002 Integrating Robust Semantics, Event Detection, Information Fusion, and Summarization for Multimedia Question Answering.
Information Retrieval and Web Search
Information Retrieval and Web Search
What is IR? In the 70’s and 80’s, much of the research focused on document retrieval In 90’s TREC reinforced the view that IR = document retrieval Document.
Web IR: Recent Trends; Future of Web Search
Traditional Question Answering System: an Overview
Zachary Cleaver Semantic Web.
CSE 635 Multimedia Information Retrieval
Web Mining Department of Computer Science and Engg.
Social Abstractions for Information agents
Information Retrieval
Presentation transcript:

AQUAINT Mid-Year Workshop: Observations and Comments Jimmy Lin MIT Artificial Intelligence Laboratory

QA and IR Question Answering as an extension of Information Retrieval. Document Retrieval Passage Retrieval Factoid Question Answering + NLP Technology

QA and Databases Question Answering as an extension of Database Systems. Question Answering as knowledge brokering. Why? Databases often provide better answers. Lots of valuable structured/semi-structured resources available on the Web. Rich body of literature to capitalize on.

Cross Fertilization Early “classic” works: BASEBALL [Green et al., 61] LIFER [Hendrix et al., 77] LUNAR [Woods et al., 72] Plenty of research on integration of semistructured data: ARANEUS [Atzeni et al., 1997] ARIADNE [Knoblock et al., 1999] Information Maniford [Kirk et al., 1995] TSIMMIS [Hammer et al., 1997]

Advantages for Analysts Multimedia access. Knowledge Discovery. Better control of source quality and verification. Uniform coverage of a domain.

START A natural language interface to heterogeneous data sources on the Web. [Katz 97; Katz 02]

START and Ominbase Omnibase is a “virtual” database system that integrates heterogeneous, semistructured data. START: natural language queries  structured Omnibase queries. Omnibase executes these queries by Fetching the relevant pages. Extracting the relevant fragments. START performs additional generation.

Current Focus Streamline knowledge integration: Better authoring tools. Smarter parsers. “Conceptual Segmentation.” Wrapper Induction via Machine Learning. Semantic Web: RDF is a rich source of metadata.

Combining the Approaches Complementary Approaches: QA as DB: suitable to handling database-like queries. QA as IR: general purpose solution. More Info: Boris Katz and Jimmy Lin. START, Omnibase, and Beyond. AQUAINT Mid-Year Workshop. Jimmy Lin. The Web as a Resource for Question Answering: Perspectives and Challenges. LREC’2002. Boris Katz et. al. Omnibase: Uniform Access to Heterogenous Data for Question Answering.