Lowell 2003 Challenges Alon Y. Halevy University of Washington.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Languages & Inference Appropriate layering Do we need a logic? Do we need Description Logic? Legacy data; database storage vs inference Tolerant/anytime.
Company Confidential 1 © 2005 Nokia DBUpgradeTool_ ppt / / JMa A Database Upgrade Tool Nokia Networks Jukka Maaranen.
Idaho National Engineering and Environmental Laboratory Drill Down! The INEEL Docu-Search provides extensive searching capabilities Von Crofts Interlab.
GLOCAL Event-based Retrieval of Networked Media NEM Concertation Meeting Brussels, Feb
Page 1 Integrating Multiple Data Sources using a Standardized XML Dictionary Ramon Lawrence Integrating Multiple Data Sources using a Standardized XML.
Information Retrieval in Practice
PR-OWL: A Framework for Probabilistic Ontologies by Paulo C. G. COSTA, Kathryn B. LASKEY George Mason University presented by Thomas Packer 1PR-OWL.
T.Sharon - A.Frank 1 Internet Resources Discovery (IRD) Classic Information Retrieval (IR)
A Web of Concepts Dalvi, et al. Presented by Andrew Zitzelberger.
Information Retrieval Concerned with the: Representation of Storage of Organization of, and Access to Information items.
Web Information Retrieval and Extraction Chia-Hui Chang, Associate Professor National Central University, Taiwan Sep. 16, 2005.
Integrating data sources on the World-Wide Web Ramon Lawrence and Ken Barker U. of Manitoba, U. of Calgary
What is the Internet? The Internet is a computer network connecting millions of computers all over the world It has no central control - works through.
Semantics For the Semantic Web: The Implicit, the Formal and The Powerful Amit Sheth, Cartic Ramakrishnan, Christopher Thomas CS751 Spring 2005 Presenter:
IELM 230: Industrial Data Systems Course topics: - Relational Database Design - DB development and optimized usage - DB backed web-applications.
1 Information Integration and Source Wrapping Jose Luis Ambite, USC/ISI.
Overview of Search Engines
Databases & Data Warehouses Chapter 3 Database Processing.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Enterprise & Intranet Search How Enterprise is different from Web search What to think about when evaluating Enterprise Search How Intranet use is different.
Search Engines and Information Retrieval Chapter 1.
1 CS 430 Database Theory Winter 2005 Lecture 1: Introduction.
 DATABASE DATABASE  DATABASE ENVIRONMENT DATABASE ENVIRONMENT  WHY STUDY DATABASE WHY STUDY DATABASE  DBMS & ITS FUNCTIONS DBMS & ITS FUNCTIONS 
DBrev: Dreaming of a Database Revolution Gjergji Kasneci, Jurgen Van Gael, Thore Graepel Microsoft Research Cambridge, UK.
LIS618 lecture 1 Thomas Krichel economic rational for traditional model In olden days the cost of telecommunication was high. database use.
Practical Project of the 2006 Joint International Master’s Degree.
Thanks to Bill Arms, Marti Hearst Documents. Last time Size of information –Continues to grow IR an old field, goes back to the ‘40s IR iterative process.
XML (with a bias towards query language issues) A boring research topic? A new frontier? A means to keep standards people busy? Prepared by S. Abiteboul.
StockWatch Developers: Nimrod Hagay Hagai Barkan Supervisors: Assaf Solomovitch Viktor Kulikov June 2009.
The Data Ring: Community Content Sharing Serge Abiteboul (INRIA) Alkis Polyzotis (UC Santa Cruz)
1 Lessons from the TSIMMIS Project Yannis Papakonstantinou Department of Computer Science & Engineering University of California, San Diego.
System Initialization 1)User starts application. 2)Client loads settings. 3)Client loads contact address book. 4)Client displays contact list. 5)Client.
Page 1 Alliver™ Page 2 Scenario Users Contents Properties Contexts Tags Users Context Listener Set of contents Service Reasoner GPS Navigator.
2007. Software Engineering Laboratory, School of Computer Science S E Web-Harvest Web-Harvest: Open Source Web Data Extraction tool 이재정 Software Engineering.
Problems in Semantic Search Krishnamurthy Viswanathan and Varish Mulwad {krishna3, varish1} AT umbc DOT edu 1.
Uwe SchindlerGES 2007 – May 2-4, 2007 Data Information Service based on Open Archives Initiative Protocols and Apache Lucene Uwe Schindler 1, Benny Bräuer.
Database Environment Chapter 2. Data Independence Sometimes the way data are physically organized depends on the requirements of the application. Result:
Building a Topic Map Repository Xia Lin Drexel University Philadelphia, PA Jian Qin Syracuse University Syracuse, NY * Presented at Knowledge Technologies.
Intro: 1 What is a Database? Collection of Dynamic Data –Large Large of yesteryear now fits on a PC (small DBs) Many applications require even more (terabytes,
XML and Its Applications Ben Y. Zhao, CS294-7 Spring 1999.
Information Integration BIRN supports integration across complex data sources – Can process wide variety of structured & semi-structured sources (DBMS,
Data Integration Hanna Zhong Department of Computer Science University of Illinois, Urbana-Champaign 11/12/2009.
ACIS Introduction to Data Analytics & Business Intelligence Database s Benefits & Components.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
Toward Semantic Search: RDFa based facet browser Jin Guang Zheng Tetherless World Constellation.
© 1997 UW CSE 10/27/97H-1 Embedded SQL and the CLI Chapter 7.7.
Yahoo! BOSS Open up Yahoo!’s Search data via web services Developer & Custom Tracks Big Goal – If you’re in a vertical and you perform a search, you should.
Lecture 15: Query Optimization. Very Big Picture Usually, there are many possible query execution plans. The optimizer is trying to chose a good one.
Welcome to CPSC 534B: Information Integration Laks V.S. Lakshmanan Rm. 315.
Integrated Departmental Information Service IDIS provides integration in three aspects Integrate relational querying and text retrieval Integrate search.
1 Personalizing Search via Automated Analysis of Interests and Activities Jaime Teevan, MIT Susan T. Dumais, Microsoft Eric Horvitz, Microsoft SIGIR 2005.
Harnessing the Deep Web : Present and Future -Tushar Mhaskar Jayant Madhavan, Loredana Afanasiev, Lyublena Antova, Alon Halevy January 7,
UCSD Neuron-Centered Database
Data and Applications Security Developments and Directions
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
External Services & Frameworks
Ishan Sharma Abhishek Mittal Vivek Raj
Web IR: Recent Trends; Future of Web Search
Lecture 16: Probabilistic Databases
Announcements Project 2’s due date is moved to Tuesday 8/3/04
DBMS with probabilistic model
موضوع پروژه : بازیابی اطلاعات Information Retrieval
CSE 635 Multimedia Information Retrieval
International Marketing and Output Database Conference 2005
Information Integration
Query Optimization.
Context-Aware Internet
Data Independence Applications insulated from how data is structured and stored. Logical data independence: Protection from changes in logical structure.
Fading Schemas… Alon Y. Halevy.
Presentation transcript:

Lowell 2003 Challenges Alon Y. Halevy University of Washington

The KR/DB Impedance Mismatch We often feel that knowledge and inference would be helpful. E.g., semantic query optimization, web search, data integration, meta-data management, wrapper construction. To date, most KR/DB work has been quite theoretical (data models, query containment, integrity constraint reasoning). Why haven ’ t database systems benefited more from KR technology? One explanation: there is an impedance mismatch: Classical KR systems provide pure logical reasoning. DB applications often need knowledge to rank multiple plausible answers/plans/transformations …

Going Forward The good news from KR: Probabilistic reasoning answers questions such as “ what is the probability of X? ”, With machine learning it is possible to create models from data, rather than brittle hand-crafting. The DB community needs to define the needs (API) Example: Self-tuning systems Too many knobs to set. Took several years to understand a small subset (index selection). We need a giant rule set, but the rules need to handle uncertainty. Need to learn this rule set from data. Other applications: web querying (focused crawling), user interfaces, schema matching, and lifelong personal data management.

Lifelong Personal Data Management Save all the stuff I ever care(d) about (contacts, grades, boy scout assignments, stock portfolio, files, talks, restaurant reviews) Challenges: Our schema evolves (radically) over time. Data management systems change constantly. Our focus of attention changes. Find data in your information attic: find the photo of the girl next door from 8 th grade. Need to combine text and structured data, and make it accessible to everyone.