Download presentation
Presentation is loading. Please wait.
Published byJewel May Modified over 8 years ago
1
VLDB Demo WISE-Integrator: A System for Extracting and Integrating Complex Web Search Interfaces of the Deep Web Hai He, Weiyi Meng, Clement Yu, Zonghuan Wu Department of Computer Science State University of New York at Binghamton meng@cs.binghamton.edu August 2005
2
An Example Complex Search Interface
3
WISE-Integrator: What’s it about? Objective: Integrate the complex interfaces of multiple search engines in the same domain into a unified search interface. Two Main Task: Interface extraction: Extract information needed for interface integration from the source HTML file of the search interface page. Interface integration: Integrate multiple (local) interfaces into a single (global) interface.
4
WISE-Integrator: Architecture
5
WISE-Integrator: What can it do? 1.Load the HTML files of search interfaces. 2.Perform search interface extraction. 3.Generate dictionary of common attribute names from multiple search interfaces. 4.Display extracted attributes and their meta- information. 5.Make corrections to incorrectly extracted information. 6.Save the extracted information in XML format.
6
WISE-Integrator: What can it do? (2) 6.Load/export the XML files of extracted interfaces. 7.Perform interface integration: attribute matching, attribute integration (global attribute name identification, format integration, value integration), unified interface generation. 8.Trim less important attributes. 9.Perform incremental maintenance of the global interface.
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.