1
Google’s Deep Web Crawler
- Dev Patel
2
The Deep Web
There is a big difference between the Dark Web and the Deep Web
The Dark Web is a small part of the Deep Web
The Deep Web consists of content hidden behind HTML forms
3
Deep Web Approaches
Vertical search engine
Surfacing
4
HTML Form Processing
5
Surfacing
Goal: achieving good content coverage on individual sites while limiting the number of form submissions
Selecting an appropriate set of query templates
Selecting appropriate input values
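The two selection problems on this slide can be illustrated with a minimal sketch. It assumes a query template is simply a subset of a form's inputs, and that a template's submissions are the Cartesian product of the candidate values for those inputs. The function names, the `max_dims` cap, and the sample form values are hypothetical and are not taken from the presentation.

```python
from itertools import combinations, product


def query_templates(inputs, max_dims):
    """Yield every subset of input names with at most max_dims inputs."""
    for size in range(1, max_dims + 1):
        for template in combinations(sorted(inputs), size):
            yield template


def submissions(template, values):
    """Yield one dict per form submission for the given template.

    Each submission assigns one candidate value to every input in the
    template (the Cartesian product of the candidate value lists).
    """
    for combo in product(*(values[name] for name in template)):
        yield dict(zip(template, combo))


# Hypothetical form with three inputs and a few candidate values each.
values = {"make": ["ford", "honda"], "zip": ["94043", "10001"], "color": ["red"]}
templates = list(query_templates(values, 2))
```

The number of submissions grows multiplicatively with each input added to a template, which is why limiting both the template set and the value sets matters for the goal stated above.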
6
Challenges with Surfacing
Finding appropriate values
Forms have more than one value
7
Contributions
Informativeness test
Algorithm that efficiently traverses the query templates
Algorithm for predicting input values
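The slide names an informativeness test without defining it. One plausible reading, sketched below, is that a template is informative when its submissions return sufficiently distinct result pages. The page signature (a hash of whitespace-normalized, lowercased text) and the 0.25 threshold are assumptions made for illustration, not details stated in the presentation.

```python
import hashlib


def signature(page_text):
    """Return a content signature for a result page (assumed: hash of normalized text)."""
    normalized = " ".join(page_text.lower().split())
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def is_informative(result_pages, threshold=0.25):
    """Return True if the share of distinct result pages reaches the threshold.

    result_pages holds the text returned by each submission of one template.
    The threshold value is an illustrative assumption.
    """
    if not result_pages:
        return False
    distinct = len({signature(page) for page in result_pages})
    return distinct / len(result_pages) >= threshold
```

Under this reading, a template whose submissions all land on the same error or "no results" page fails the test, so no further submissions are spent on it.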
8
ISIT
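This slide gives only the name. Assuming ISIT is the template-traversal algorithm listed under Contributions, and that it searches incrementally from small templates to larger ones, a rough sketch could look like the following. The function name, the `template_is_informative` callback, and the `max_dims` cap are hypothetical.

```python
def incremental_search(inputs, template_is_informative, max_dims=3):
    """Return informative templates, growing only from informative ones.

    template_is_informative is a caller-supplied predicate that takes a
    tuple of input names and returns True or False.
    """
    inputs = sorted(inputs)
    frontier = [(name,) for name in inputs]  # start with single-input templates
    informative = []
    for _ in range(max_dims):
        kept = [t for t in frontier if template_is_informative(t)]
        informative.extend(kept)
        # Extend each informative template by one input it does not yet use.
        extended = set()
        for template in kept:
            for name in inputs:
                if name not in template:
                    extended.add(tuple(sorted(template + (name,))))
        frontier = sorted(extended)
        if not frontier:
            break
    return informative
```

Because uninformative templates are never extended, most of the exponentially many input subsets are never submitted to the site.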
9
ISIT vs. Cp vs. Tp vs. Tpl
10
Conclusion
First large-scale Deep Web surfacing system
Building block for exploring the Deep Web