Presentation is loading. Please wait.

Presentation is loading. Please wait.

Google’s Deep Web Crawler

Similar presentations


Presentation on theme: "Google’s Deep Web Crawler"— Presentation transcript:

1 Google’s Deep Web Crawler
- Dev PAtel

2 The Deep web There a big difference between Dark Web and Deep Web
Dark Web is a small part of Dark Web Deep Web consists of content hidden behind HTML forms

3 Deep Web Approaches Vertical search engine Surfacing

4 HTML Form Processing

5 Surfacing Goal – Achieving good content/coverage on individual sites while limiting the number of submissions Selecting an appropriate set of query temples Selecting appropriate input values

6 Challenges with Surfacing
Finding appropriate values Forms have more than one value

7 Contributions Informativeness Test
Algorithm that efficiently traverses the query templets Algorithm for predicting input values

8 ISIT

9 ISIt vs. Cp vs. Tp vs. Tpl

10 Conclusion First large scale Deep-Web surfacing system
Building block for exploring the Deep Web


Download ppt "Google’s Deep Web Crawler"

Similar presentations


Ads by Google