Http://iinetsrv.washington.edu/cse490i/ncweb.

1

2 Highlights
Written in C++
Uses the WinINet libraries and the Win32 API
Focused crawl (~6-7% of links traversed are mp3s)
Runtime client for monitoring crawler status

3 Interface

4

5

6

7 Focusing the Crawl
Priority queue for links
Voting system allows for modular extension of the AI
Weighted keyword heuristic using TF-IDF
Weighted URL analysis heuristic
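
The link prioritization described on this slide might look roughly like the sketch below. It is not the crawler's actual code: the names `Link`, `Voter`, `keywordVoter`, `urlVoter`, and `Frontier` are hypothetical, the keyword weights are hard-coded stand-ins for values that would come from a TF-IDF computation, and the URL rules are invented for illustration. Each heuristic is a "voter" that scores a link between 0 and 1, the votes are combined as a weighted average, and a priority queue hands back the highest-scoring link first.

```cpp
#include <algorithm>
#include <cctype>
#include <functional>
#include <queue>
#include <string>
#include <utility>
#include <vector>

// A link found on a page, plus the text surrounding it (assumed structure).
struct Link {
    std::string url;
    std::string context;
};

// A voter scores a link in [0, 1]; higher means more likely to lead to an mp3.
using Voter = std::function<double(const Link&)>;

// Lowercase copy, so that matching is case-insensitive.
std::string lower(std::string s) {
    std::transform(s.begin(), s.end(), s.begin(),
                   [](unsigned char c) { return static_cast<char>(std::tolower(c)); });
    return s;
}

// Hypothetical keyword voter: sums the weights of the keywords that occur in
// the link's context, capped at 1. The weights stand in for TF-IDF values.
Voter keywordVoter(std::vector<std::pair<std::string, double>> weights) {
    return [weights](const Link& l) {
        const std::string text = lower(l.context);
        double score = 0.0;
        for (const auto& kw : weights)
            if (text.find(kw.first) != std::string::npos) score += kw.second;
        return std::min(score, 1.0);
    };
}

// Hypothetical URL voter: looks only at the URL string itself.
double urlVoter(const Link& l) {
    const std::string u = lower(l.url);
    if (u.size() >= 4 && u.compare(u.size() - 4, 4, ".mp3") == 0) return 1.0;
    if (u.find("mp3") != std::string::npos || u.find("music") != std::string::npos) return 0.5;
    return 0.0;
}

// Crawl frontier: voters are pluggable, links come out best-first.
class Frontier {
public:
    void addVoter(Voter v, double weight) { voters_.push_back({std::move(v), weight}); }

    // Weighted average of all votes; 0 if no voters are registered.
    double score(const Link& l) const {
        double total = 0.0, weightSum = 0.0;
        for (const auto& v : voters_) {
            total += v.second * v.first(l);
            weightSum += v.second;
        }
        return weightSum > 0.0 ? total / weightSum : 0.0;
    }

    void push(const Link& l) { queue_.push({score(l), l.url}); }
    bool empty() const { return queue_.empty(); }

    // Removes and returns the URL with the highest combined score.
    std::string pop() {
        std::string url = queue_.top().second;
        queue_.pop();
        return url;
    }

private:
    std::vector<std::pair<Voter, double>> voters_;
    std::priority_queue<std::pair<double, std::string>> queue_;  // max-heap on score
};
```

Under these assumptions, adding another heuristic means registering one more voter with `addVoter`; the queue and the scoring loop stay unchanged, which is the kind of modularity the voting system is meant to provide.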

8 Future Extensions
Fill in lost information from an online database like ...
Include other AI heuristics, such as a Bayesian classifier
Distribute across multiple systems

