Web Crawler Agent (WCA) Presented by Kirk Martinez University of Southampton.


1 Web Crawler Agent (WCA) Presented by Kirk Martinez University of Southampton

2 Introduction
WCA searches the Web for missing information (fragments)
WCA structures the information it finds into an ontology
–e.g. “place_of_birth” (Person, Place)
Techniques used: natural language processing (NLP), information extraction, relation extraction, question answering
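
As a rough illustration of the “place_of_birth” (Person, Place) idea, the sketch below models a relation as a name plus the types of its two arguments, and finds which relations are still missing for a given subject. This is not WCA's actual data model: the class names, the missing_relations helper and the sample fact are assumptions made up for this example.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Relation:
    """A binary ontology relation, e.g. place_of_birth(Person, Place)."""
    name: str
    domain: str       # type of the subject, e.g. "Person"
    range_type: str   # type of the value, e.g. "Place"


@dataclass(frozen=True)
class Fact:
    """One known instance of a relation (hypothetical structure)."""
    relation: Relation
    subject: str
    value: str


PLACE_OF_BIRTH = Relation("place_of_birth", "Person", "Place")
DATE_OF_BIRTH = Relation("date_of_birth", "Person", "Date")


def missing_relations(subject: str,
                      relations: List[Relation],
                      facts: List[Fact]) -> List[str]:
    """Return the names of relations with no known fact for `subject`."""
    known = {f.relation.name for f in facts if f.subject == subject}
    return [r.name for r in relations if r.name not in known]


# Illustrative knowledge base: only the place of birth is known so far.
facts = [Fact(PLACE_OF_BIRTH, "Rembrandt", "Leiden")]
gaps = missing_relations("Rembrandt", [PLACE_OF_BIRTH, DATE_OF_BIRTH], facts)
```

Here gaps lists the fragments a crawler would still need to look for on the Web.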

3 Overview

4 Is it something like “Google”? Try searching for “date_of_birth” (when Rembrandt was born) with Google

8 Searching information with Google
The “old” Web search (e.g. Google) is good at retrieving documents but NOT at extracting concise answers
–e.g. “15-July-1606”
No analysis is done to “understand” the documents (e.g. “Rembrandt” can refer to a hotel or a bookstore)
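
To make the difference between “a document” and “a concise answer” concrete, here is a minimal sketch that pulls a day-month-year date out of a sentence with a regular expression and returns it in the “15-July-1606” style. It is only a stand-in for real information extraction: the pattern, the extract_date name and the sample sentences are assumptions for illustration, and a plain regex does none of the “understanding” the slide refers to.

```python
import re
from typing import Optional

MONTHS = ("January|February|March|April|May|June|July|"
          "August|September|October|November|December")

# Day, month name and four-digit year, separated by spaces, commas or hyphens.
DATE_PATTERN = re.compile(
    rf"\b(\d{{1,2}})[\s,-]+({MONTHS})[\s,-]+(\d{{4}})\b"
)


def extract_date(text: str) -> Optional[str]:
    """Return the first date found in `text` as 'D-Month-YYYY', or None."""
    match = DATE_PATTERN.search(text)
    if match is None:
        return None
    day, month, year = match.groups()
    return f"{int(day)}-{month}-{year}"


# A sentence that contains an answer, and one that only mentions the name.
answer = extract_date("Rembrandt was born on 15 July 1606.")
no_answer = extract_date("Hotel Rembrandt offers rooms near the station.")
```

The second sentence shows why keyword matching alone is not enough: the name appears, but the text says nothing about a date of birth.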

9 Information extraction on the Web
Data may be of low quality and repeated
–e.g. Georges Seurat’s date of death
–29 March 1891 (http://www.ibiblio.org/wm/paint/auth/seurat/)
–19 March 1891 (http://www.rickdoble.net/influence/20seurat.htm)
WCA depends on:
–Well-structured sentences and documents
–Good named-entity recognisers
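
One simple way to cope with repeated and conflicting values is to count how many distinct sources support each candidate and to refuse to pick a winner on a tie. The sketch below does this for the two dates on the slide. It is an assumed illustration, not a description of how WCA resolves conflicts; the normalise and consolidate names are made up for this example.

```python
from typing import Dict, List, Optional, Set, Tuple


def normalise(value: str) -> str:
    """Drop commas and collapse whitespace so '29, March 1891' matches '29 March 1891'."""
    return " ".join(value.replace(",", " ").split())


def consolidate(
    candidates: List[Tuple[str, str]]
) -> Tuple[Optional[str], Dict[str, int]]:
    """Pick the value backed by the most distinct sources.

    `candidates` holds (value, source_url) pairs. Returns (best, support),
    where best is None if there are no candidates or the top values tie.
    """
    sources_by_value: Dict[str, Set[str]] = {}
    for value, source in candidates:
        sources_by_value.setdefault(normalise(value), set()).add(source)

    support = {value: len(srcs) for value, srcs in sources_by_value.items()}
    counts = sorted(support.values(), reverse=True)
    if not counts:
        return None, support
    if len(counts) > 1 and counts[0] == counts[1]:
        return None, support  # tie: needs further verification
    return max(support, key=support.get), support


# The two conflicting values quoted on the slide, one source each.
candidates = [
    ("29, March 1891", "http://www.ibiblio.org/wm/paint/auth/seurat/"),
    ("19, March 1891", "http://www.rickdoble.net/influence/20seurat.htm"),
]
best, support = consolidate(candidates)
```

With one source per value the result is a tie, so best is None and the fact stays unresolved until more evidence is found.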

22 Future work
–Verification
–Performance
–Autonomy

