Web Crawler Agent (WCA) Presented by Kirk Martinez University of Southampton.

Slides:



Advertisements
Similar presentations
March 10, 2014 MULTICULTURAL LITERACY.  1. Create a Google Document (Title it “Holocaust”)  2. Type a word pertaining to the Holocaust on the document.
Advertisements

Introduction to WSDL presented by Xiang Fu. Source WSDL 1.1 specification WSDL 1.1 specification – WSDL 1.2 working draft WSDL.
ArtEquAKT Harith Alani, Sanghee Kim, Wendy Hall, Paul Lewis, David Millard, Nigel Shadbolt, Mark Weal.
Flegg year 8 ICT Unit Reliability & Bias 1 Who can you trust? Which web sites are reliable?
© Jalal Kawash 2010 Introduction Peeking into Computer Science 1.
CPSC 322, Lecture 19Slide 1 Propositional Logic Intro, Syntax Computer Science cpsc322, Lecture 19 (Textbook Chpt ) February, 23, 2009.
1 Automating the Extraction of Genealogical Information from the Web GeneTIQS Troy Walker & David W. Embley Family History Technology Conference March.
Ontologies and the Semantic Web by Ian Horrocks presented by Thomas Packer 1.
Resources. Overview Problem Report WebCT Faculty & Student Support Searching.
CIS392Semester Projects1 CIS392 Text Processing, Retrieval, and Mining Overview of Semester Projects.
1/7 INFO60021 Natural Language Processing Harold Somers Professor of Language Engineering.
Why I Find The Semantic Web Interesting Hugh Glaser DSSE Seminar 1/11/2.
World Wide Web As the World Wide Web increased in popularity, it was difficult to keep track of all web addresses. Search engines were created to minimize.
Agent-based Interfaces u u u
Text mining and the Semantic Web Dr Diana Maynard NLP Group Department of Computer Science University of Sheffield.
A Technical Seminar on Question Answering SHRI RAMDEOBABA COLLEGE OF ENGINEERING & MANAGEMENT Presented By: Rohini Kamdi Guided By: Dr. A.J.Agrawal.
Going Beyond Simple Question Answering Bahareh Sarrafzadeh CS 886 – Spring 2015.
Steps Toward an AGI Roadmap Włodek Duch ( Google: W. Duch) AGI, Memphis, 1-2 March 2007 Roadmaps: A Ten Year Roadmap to Machines with Common Sense (Push.
Aardvark Anatomy of a Large-Scale Social Search Engine.
Information Retrieval and Web Search Lecture 1. Course overview Instructor: Rada Mihalcea Class web page:
Google Scholar as a cybermetric tool Alastair G Smith Victoria University of Wellington New Zealand
Extracting Metadata for Spatially- Aware Information Retrieval on the Internet Clough, Paul University of Sheffield, UK Presented By Mayank Singh.
Semantic Technologies & GATE NSWI Jan Dědek.
Web Services and Application of Multi-Agent Paradigm for DL Yueyu Fu & Javed Mostafa School of Library and Information Science Indiana University, Bloomington.
NLP. Introduction to NLP Extrinsic –Use in an application Intrinsic –Cheaper Correlate the two for validation purposes.
Using a Named Entity Tagger to Generalise Surface Matching Text Patterns for Question Answering Mark A. Greenwood and Robert Gaizauskas Natural Language.
Bootstrapping Regular-Expression Recognizer to Help Human Annotators Tae Woo Kim.
October 2005CSA3180 NLP1 CSA3180 Natural Language Processing Introduction and Course Overview.
7 th grade Science RULE-BASED SUMMARIZING.  What does it mean to summarize? SUMMARIZING  State or express in a concise form.
Project Overview Vangelis Karkaletsis NCSR “Demokritos” Frascati, July 17, 2002 (IST )
Multi-agent Systems in Medicine Štěpán Urban. Content  Introduction to Multi-agent Systems (MAS) What is an Agent? Architecture of Agent MAS Platforms.
DO NOT Log on I have three things to discuss with you first: Due dates Products Missing Assignments.
Learning to Share Meaning in a Multi-Agent System (Part I) Ganesh Padmanabhan.
Using a Named Entity Tagger to Generalise Surface Matching Text Patterns for Question Answering Mark A. Greenwood and Robert Gaizauskas Natural Language.
Eg1. A square We say that a square has… It fits on itself 4 times back.
Harith Alani, Sanghee Kim, David Millard, Mark Weal, Paul Lewis, Wendy Hall, Nigel Shadbolt Using Protégé for Automatic Ontology Instantiation 7 th International.
Search Engines: A History  First search engine was Veronica for the Gopher network  1991 Gopher  After Gopher disappeared, the first one for modern.
Artificial Intelligence Techniques Internet Applications 4.
OWL Web Ontology Language Summary IHan HSIAO (Sharon)
Using Semantic Relations to Improve Information Retrieval
ILLOS Interlend 2006 University of Southampton Ian Stuart, Lancaster University Library.
AI and GAMES. Some games to explore Yahoo AI Games directory. Mostly chatbots; 20 questions is good. Yahoo AI Games directory. Mostly chatbots; 20 questions.
Find death date for a member. Date of Birth Search death date for James M Clar k. Name taken from a former member list.
Explorer’s Name Explorer’s Picture Student’s Name.
© 1990—2006 Visual Knowledge Software® | Private and Confidential | 2 Semantic Agent Wikis For Engineering.
©2003 Paula Matuszek CSC 9010: AeroText, Ontologies, AeroDAML Dr. Paula Matuszek (610)
Introduction of Scratch (1/4) You can find Scratch on the Web:
WALT: Express missing number problems algebraically
By: Zainab Alblushi ID# Huda AlAmri ID:
Web Services and Application of Multi-Agent Paradigm for DL
Introduction to Data Mining
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
Artificial Intelligence and Lisp Lecture 13 Additional Topics in Artificial Intelligence LiU Course TDDC65 Autumn Semester,
Christianity So…..what do you know???.
What is IR? In the 70’s and 80’s, much of the research focused on document retrieval In 90’s TREC reinforced the view that IR = document retrieval Document.
Answer the following questions on each slide.
Vocabulary and Reading Skills
Question Answering via Question-to-Question Mapping
order 4 Eg1. A square It fits on itself 4 times
Semantic Web Visioning Exercise
סדר דין פלילי – חקיקה ומהות ההליך הפלילי
FIBO-aligned Semantic Triples
Robotic Search Engines for the Physical World
The ERES Digital Library:

You must show all steps of your working out.
Question 1.
Thing / Person:____________________ Dates:_________________
LANGUAGE EDUCATION.
Trailist Focusing on document activity for assisting navigation Sun Technology Research Excellence Center (SunTREC) Salzburg Research, Austria — 16.
Presentation transcript:

Web Crawler Agent (WCA) Presented by Kirk Martinez University of Southampton

Introduction WCA searches for missing information (fragments) on the Web WCA structures information into ontology “place_of_birth” (Person,Place) Techniques used: NLP (Natural Language Processing), Information extraction, relation extraction, question answering

Overview

Is it something like “Google”? Search “date_of_birth” (when Rembrandt was born) with Google

Searching information with Google The “old” Web Search (eg Google) is good for getting documents but NOT for extracting concise answers –(e.g. “15-July-1606”) No analysis to “understand” the documents (e.g. “Rembrandt” can mean “hotel” or “bookstore”)

Information extraction on the Web data may be low quality and repeated –e.g. Seurat Georges’s date of death –29, March 1891( –19, March 1891 ( WCA depends on: –Well-structured sentences and documents –Good named-entity recognisers

Future work verification performance autonomous