Facilitating Semantic Web Search with Embedded Grammar Tags (EGTs) Gautham K.Dorai Yaser Yacoob Department of Computer Science University of Maryland –

Slides:



Advertisements
Similar presentations
Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
Advertisements

Building a Semantic IntraWeb with Rhizomer and a Wiki Roberto Garcia and Rosa Gil GRIHO (Human Computer Interaction Research Group) Universitat de Lleida,
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
Sematic Web Microdata, Microformat and RDF Advanced Web-based Systems | Misbhauddin.
Using the Semantic Web to Construct an Ontology- Based Repository for Software Patterns Scott Henninger Computer Science and Engineering University of.
Galia Angelova Institute for Parallel Processing, Bulgarian Academy of Sciences Visualisation and Semantic Structuring of Content (some.
The Web of data with meaning... By Michael Griffiths.
Search Engines. 2 What Are They?  Four Components  A database of references to webpages  An indexing robot that crawls the WWW  An interface  Enables.
The State of the Art in VoiceXML Chetan Sharma, MS Graduate Student School of CSIS, Pace University.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Page 1 of 11 Craig Schock, 2003 XML – eXtensible Markup Language What is XML? What is the big fuss about XML? Will XML save the world? Limitations of XML.
15 Chapter 15 Web Database Development Database Systems: Design, Implementation, and Management, Fifth Edition, Rob and Coronel.
DARPA Agent Markup Language Ashish Jain University of Colorado at Boulder.
The Semantic Web Week 12 Term 1 Recap Lee McCluskey, room 2/07 Department of Computing And Mathematical Sciences Module Website:
Samad Paydar Web Technology Laboratory Computer Engineering Department Ferdowsi University of Mashhad 1389/11/20 An Introduction to the Semantic Web.
Russell Taylor Lecturer in Computing & Business Studies.
Tutorial 8 Sharing, Integrating and Analyzing Data
Automatic Data Ramon Lawrence University of Manitoba
CS580: Building Web Based Information Systems Roger Alexander & Adele Howe The purpose of the course is to teach theory and practice underlying the construction.
Overview of Search Engines
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Lecturer: Ghadah Aldehim
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
Nilanthi Seneviratne Meghavi Doshi iAnnotate: Digital Ink Annotation on the Web.
Search Engine By Bhupendra Ratha, Lecturer School of Library and Information Science Devi Ahilya University, Indore
DC 2004 Metadata Generation and Accessibility Auditing Liddy Nevile La Trobe University, Australia Mail
Audio and Video Chris McConnell Department of Radio-TV-Film November 30, 2006.
1 Search Engines Emphasis on Google.com. 2 Discovery  Discovery is done by browsing & searching data on the Web.  There are 2 main types of search facilities.
XML A web enabled data description language 4/22/2001 By Mark Lawson & Edward Ryan L’Herault.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
McLean HIGHER COMPUTER NETWORKING Lesson 7 Search engines Description of search engine methods.
Department of computer science and engineering Two Layer Mapping from Database to RDF Martin Švihla Research Group Webing Department.
SNOWTAM Trial: REST Interface. AIXM XML Developers' Seminar 2 Contents Digital-SNOWTAM Trial Introduction REST Introduction REST in the Digital-SNOWTAM.
BioRAT: Extracting Biological Information from Full-length Papers David P.A. Corney, Bernard F. Buxton, William B. Langdon and David T. Jones Bioinformatics.
Introduction to XML By Manzur Ashraf (Shovon) Dept. of Computer Science & Engineering (BUET)
OWL Representing Information Using the Web Ontology Language.
Trustworthy Semantic Webs Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #4 Vision for Semantic Web.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
ASP. ASP is a powerful tool for making dynamic and interactive Web pages An ASP file can contain text, HTML tags and scripts. Scripts in an ASP file are.
The World Wide Web. What is the worldwide web? The content of the worldwide web is held on individual pages which are gathered together to form websites.
7. Data Import Export Lingma Acheson Department of Computer and Information Science IUPUI CSCI N207 Data Analysis Using Spreadsheets 1.
From XML to DAML – giving meaning to the World Wide Web Katia Sycara The Robotics Institute
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Invitation to Computer Science 6 th Edition Chapter 10 The Tower of Babel.
Introduction to the World Wide Web & Internet CIS 101.
Setting up a search engine KS 2 Search: appreciate how results are selected.
The Semantic Web Vision. Course Work Dr Yasser Fouad Blogs.alexu.edu.eg 2.
Search Engine and Optimization 1. Introduction to Web Search Engines 2.
SEMANTIC WEB Presented by- Farhana Yasmin – MD.Raihanul Islam – Nohore Jannat –
1 New Perspectives on Access 2016 Module 8: Sharing, Integrating, and Analyzing Data.
Website Source Code Free Download.
Data mining in web applications
Information Retrieval in Practice
David Huynh, Stefano Mazzocchi, David Karger Piggy Bank: Experience the Semantic Web inside your web browser Web Semantics: Science, Services and Agents.
Chapter Five Web Search Engines
Uppingham Community College
Contents Digital-SNOWTAM Trial Introduction REST Introduction
WHAT DOES THE FUTURE HOLD? Ann Ellis Dec. 18, 2000
UNIT 15 Webpage Creator.
Using Access and the Web
Prepared by Rao Umar Anwar For Detail information Visit my blog:
Zachary Cleaver Semantic Web.
CS & CS Capstone Project & Software Development Project
What is a Search Engine EIT, Author Gay Robertson, 2017.
Multimedia Information Retrieval
Introduction of Week 11 Return assignment 9-1 Collect assignment 10-1
Semantic Markup for Semantic Web Tools:
Database Connectivity and Web Development
CSE591: Data Mining by H. Liu
Web Programming : Building Internet Applications Chris Bates CSE :
Presentation transcript:

Facilitating Semantic Web Search with Embedded Grammar Tags (EGTs) Gautham K.Dorai Yaser Yacoob Department of Computer Science University of Maryland – College Park

The Future – A Forecast Speech Grammar based Search Engine What is the value of Nasdaq today ? WWW The value is !! ???

Roadmap Forecast Problem Statement and Our Solution Related Work Demonstration Summary Future Work

Problem Statement (1) Web Content is represented for human consumption Software agents do not have interpretive tools for semantic information recovery Hence agents cannot understand web content

Problem Statement (2) Web content does not support queries by natural language interaction e.g. : Query : “What is the weather at College Park” - searches lead to links on related subject content

Our Solution (1) We embed natural language queries in the web content Embedded Grammar Tags (EGTs) represent queries in a general (parseable) format Discovery of relevant response by EGT matching

EGT – The Big Picture HTML Page EGT Annotation Internet Web Search Engines EGT Search QUERY NLP

Our Solution (2) EGT uses the general BNF grammar format to represent queries E.g.: * [is] the temperature [is] at College Park Captures queries such as - What is the temperature at College Park ? - Tell me what the temperature is at College Park ?

Our Solution (3) EGT structure : * can be replaced by any word/set of words () mandatory words for EGT match [] optional words Web Content is annotated with EGTs e.g: * [is] the weather [is] at College Park [is] mostly sunny

Our Solution (4) EGTs - More examples : Wind: * [is the] [wind] (speed|velocity) [of the] [wind] [is] at [is] 3mph * [is] [Nasdaq *] [the] (value|quote|price) [of Nasdaq] *

Related Work (1) Natural Language Processing (NLP) – attempt to uncover meaning in HTML content DAML, RDF, SHOE, XML – add metadata to describe the web content Facilitate more efficient content search

Related Work (2) E.g. : Gautham Dorai - special tags (, ) to describe content Fine grained natural language queries on the content require an expandable universally available tag database

Why EGTs ? (1) RDF Triple can also be used e.g: College Park - [weatherat]  mostly sunny Nasdaq - [quoteof]  But EGTs are naturally expandable and more amenable to change Simplifies search engine complexity

Why EGTs ? (2) EGTs describe content in an unconstrained format EGTs are already present in speech recognition technology Ease of transition from visual  phone browsers

Demonstration We annotate a given home-page with EGTs The user can query the content in natural language Search engine parses the web page for EGT match and responds

EGT Annotator (Preliminary) Create a template page that is EGT-ready, i.e., EGTs are transparent to the user The template is for home-pages at CS Dept. The user can simply copy information from the HTML page onto the annotator

Summary EGTs enable software agents to respond to natural language queries An EGT search engine can be implemented on top of conventional content search engines Responses are constructed based on the extracted information from an EGT match

Future Work Expandable Universal Grammar - Universally Available query grammar packages EGT Recognition Metrics - Statistical Analysis to search for EGT matches EGT Crawler - Crawler that parses through EGT annotated web-pages