NLP And The Semantic Web Dainis Kiusals COMS E6125 Spring 2010.



Natural Language Processing

1950s and 1960s – researchers began developing techniques for using computers to process natural language. Noam Chomsky studied the ability to capture context. His theory is based on generative grammars: constructs that describe how a sentence is formed. They can be used to create formal grammars through which an input stream of words may be parsed as a first step toward extracting its meaning [1].

Diagram levels: [sentence]; [noun phrase], [verb phrase]; [determiner], [noun], [verb], [article], [adjective], [adverb]
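The idea of parsing a word stream against a formal grammar can be illustrated with a minimal sketch. The rules and the tiny lexicon below are assumptions made up for this example (they are not taken from the slides or from Chomsky's work), and the parser is a plain backtracking recursive-descent matcher, not a production technique.

```python
# Toy generative grammar: each symbol rewrites into one of several sequences.
# GRAMMAR and LEXICON are hypothetical and cover only a handful of words.
GRAMMAR = {
    "sentence": [["noun_phrase", "verb_phrase"]],
    "noun_phrase": [["determiner", "adjective", "noun"], ["determiner", "noun"]],
    "verb_phrase": [["verb", "noun_phrase"], ["verb", "adverb"], ["verb"]],
}
LEXICON = {
    "determiner": {"the", "a"},
    "adjective": {"small", "large"},
    "noun": {"company", "dog", "ball"},
    "verb": {"sells", "chases", "runs"},
    "adverb": {"quickly"},
}


def parse(symbol, words, pos=0):
    """Yield (tree, next_pos) for every way `symbol` can match words[pos:]."""
    if symbol in LEXICON:
        # Terminal category: consume exactly one word if it belongs to it.
        if pos < len(words) and words[pos] in LEXICON[symbol]:
            yield (symbol, words[pos]), pos + 1
        return
    for production in GRAMMAR[symbol]:
        # Track every partial match as (children so far, position reached).
        partials = [([], pos)]
        for part in production:
            partials = [
                (children + [tree], nxt)
                for children, p in partials
                for tree, nxt in parse(part, words, p)
            ]
        for children, end in partials:
            yield (symbol, children), end


def parse_sentence(text):
    """Return the first parse tree that covers the whole input, or None."""
    words = text.lower().split()
    for tree, end in parse("sentence", words):
        if end == len(words):
            return tree
    return None


if __name__ == "__main__":
    print(parse_sentence("The dog chases a ball"))
    print(parse_sentence("dog the chases"))
```

The resulting tree is only the first step the slide mentions: it exposes structure, and any extraction of meaning would have to be built on top of it.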

NLP Issues / Challenges

o Morphology – different forms of words (singular/plural, tense). Example: runs, ran, running
o Syntax – grammatical structure (verbs, nouns)
o Spelling – different spellings (and misspellings) of words. Example: color/colour, organize/organise
o Text Segmentation – identifying word boundaries
o Word Sense Disambiguation – multiple word meanings. Example: bow (bend forward, weapon, ribbon, front of ship)?

Example sentence: The company is ready to sell.
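Word sense disambiguation for the "bow" example can be sketched with a simplified gloss-overlap heuristic (in the spirit of the Lesk approach, which the slides do not name). The glosses, the stopword list, and the function names are assumptions written for this illustration only.

```python
# Hypothetical one-line glosses for the four senses of "bow" listed on the slide.
SENSES = {
    "bend forward": "bend the body forward as a gesture of respect or greeting",
    "weapon": "a weapon that shoots arrows used by an archer",
    "ribbon": "a decorative knot tied with ribbon on a gift or in hair",
    "front of ship": "the front part of a ship or boat",
}
STOPWORDS = {
    "a", "an", "the", "of", "or", "in", "on", "as", "with",
    "that", "by", "to", "and", "his", "her",
}


def tokens(text):
    """Lowercase the text, strip basic punctuation, and drop stopwords."""
    return {w.strip(".,!?").lower() for w in text.split()} - STOPWORDS


def disambiguate(context, senses=SENSES):
    """Pick the sense whose gloss shares the most words with the context."""
    ctx = tokens(context)
    scores = {sense: len(ctx & tokens(gloss)) for sense, gloss in senses.items()}
    best = max(scores, key=scores.get)
    # With no overlap at all, refuse to guess.
    return best if scores[best] > 0 else None


if __name__ == "__main__":
    print(disambiguate("The archer drew his bow and shot two arrows"))
    print(disambiguate("The captain stood at the bow of the ship"))
```

A word-overlap score like this is fragile (it depends entirely on how the glosses are worded), which is part of why the slide lists disambiguation as a challenge.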

Semantic Web

Proposed by Tim Berners-Lee (W3C Director) as a method for adding concepts to Web content via semantic annotation. The W3C standardizes RDF and OWL. At the lowest level, concepts are stored as triples; at higher levels, they are defined by ontologies. [2] [3]
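A minimal sketch of the two levels described here, with concepts stored as (subject, predicate, object) triples and a small class hierarchy standing in for an ontology. The `ex:` names are hypothetical identifiers invented for this example, and the store is a plain Python set, not an RDF library.

```python
# Each fact is one (subject, predicate, object) triple.
TRIPLES = {
    ("ex:TimBernersLee", "rdf:type", "ex:Person"),
    ("ex:TimBernersLee", "ex:directorOf", "ex:W3C"),
    ("ex:W3C", "rdf:type", "ex:Organization"),
    # Ontology-level statement: every Person is also an Agent.
    ("ex:Person", "rdfs:subClassOf", "ex:Agent"),
}


def match(store, s=None, p=None, o=None):
    """Return triples matching the pattern; None acts as a wildcard."""
    return sorted(
        t for t in store
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    )


def types_of(store, entity):
    """Collect the entity's direct types plus all of their superclasses."""
    found = set()
    frontier = [o for _, _, o in match(store, s=entity, p="rdf:type")]
    while frontier:
        cls = frontier.pop()
        if cls not in found:
            found.add(cls)
            frontier.extend(
                o for _, _, o in match(store, s=cls, p="rdfs:subClassOf")
            )
    return found


if __name__ == "__main__":
    print(match(TRIPLES, p="rdf:type"))
    print(types_of(TRIPLES, "ex:TimBernersLee"))
```

The `types_of` helper shows why the ontology layer matters: the store never states that the subject is an `ex:Agent`, yet that fact follows from the subclass triple.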

Keyword Search

Queries are processed only as a statistical analysis of keyword appearance in documents, with some advanced logical features. Keyword search does not distinguish between different interpretations of a word in a given context in the searched data (corpus), so search results might contain different uses of a word. [4]
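The limitation can be seen in a small sketch of frequency-based keyword ranking. The four documents are invented for this illustration, and raw term counts stand in for whatever statistics a real engine would use.

```python
from collections import Counter

# Hypothetical mini-corpus; three documents use "bow" in three different senses.
DOCS = {
    "archery": "the archer strung the bow and aimed the bow at the target",
    "sailing": "waves broke over the bow of the ship",
    "gifts": "she tied a ribbon into a bow on the gift",
    "finance": "the company is ready to sell",
}


def keyword_search(query, docs=DOCS):
    """Rank documents by how often the query terms appear in them."""
    terms = query.lower().split()
    results = []
    for doc_id, text in docs.items():
        counts = Counter(text.lower().split())
        score = sum(counts[t] for t in terms)
        if score > 0:
            results.append((doc_id, score))
    # Highest score first; ties broken alphabetically for stable output.
    return sorted(results, key=lambda r: (-r[1], r[0]))


if __name__ == "__main__":
    print(keyword_search("bow"))
```

All three senses of "bow" come back in one ranked list, because the score looks only at how often the string occurs and never at what it means in context.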

NLP / Semantic Search

Increased relevancy of results vs. keyword search.
– Longer query phrases and questions yield better results.
– Makes use of semantic information to attain better results.
Users need to change their habits (they are used to keywords).
– NLP search pages need to encourage the use of complex queries.
Web Search vs. Enterprise Search?
– NLP search may be better suited to smaller domains.
Top-Down or Bottom-Up approach?
– The Top-Down approach relies more on NLP.
– Creating the Semantic Web (Bottom-Up) will be more costly.

NLP / Semantic Web Relationship

NLP and the Semantic Web complement each other and will grow together. As Semantic Web (RDF and OWL) annotation is added to Web pages, NLP search engines can take advantage of this information. NLP processes can be used to automate the generation of content that populates new Semantic Web annotation. Global and domain-specific ontologies (which represent concepts and their relationships), combined with NLP techniques, define the search process.
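One way NLP could feed Semantic Web annotation is by turning simple sentences into triples. The sketch below uses two hand-written regular-expression patterns; the patterns, the `ex:acquiredBy` predicate, and the sample company names are all assumptions for illustration, and real systems would rely on full parsing, not on surface patterns.

```python
import re

# Hypothetical surface patterns, each mapped to the predicate it produces.
PATTERNS = [
    (re.compile(r"^(?P<s>[A-Z][\w ]*?) is an? (?P<o>[\w ]+?)\.?$"), "rdf:type"),
    (
        re.compile(r"^(?P<s>[A-Z][\w ]*?) was acquired by (?P<o>[\w ]+?)\.?$"),
        "ex:acquiredBy",
    ),
]


def extract_triples(text):
    """Split text into sentences and emit one triple per matching sentence."""
    triples = []
    for sentence in re.split(r"(?<=\.)\s+", text.strip()):
        for pattern, predicate in PATTERNS:
            m = pattern.match(sentence)
            if m:
                triples.append((m.group("s"), predicate, m.group("o")))
                break  # first matching pattern wins
    return triples


if __name__ == "__main__":
    print(extract_triples("RDF is a data model. Acme was acquired by Globex."))
```

Sentences that match no pattern are silently skipped, which hints at the trade-off the slides raise: automated annotation is cheap but incomplete.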

Case Study:

1. Founded in San Francisco in 2005 with the goal of creating an NLP search engine.
2. In 2007, obtained exclusive rights to several decades of Xerox/PARC NLP research.
3. Launched its first public software beta in May 2008: an NLP search website covering approx. 2.5 million Wikipedia web pages (it also referenced Freebase).
4. Created an innovative user interface that leveraged NLP/semantic search results (ex: highlighting of relevant phrases/sentences within a larger document).
5. Two months after the public beta, the company was acquired by Microsoft in order to be incorporated into the Bing search engine.

NLP Search Companies

Resources

1. An Executive's Guide to Information Technology: Principles, Business Models and Terminology, by Robert Plant and Stephen Murrell, Cambridge University Press.
2. Enterprise 2.0 Implementation, Chapter 13, by Aaron C. Newman and Jeremy Thomas, McGraw-Hill/Osborne.
3. Encyclopedia of Knowledge Management, RDF and OWL, by David G. Schwartz (ed), IGI Global.
4. Semantic Knowledge Management: An Ontology-Based Framework, by Antonio Zilli (ed) et al., IGI Global.

Resources/information taken from Full Paper submitted 3/12/10.