Automated Question Answering Suggestion Using User Expert and Semantic Information การแนะนำการตอบคำถามอัตโนมัติ โดยใช้ข้อมูลผู้เชี่ยวชาญ และข้อมูลเชิง.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
The CERIF-2000 Implementation. Andrei S. Lopatenko CERIF Implementation Guidelines Andrei Lopatenko Vienna University of Technology
The use of an intelligent forum crawler for data retrieval from e-learning portals Miloš Pavković and Jelica Protić, University of Belgrade School of.
Web Search - Summer Term 2006 III. Web Search - Introduction (Cont.) (c) Wolfgang Hürst, Albert-Ludwigs-University.
Search Engines and Information Retrieval
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
CIS392Semester Projects1 CIS392 Text Processing, Retrieval, and Mining Overview of Semester Projects.
6/16/20151 Recent Results in Automatic Web Resource Discovery Soumen Chakrabartiv Presentation by Cui Tao.
Web Search – Summer Term 2006 III. Web Search - Introduction (Cont.) - Jeff Dean, Google's Systems Lab:
A Flexible Workbench for Document Analysis and Text Mining NLDB’2004, Salford, June Gulla, Brasethvik and Kaada A Flexible Workbench for Document.
Information Retrieval in Practice
WebMiningResearch ASurvey Web Mining Research: A Survey By Raymond Kosala & Hendrik Blockeel, Katholieke Universitat Leuven, July 2000 Presented 4/18/2002.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
Information Retrieval
Overview of Web Data Mining and Applications Part I
 Search engines are programs that search documents for specified keywords and returns a list of the documents where the keywords were found.  A search.
Knowledge Science & Engineering Institute, Beijing Normal University, Analyzing Transcripts of Online Asynchronous.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Search Engines and Information Retrieval Chapter 1.
Page 1 WEB MINING by NINI P SURESH PROJECT CO-ORDINATOR Kavitha Murugeshan.
Aardvark Anatomy of a Large-Scale Social Search Engine.
Funded by: European Commission – 6th Framework Project Reference: IST WP 2: Learning Web-service Domain Ontologies Miha Grčar Jožef Stefan.
Practical Project of the 2006 Joint International Master’s Degree.
Topical Crawlers for Building Digital Library Collections Presenter: Qiaozhu Mei.
WebMining Web Mining By- Pawan Singh Piyush Arora Pooja Mansharamani Pramod Singh Praveen Kumar 1.
Use of Hierarchical Keywords for Easy Data Management on HUBzero HUBbub Conference 2013 September 6 th, 2013 Gaurav Nanda, Jonathan Tan, Peter Auyeung,
Košice, 10 February Experience Management based on Text Notes The EMBET System Michal Laclavik.
НИУ ВШЭ – НИЖНИЙ НОВГОРОД EDUARD BABKIN NIKOLAY KARPOV TATIANA BABKINA NATIONAL RESEARCH UNIVERSITY HIGHER SCHOOL OF ECONOMICS A method of ontology-aided.
EU Project proposal. Andrei S. Lopatenko 1 EU Project Proposal CERIF-SW Andrei S. Lopatenko Vienna University of Technology
Gregor Gisler-Merz How to hit in google The anatomy of a modern web search engine.
XP New Perspectives on The Internet, Sixth Edition— Comprehensive Tutorial 3 1 Searching the Web Using Search Engines and Directories Effectively Tutorial.
Search engines are the key to finding specific information on the vast expanse of the World Wide Web. Without sophisticated search engines, it would be.
Keyword Searching and Browsing in Databases using BANKS Seoyoung Ahn Mar 3, 2005 The University of Texas at Arlington.
McLean HIGHER COMPUTER NETWORKING Lesson 7 Search engines Description of search engine methods.
Course grading Project: 75% Broken into several incremental deliverables Paper appraisal/evaluation/project tool evaluation in earlier May: 25%
Search Engine Architecture
Alexey Kolosoff, Michael Bogatyrev 1 Tula State University Faculty of Cybernetics Laboratory of Information Systems.
Search Engines1 Searching the Web Web is vast. Information is scattered around and changing fast. Anyone can publish on the web. Two issues web users have.
IT-522: Web Databases And Information Retrieval By Dr. Syed Noman Hasany.
Problems in Semantic Search Krishnamurthy Viswanathan and Varish Mulwad {krishna3, varish1} AT umbc DOT edu 1.
WEB MINING. In recent years the growth of the World Wide Web exceeded all expectations. Today there are several billions of HTML documents, pictures and.
Semantic web Bootstrapping & Annotation Hassan Sayyadi Semantic web research laboratory Computer department Sharif university of.
Supporting Knowledge Discovery: Next Generation of Search Engines Qiaozhu Mei 04/21/2005.
The World Wide Web. What is the worldwide web? The content of the worldwide web is held on individual pages which are gathered together to form websites.
Semantic Web COMS 6135 Class Presentation Jian Pan Department of Computer Science Columbia University Web Enhanced Information Management.
September 2003, 7 th EDG Conference, Heidelberg – Roberta Faggian, CERN/IT CERN – European Organization for Nuclear Research The GRACE Project GRid enabled.
General Architecture of Retrieval Systems 1Adrienn Skrop.
SEMANTIC WEB Presented by- Farhana Yasmin – MD.Raihanul Islam – Nohore Jannat –
Lecture-6 Bscshelp.com. Todays Lecture  Which Kinds of Applications Are Targeted?  Business intelligence  Search engines.
Data mining in web applications
Information Retrieval in Practice
Search Engine Optimization (SEO)
DATA MINING Introductory and Advanced Topics Part III – Web Mining
Search Engine Architecture
Prepared by Rao Umar Anwar For Detail information Visit my blog:
Knowledge Management Systems
SEARCH ENGINE OPTIMIZATION SEO. What is SEO? It is the process of optimizing structure, design and content of your website in order to increase traffic.
Search Engines & Subject Directories
Submitted By: Usha MIT-876-2K11 M.Tech(3rd Sem) Information Technology
The Economy of Distributed Metadata Authoring
CS & CS Capstone Project & Software Development Project
Agenda What is SEO ? How Do Search Engines Work? Measuring SEO success ? On Page SEO – Basic Practices? Technical SEO - Source Code. Off Page SEO – Social.
International Marketing and Output Database Conference 2005
Search Engines & Subject Directories
Search Engines & Subject Directories
Search Engine Architecture
Information Retrieval and Web Design
Discussion Class 9 Google.
A framework for ontology Learning FROM Big Data
Presentation transcript:

Automated Question Answering Suggestion Using User Expert and Semantic Information การแนะนำการตอบคำถามอัตโนมัติ โดยใช้ข้อมูลผู้เชี่ยวชาญ และข้อมูลเชิง ภาษา Mr. Seksan Poltree ID Code Asst. Prof. Kanda Runapongsa Saikaew Advisor [Need to include slide number]

Searching Knowledge And Information

Discussion, a Question-Answer is a classical methodology!

Web Question Answering And traditional Web Forums

What's a Problem? Topics and posts duplication Post same topic without search Question and answer quality Complicated algorithm Ambiguous meaning

Objectives 1. Create a system ranking based on semantic information for quality ranking instead of only quantity ranking 2. Increase question and answer based quality and reduce topic duplication

Related Works 1. Building Experience Database from Personal Opinion and Web Document [specify cite reference number] [need to specify the strengths and weaknesses and the difference between the proposed work and each related work] Experience Database User Generated Contents Topics Experiencer Event Type Factuality Source -- W

Related Works(2) 2. Effects of Earlier Messages on Later Messages Domain : Maxnet Feeling Contribution

3. Web-Service-Based Architecture for Question Answering 4. Automated Question Answering using Semantic Web Services Related Works(3) NLP Analyzer Web Service Semantic Web Service Web Services Caller Natural Language Template Answer

Related Works(4) 5. Page Ranking Algorithms : Survey Page Rank (PR) Google Page Rank Weighted Page Rank (WPR) PR but give Weight for some Page Page Content Rank (PCR) Relevancy page using Neuron Network HITS Sampling and then find HUB and Authority

Related Works(5) 6. Relational-Based Page Rank Algorithm for Semantic Web Search Engines Keywords Pre-Search Result Ontology sub-graph Travel.O WL Final Result

Research Tools HARVESTMAN WEB SPIDER NLTK Python Programming Language [Need to specify why choosing these tools] Django Framework : Python MVC Harvestman Web Spider : Open source Crawler PyLucene : Indexer and Searching DB Neo4J an open source graph database (include Lucene) NLTK : Python Natural Language Toolkit OWL library

System Design NLTK AJAX/Web Query Interface Semantic Question Analyzer (Orchid Tagging/ Sentence Structure) Quality Ranking Indexed Database Neo 4j Messages Crawler & Web Miner Query Session Semantic Message Type Classifier & Domain Specific (RDF/OWL) Message Quality Ranking (User Rank/ Post Rank) HARVESTMAN WEB SPIDER Suggestion Module (Domain & Related) NLTK Type a Post Update Search Suggest Query exist posts

Data Collection Using Resource Description Framework (RDF) to embed semantic information in to the posts. Using Web Ontology Language(OWL) to store ontology information, and XML to store description information. Using Lucene indexing format for searching and ranking database including updated results [Should have the example of system design, data collection, and data result analysis by giving two posts that are the same and go through the process of the system that will detect such duplication]

Data Result Analysis Data structure and statistical analysis Number of system and suggestion usage Newly created topics and suggestion selective ratio Number of knowledge and Information message types Output Ontology

Limitations of Study Using only one web forum as a main resource, searching in local web forum database Considering semantic information, sentence structure, and ontology representation based on only Thai language

Anticipated outcomes Fewer duplicated questions in the web forum using the implemented automated question answering system Higher quality suggestion for posting contents

Research Plan and Procedures

References [1] K. Inui, S. Abe, K. Zuo, K. Morita and C. Sao, Experience Mining: Building a Large- Scale Database of Personal Experiences and Opinions from Web Documents. IEEE/WIC/ACM International Conference on Web Inteligence and Intelligent Agent Technology,2008. [2] G. W. Chen and M. M. Chiu, Online Discussion Process: Effects of Earlier Messages'Evolutions, Knowledge Content, Social Cues and Personal Information on Later Messages. In preceeding of Sixth International Conference on Advanced Learning Technologies, [3] Zhe Chen and Dunwei Wen, A New Web-Service-Based Architecture for Question Answering. In preceeding of IEEE International Conference on Natural Language Processing and Knowledge Engineering, [4] M. Jang, J. Sohn, and H. Cho, Automated Question Answering using Semantic Web Services. IEEE Asia-Pacific Services Computer Conference, [5] N. Duhan, A. K. Shama, and K. K. Bhatia, Page Rank Algorithms: A Survey. IEEE International Adcaned Computing Conference (IACC), 2009.

Special Thanks for Resources