Intelligent Database Systems Lab N.Y.U.S.T. I. M. 1 Mining knowledge from natural language texts using fuzzy associated concept mapping Presenter : Wu,

Slides:



Advertisements
Similar presentations
Modelling with expert systems. Expert systems Modelling with expert systems Coaching modelling with expert systems Advantages and limitations of modelling.
Advertisements

CILC2011 A framework for structured knowledge extraction and representation from natural language via deep sentence analysis Stefania Costantini Niva Florio.
Dialogue – Driven Intranet Search Suma Adindla School of Computer Science & Electronic Engineering 8th LANGUAGE & COMPUTATION DAY 2009.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology A novel document similarity measure based on earth mover’s.
April 22, Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Doerre, Peter Gerstl, Roland Seiffert IBM Germany, August 1999 Presenter:
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Shared Ontology for Knowledge Management Atanas Kiryakov, Borislav Popov, Ilian Kitchukov, and Krasimir Angelov Meher Shaikh.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Huimin Ye.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Drew DeHaas.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Quality evaluation of product reviews using an information.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Fast exact k nearest neighbors search using an orthogonal search tree Presenter : Chun-Ping Wu Authors.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 An Efficient Concept-Based Mining Model for Enhancing.
Learning Object Metadata Mining Masoud Makrehchi Supervisor: Prof. Mohamed Kamel.
Research Information System for Materials - Database, Simulation and Knowledge Toshihiro Ashino Toyo University
Defining Text Mining Preprocessing Transforming unstructured data stored in document collections into a more explicitly structured intermediate format.
Using Text Mining and Natural Language Processing for Health Care Claims Processing Cihan ÜNAL
Intelligent Database Systems Lab Presenter : YAN-SHOU SIE Authors : JEROEN DE KNIJFF, FLAVIUS FRASINCAR, FREDERIK HOGENBOOM DKE Data & Knowledge.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Evaluation of novelty metrics for sentence-level novelty mining Presenter : Lin, Shu-Han Authors : Flora.
Intelligent Database Systems Lab Presenter : WU, MIN-CONG Authors : Jorge Villalon and Rafael A. Calvo 2011, EST Concept Maps as Cognitive Visualizations.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 A Comparison of SOM Based Document Categorization Systems.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. A Web 2.0-based collaborative annotation system for enhancing knowledge sharing in collaborative learning.
Intelligent Database Systems Lab Presenter : JHOU, YU-LIANG Authors :Shady Shehata, Fakhri Karray, Mohamed S. Kamel, Fellow 2012, IEEE An Efficient Concept-Based.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Extracting meaningful labels for WEBSOM text archives Advisor.
Intelligent Database Systems Lab Presenter : JIAN-REN CHEN Authors : Sheng-Tun Li a,b,*, Fu-Ching Tsai a 2013, KBS A fuzzy conceptualization model for.
1 Learning Sub-structures of Document Semantic Graphs for Document Summarization 1 Jure Leskovec, 1 Marko Grobelnik, 2 Natasa Milic-Frayling 1 Jozef Stefan.
Research Topics CSC Parallel Computing & Compilers CSC 3990.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Automatic Recommendations for E-Learning Personalization.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. An IPC-based vector space model for patent retrieval Presenter: Jun-Yi Wu Authors: Yen-Liang Chen, Yu-Ting.
Intelligent Database Systems Lab Presenter : Kung, Chien-Hao Authors : Yoong Keok Lee and Hwee Tou Ng 2002,EMNLP An Empirical Evaluation of Knowledge Sources.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Learning Phonetic Similarity for Matching Named Entity.
Towards the Semantic Web 6 Generating Ontologies for the Semantic Web: OntoBuilder R.H.P. Engles and T.Ch.Lech 이 은 정
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Development of a reading material recommendation system based on a knowledge engineering approach Presenter.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Externally growing self-organizing maps and its application to database visualization and exploration.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 An Adaptation of the Vector-Space Model for Ontology-Based.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 2007.SIGIR.8 New Event Detection Based on Indexing-tree.
Intelligent Database Systems Lab Presenter : Chang,Chun-Chih Authors : CHRISTOS BOURAS, VASSILIS TSOGKAS 2012, KBS A clustering technique for news articles.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Using Text Mining and Natural Language Processing for.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Unsupervised word sense disambiguation for Korean through the acyclic weighted digraph using corpus and.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Psychiatric document retrieval using a discourse-aware model Presenter : Wu, Jia-Hao Authors : Liang-Chih.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. 1 Visualization of multi-algorithm clustering for better economic decisions - The case of car pricing.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Mining massive document collections by the WEBSOM method Presenter : Yu-hui Huang Authors :Krista Lagus,
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Semantic segment extraction and matching for Internet.
Commonsense Reasoning in and over Natural Language Hugo Liu, Push Singh Media Laboratory of MIT The 8 th International Conference on Knowledge- Based Intelligent.
TWC Illuminate Knowledge Elements in Geoscience Literature Xiaogang (Marshall) Ma, Jin Guang Zheng, Han Wang, Peter Fox Tetherless World Constellation.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. 1 Mining concept maps from news stories for measuring civic scientific literacy in media Presenter :
Intelligent Database Systems Lab N.Y.U.S.T. I. M. 1 Identifying Domain Expertise of Developers from Source Code Presenter : Wu, Jia-Hao Authors : Renuka.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Towards comprehensive support for organizational mining Presenter : Yu-hui Huang Authors : Minseok Song,
Chapter 7 K NOWLEDGE R EPRESENTATION, O NTOLOGICAL E NGINEERING, AND T OPIC M APS L EO O BRST AND H OWARD L IU.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Providing Justifications in Recommender Systems Presenter.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Automatic Document Indexing in Large Medical Collections.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology ACM SIGMOD1 Subsequence Matching on Structured Time Series.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Mining Advisor-Advisee Relationships from Research Publication.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Community self-Organizing Map and its Application to Data Extraction Presenter: Chun-Ping Wu Authors:
Intelligent Database Systems Lab Presenter : JHOU, YU-LIANG Authors : Jae Hwa Lee, Aviv Segev 2012 CE Knowledge maps for e-learning.
Intelligent Database Systems Lab Presenter : YU-TING LU Authors : Hsin-Chang Yang, Han-Wei Hsiao, Chung-Hong Lee IPM Multilingual document mining.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Named Entity Disambiguation by Leveraging Wikipedia Semantic Knowledge Presenter : Jiang-Shan Wang Authors.
Sentence Similarity Based on Semantic Nets and Corpus Statistics
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Enhancing Text Clustering by Leveraging Wikipedia Semantics.
Developing a diagnostic system through integration of fuzzy case-based reasoning and fuzzy ant colony system Expert Systems with Applications 28(2005)
Intelligent Database Systems Lab N.Y.U.S.T. I. M. 1 Ubiquitous learning website: Scaffold learners by mobile devices with information-aware techniques.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. A method of extracting malicious expressions in bulletin board systems by using context analysis Presenter:
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Boosting the Feature Space: Text Classification for Unstructured.
Intelligent Database Systems Lab N.Y.U.S.T. I. M. Learning Portfolio Analysis and Mining for SCORM Compliant Environment Pattern Recognition (PR, 2010)
Intelligent Database Systems Lab Presenter: YU-TING LU Authors: Yong-Bin Kang, Pari Delir Haghighi, Frada Burstein ESA CFinder: An intelligent key.
Using lexical chains for keyword extraction
Guangbing Yang Presentation for Xerox Docushare Symposium in 2011
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Describing Documents Ch3 in textbook Organizing Knowledge: An
Presentation transcript:

Intelligent Database Systems Lab N.Y.U.S.T. I. M. 1 Mining knowledge from natural language texts using fuzzy associated concept mapping Presenter : Wu, Jia-Hao Authors : W.M. Wang, C.F.Cheung,W.B. Lee, S.K. Kwork IPM (2008) ˜

Intelligent Database Systems Lab N.Y.U.S.T. I. M. 22 Outline Motivation Objective Methodology Experiments Conclusion Comments

Intelligent Database Systems Lab N.Y.U.S.T. I. M. 3 Motivation The amount of data of all kinds available electronically is increasing dramatically.  In the enterprises, about 80-98% of all data is consists of unstructured or semi-structured documents. Knowledge presented in may documents has an informal, unstructured shape.  It has to be converted to a formal shape, with precisely defined syntax and semantics. (ex: document annotations)

Intelligent Database Systems Lab N.Y.U.S.T. I. M. 4 Objective Extracting the propositions in text so as to construct a concept map automatically.  The technique, Fuzzy Association Concept Mapping (FACM), is consists of a linguistic module and a recommendation module. Provides a method which can be easily convert by computer.  Users can convert scientific and short texts into a structured format.  Provides knowledge workers with extra time to rethink their written text and to view their knowledge from another angle.

Intelligent Database Systems Lab N.Y.U.S.T. I. M. 5 Objective (Cont.)

Intelligent Database Systems Lab N.Y.U.S.T. I. M. 6 Methodology-FACM The relations and concepts are generated from the document itself rather than retrieved from predefined ontologies.  It uses the syntactic structure of the sentences to find relations between the words. An anaphoric resolution is applied based on rule-based reasoning (RBR) and case-based reasoning (CBR) for solving ambiguities arising during the syntactic analysis.  This enables a dynamic method of anaphoric resolution that is continually improved.

Intelligent Database Systems Lab N.Y.U.S.T. I. M. 7 Methodology-Architecture of FACM. Step 1.Input the Sentence. Step 2.Parsing by POS tagger. Step 3.Case encoding Step 4.Produce the Solution.

Intelligent Database Systems Lab N.Y.U.S.T. I. M. 8 Methodology-FACM’s Anaphora resolution The similarity between the new case and old cases is calculated based on nearest neighbor matching. (1) (2)

Intelligent Database Systems Lab N.Y.U.S.T. I. M. 9 Methodology-Proposition recommendation The normalized frequency of concept i and concept j co- existing in the same or adjacent sentence is calculated:

Intelligent Database Systems Lab N.Y.U.S.T. I. M. 10 Methodology-the relationship between concepts. (a) (b) (c) IF the normalized frequency of two concepts co-existing in the same sentence is High, THEN the relationship between the two concepts is High(0.7). IF the normalized frequency of two concepts co-existing in the adjacent sentence is High, THEN the relationship between the two concepts is Medium(0.2). The COG of fuzzy set A on the interval a 1 to a 2 with membership function u A is given:

Intelligent Database Systems Lab N.Y.U.S.T. I. M. 11 Experiments-SCI abstracts & News from CNET

Intelligent Database Systems Lab N.Y.U.S.T. I. M. 12 Experiments-Results of algorithm evaluation

Intelligent Database Systems Lab N.Y.U.S.T. I. M. 13 Conclusion Provides an interactive way for concept map builders.  Rethink their concept maps.  Adapt and Refine the suggestions for completing the concept maps. A human-like construction of concept maps can be achieved.  The highly accurate for use in extracting concepts from scientific and short texts such as abstract databases, news groups, s, discussion forums, etc. Future work  The system should be evaluated on bigger collections with more candidate users.  The evaluation of the interactive process of the framework is also an essential element.  Qualitative methods may be used to evaluate the effectiveness of the recommendation process.

Intelligent Database Systems Lab N.Y.U.S.T. I. M. 14 Comments Advantage  The convenient mining knowledge method. Drawback  How to use the equation to produce the concept map. Application  To analyze Abstract.