Presentation is loading. Please wait.

Presentation is loading. Please wait.

Intelligent Database Systems Lab Presenter : JIAN-REN CHEN Authors : Sheng-Tun Li a,b,*, Fu-Ching Tsai a 2013, KBS A fuzzy conceptualization model for.

Similar presentations


Presentation on theme: "Intelligent Database Systems Lab Presenter : JIAN-REN CHEN Authors : Sheng-Tun Li a,b,*, Fu-Ching Tsai a 2013, KBS A fuzzy conceptualization model for."— Presentation transcript:

1 Intelligent Database Systems Lab Presenter : JIAN-REN CHEN Authors : Sheng-Tun Li a,b,*, Fu-Ching Tsai a 2013, KBS A fuzzy conceptualization model for text mining with application in opinion polarity classification

2 Intelligent Database Systems Lab Outlines Motivation Objectives Methodology Experiments Conclusions Comments

3 Intelligent Database Systems Lab Motivation Most existing document classification algorithms are easily affected by ambiguous terms. The ability to disambiguate for a classifier is thus as important as the ability to classify accurately. - opinion polarity classification

4 Intelligent Database Systems Lab Objectives We propose a concept driven text classification approach based on Formal Concept Analysis (FCA) to train a classifier using concepts instead of documents, so as to reduce the inherent ambiguities. We further utilize fuzzy formal concept analysis (FFCA) to take uncertain information into consideration.

5 Intelligent Database Systems Lab Formal concept analysis Objects: {Review6,Review7} Attributes: {Phenomenal, Fantastic, Love} => formal concept positive class: ‘‘Phenomenal’’, ‘‘Fantastic’’ and ‘‘Love’’ {Review1, Review4, Review6 and Review7} neutral class: ‘‘Cover’’ {Review5} negative class: ‘‘Awful’’ {Review2, Review3}

6 Intelligent Database Systems Lab Formal concept analysis positive class: {Review1, Review4, Review6, Review7} negative class: {Review2, Review3} neutral class: {Review5}

7 Intelligent Database Systems Lab Methodology - Architecture

8 Intelligent Database Systems Lab Methodology tf-idf: Inverted Conformity Frequency (ICF): Uniformity (Uni): tf-idf > 26 ICF < log(2) Uni > 0.2

9 Intelligent Database Systems Lab Methodology

10 Intelligent Database Systems Lab Methodology

11 Intelligent Database Systems Lab Experiments - Data set and evaluation Data set:  Reuter-21578  movie review  e-book review Evaluation

12 Intelligent Database Systems Lab Experiments (parameters)

13 Intelligent Database Systems Lab Experiments

14 Intelligent Database Systems Lab Experiments (conceptualization)

15 Intelligent Database Systems Lab Experiments

16 Intelligent Database Systems Lab Experiments

17 Intelligent Database Systems Lab Conclusions FFCM successfully reduce the impact from textual ambiguity. The results from the experiments show that FFCM outperforms other state-of-the-art algorithms for both Reuters-21578 and two opinion polarity collections.

18 Intelligent Database Systems Lab Comments Advantages - the formal concepts plays an important role Disadvantage - α may differ from various datasets - only focuses on single-class classification Applications - text mining


Download ppt "Intelligent Database Systems Lab Presenter : JIAN-REN CHEN Authors : Sheng-Tun Li a,b,*, Fu-Ching Tsai a 2013, KBS A fuzzy conceptualization model for."

Similar presentations


Ads by Google