Collecting Evaluative Expression for Opinion Extraction Nozomi Kobayasi, Kentaro Inui, Yuji Matsumoto (Nara Institute) Kenji Tateishi, Toshikazu Fukushima.


1 Collecting Evaluative Expression for Opinion Extraction Nozomi Kobayasi, Kentaro Inui, Yuji Matsumoto (Nara Institute) Kenji Tateishi, Toshikazu Fukushima (NEC Internet System Lab) IJCNLP 2004 Lun Wei Ku, 2005/04/21

2 What are they going to do? The seats are very comfortable and supportive. But the back seat room is tight.

3 Related Work Classify reviews into recommended or not recommended. Positive sentences and negative sentences. Acquiring subjective words – adjectives, nouns, verbs and adverbs. Using patterns

4 Related Work 1. Bing Liu, Minqing Hu and Junsheng Cheng. "Opinion Observer: Analyzing and Comparing Opinions on the Web". To appear in Proceedings of the 14th International World Wide Web Conference (WWW-2005), May 10-14, 2005, in Chiba, Japan. 2. "Mining and summarizing customer reviews". Proceedings of the ACM SIGKDD 2004

5 Attribute and Value Take orientation as a special type of Value (I like the leather seats of Product_X) of is
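The subject/attribute/value view on this slide can be written down as a small record type. This is only a sketch: the class name `Opinion` and its field names are illustrative, and the way the example sentence is split into the three slots is an assumption, since the slide does not spell it out.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Opinion:
    """One evaluative expression as a (subject, attribute, value) triplet."""
    subject: str    # the product or entity being evaluated
    attribute: str  # the aspect of the subject being discussed
    value: str      # the evaluation; orientation words such as "like" count as values


# Assumed reading of "I like the leather seats of Product_X".
example = Opinion(subject="Product_X", attribute="leather seats", value="like")
```

Treating "like" as a value rather than a separate polarity label follows the slide's point that orientation is handled as a special type of Value.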

6 Collecting Expressions Iterate the following two steps: Candidate generation: –Web documents –Cooccurrence patterns –Subject/attribute/value dictionary –Cooccurrence Candidate selection: –Human judge –Update dictionaries
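The two-step loop above can be sketched as follows. Everything here is a simplified assumption rather than the authors' implementation: the single English pattern "the <attribute> is <value>", the function names, and the `judge` callback that stands in for the human judge are all hypothetical.

```python
import re
from collections import Counter

# Hypothetical cooccurrence pattern "the <attribute> is <value>".
PATTERN = re.compile(r"\bthe (\w+) is (\w+)\b")


def generate_candidates(sentences, attributes, values):
    """Count unknown words that cooccur with a known dictionary entry."""
    new_attrs, new_vals = Counter(), Counter()
    for sentence in sentences:
        for attr, val in PATTERN.findall(sentence.lower()):
            if attr in attributes and val not in values:
                new_vals[val] += 1    # known attribute suggests a new value
            elif val in values and attr not in attributes:
                new_attrs[attr] += 1  # known value suggests a new attribute
    return new_attrs, new_vals


def collect(sentences, attributes, values, judge, rounds=3, top_k=5):
    """Alternate candidate generation and selection, updating the dictionaries."""
    attributes, values = set(attributes), set(values)
    for _ in range(rounds):
        new_attrs, new_vals = generate_candidates(sentences, attributes, values)
        ok_attrs = {w for w, _ in new_attrs.most_common(top_k) if judge("attribute", w)}
        ok_vals = {w for w, _ in new_vals.most_common(top_k) if judge("value", w)}
        if not ok_attrs and not ok_vals:
            break  # nothing new was accepted in this round
        attributes |= ok_attrs
        values |= ok_vals
    return attributes, values
```

Because accepted entries are fed back into the dictionaries, a value found in one round (for example through a known attribute) can surface new attributes in the next round.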

7 Collecting Expressions -- Example Pattern: is Sentences: –… is and … –… is … Provide only highly ranked candidates to the human judge.

8 Experiment Resources Domain: cars and video games 15,000 reviews (230,000 sentences) for cars and 9,700 reviews (90,000 sentences) for games. Dictionaries: –Subject: 389 for cars (“BMW”, “TOYOTA”) and 660 for games (“Dark Chronicle”, “Seaman”)

9 Experiment Resources –Attribute: 7 for both domains. (cost/price/service/performance/function/support/design) –Value: using thesaurus, 247 mostly adjectives. (good/beautiful/bright/like/favorite/high) –Patterns: select 8 patterns, decide which pattern to use according to POS. Scores are given to these patterns.
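One way pattern scores could be used to rank candidates before they reach the human judge is to sum, for each candidate, the scores of the patterns that matched it. The slides do not give the actual scoring formula, so the additive scheme, the function name `rank_candidates`, and the pattern identifiers below are assumptions for illustration only.

```python
from collections import defaultdict


def rank_candidates(matches, pattern_scores, top_k=3):
    """Rank candidates by the summed score of the patterns that matched them.

    matches: iterable of (candidate, pattern_id) pairs, one per pattern match.
    pattern_scores: mapping from pattern_id to a numeric score (assumed given).
    """
    totals = defaultdict(float)
    for candidate, pattern_id in matches:
        totals[candidate] += pattern_scores.get(pattern_id, 0.0)
    # Highest total first; ties are broken alphabetically for determinism.
    ranked = sorted(totals.items(), key=lambda item: (-item[1], item[0]))
    return ranked[:top_k]
```

Only the top `top_k` entries would then be shown to the judge, which keeps the manual effort small relative to the number of raw matches.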

10 Results

11 Discussions No convergence: compound expressions Coverage: 45% (car), 35% (game)
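A coverage figure of this kind can be computed as the fraction of reference expressions that also appear in the collected dictionary. The slide does not define coverage precisely, so this definition and the function name `coverage` are assumptions, and the words in the usage example are made up.

```python
def coverage(collected, reference):
    """Fraction of reference expressions that appear in the collected set."""
    reference = set(reference)
    if not reference:
        return 0.0  # avoid division by zero on an empty reference set
    return len(set(collected) & reference) / len(reference)


# Made-up example: 2 of 4 reference expressions were collected.
share = coverage({"comfortable", "tight"}, {"comfortable", "tight", "noisy", "roomy"})
```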

12 Discussions Value patterns outperformed attribute patterns. –Values cooccur not only with attributes, but also with named entities and general nouns. –There are problems in deciding attribute scope. Character Face character Motion character

13 Discussions

14 Conclusions A semi-automatic method based on cooccurrence patterns of subjects, attributes and values. More efficient than manual collection. Cooccurrence patterns work well across different domains. Future work: directly extract triplets from the Web.

