AMANDA COHEN MOSTAFAVI Applying Entity Discovery and Assignment to video games in order to mine opinions.

Slides:



Advertisements
Similar presentations
The importance of social media monitoring in the video games industry Nick
Advertisements

Data Mining Methodology 1. Why have a Methodology  Don’t want to learn things that aren’t true May not represent any underlying reality ○ Spurious correlation.
The Assembly Language Level
Problem Semi supervised sarcasm identification using SASI
TEMPLATE DESIGN © Identifying Noun Product Features that Imply Opinions Lei Zhang Bing Liu Department of Computer Science,
Chapter 13 Working with Sources. Copyright © Houghton Mifflin Company. All rights reserved.13 | 2 Chapter overview Looks at how researchers use sources.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Mining and Summarizing Customer Reviews Advisor : Dr.
Aki Hecht Seminar in Databases (236826) January 2009
Topics in AI: Applied Natural Language Processing Information Extraction and Recommender Systems for Video Games: Gameplay Krishna Achuthan, Stephanie.
Topics in AI: Applied Natural Language Processing Information Extraction and Recommender Systems for Video Games Supervised by Dr. Noriko Tomuro Fall –
CALL: Computer-Assisted Language Learning. 2/14 Computer-Assisted (Language) Learning “Little” programs Purpose-built learning programs (courseware) Using.
Evolutionary Games The solution concepts that we have discussed in some detail include strategically dominant solutions equilibrium solutions Pareto optimal.
Assignment 2 Shijing Xiong. Blog Blogs are personal Websites consisting of regularly updated entries displayed in reverse chronological order. They read.
Game Peripherals BRAD SAYE. The computer mouse Douglas Engelbart at the Stanford Research Institute (now SRI International) invented the first mouse prototype.
Purpose:  Provided students with an opportunity to learn how to improve or maintain their physical fitness through novel technological based tools instead.
Japanese Game Sales and Demographics A study of the influence of culture on the video game market.
Personalized Spam Filtering for Gray Mail Ming-wei Chang University of Illinois at Urbana-Champaign Wen-tau Yih and Robert McCann Microsoft Corporation.
Sentiment Analysis with a Multilingual Pipeline 12th International Conference on Web Information System Engineering (WISE 2011) October 13, 2011 Daniëlla.
By meg. Video game consoles How have video games consoles changed?
Mining and Summarizing Customer Reviews
Theory Of Automata By Dr. MM Alam
Computing Gaming History, Present (and future?). History of gaming.
By Brian Stoepker.  A situation in which the effect of one explanatory variable on the outcome is affected by the value of a second explanatory variable.
The Basics Title: Guitar Hero III Legends of Rock Company: Neversoft and Activision Type: Music Rhythm Price: (New) $59.99 Minimum Requirments: *Console.
Administrative Policy Writing Spring Administrative Policy Writing Spring 2011 Introduction This week we are discussing a type of public-policy.
Choosing, Mapping, and Narrowing a Topic
Climbing Mountains What it takes to make AAA games.
World of Warcraft The World of Warcraft series is only for the PC. It was developed by Blizzard Entertainment. It mostly got a 9 out of 10. More than.
1 Entity Discovery and Assignment for Opinion Mining Applications (ACM KDD 09’) Xiaowen Ding, Bing Liu, Lei Zhang Date: 09/01/09 Speaker: Hsu, Yu-Wen Advisor:
Nachos Phase 1 Code -Hints and Comments
Writing a Book Review Danika Rockett University of Baltimore Summer 2009.
CSC 213 – Large Scale Programming Lecture 2: Object-Oriented Analysis & Object-Oriented Design.
The Developmental Reading & English Placement Test
Using Text Mining and Natural Language Processing for Health Care Claims Processing Cihan ÜNAL
CHRISTOPHER RHODES UHD-REU 17 JULY 2009 Clinical Free Processing.
RESPRO Area 1C Area 1C RESPRO Meeting RESPRO Area 1C November 24, 2009.
Querying Structured Text in an XML Database By Xuemei Luo.
CSC 395 – Software Engineering Lecture 13: Object-Oriented Analysis –or– Let the Pain Begin (At Least I’m Honest!)
A Graph-based Friend Recommendation System Using Genetic Algorithm
1 §3.1 Implicit Differentiation The student will learn about implicit differentiation.
ECE450 - Software Engineering II1 ECE450 – Software Engineering II Today: Design Patterns IX Interpreter, Mediator, Template Method recap.
VIDEO GAME experts INDUSTRY Some of our clients. We translate video game scripts, localize game UI and provide multilingual voice acting services in over.
Bash ’em Smash ’em Violent Video Games and Their Effects on Aggression By: Katrina Deutsch, Whitney Evans, and Rachel Schloss For: Communications 111.
,, By Infinity Ward and Activision, Released November 9, 2007 in France on PC, PS3, Mac, Xbox, Nintendo, Under 16 Under 16, Over 15 million games sold.
BY : Sukhman Bhathal Grade 9 Block:3 IT9. Starting The XBOX 360 is a video game console made by microsoft and it is the second console in the XBox series.
How did these popular video game brands evolve and turn into how they are today?
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Using Text Mining and Natural Language Processing for.
INTRODUCTION TO COMPILERS(cond….) Prepared By: Mayank Varshney(04CS3019)
Video Games By: Trenton 12/18/2015. Next Generation Game Systems Xbox 360 PlayStation 3 Nintendo Wii Arcade- $ GB- $249.99$ Pro- $ GB-
Video Games. . One Of The Top Systems Of The Year (Along With The Playstation 3). This Amazing System Has Over Titles And Amazing Futures. One Of.
YouTube.com Optimize For SEO. This is a guide to setting up each video to Maximize it’s potential reach through Search Engine Optimization These steps.
ITCS 6265 Details on Project & Paper Presentation.
The Quality of Peoples Opinions ‘From Coffee Shop Talk to Referenced Opinion’ Writing a Positional paper.
Short Video Metadata Acquisition Game Aleš Mäsiar, Jakub Šimko
Extracting and Ranking Product Features in Opinion Documents Lei Zhang #, Bing Liu #, Suk Hwan Lim *, Eamonn O’Brien-Strain * # University of Illinois.
Abstracting.  An abstract is a concise and accurate representation of the contents of a document, in a style similar to that of the original document.
Opinion Observer: Analyzing and Comparing Opinions on the Web WWW 2005, May 10-14, 2005, Chiba, Japan. Bing Liu, Minqing Hu, Junsheng Cheng.
PAIR project progress report Yi-Ting Chou Shui-Lung Chuang Xuanhui Wang.
Multi-Class Sentiment Analysis with Clustering and Score Representation Yan Zhu.
Skills for change Hot off the press! How to get media coverage.
Video Games Add pics. Then And Now SabrinaBoulay.
Lecture 2 Introduction to Programming
History of x-box 360 By:cajan.
Web Data Extraction Based on Partial Tree Alignment
5S Numbers Game The purpose of this game is to demonstrate the impact the way our workplace is arranged impacts on how the work is done.
Presentation by Julie Betlach 7/02/2009
R.Rajkumar Asst.Professor CSE
Social media as market research
PURE Learning Plan Richard Lee, James Chen,.
TECHNICAL REPORTS WRITING
Presentation transcript:

AMANDA COHEN MOSTAFAVI Applying Entity Discovery and Assignment to video games in order to mine opinions

Project Purpose Many differing opinions on a video game’s quality, difficult to determine general opinion Usually look to professional video game reviews Can gather review scores, normalize and average score in order to determine general consensus – Done on GameRankings.com However, this ignores the discussion by everyday players – Debate takes place most commonly on message boards

Project Purpose Solution: Mine opinions expressed on message board posts and derive a consensus from the results Using the algorithm for entity discovery and assignment and opinion mining as defined in this paper:  Entity Discovery and Assignment for Opinion Mining Applications. Xiaowen Ding, Bing Liu, Lei Zhang. SIGKDD, 2009

Goal: To mine opinions on selected games expressed on video game message boards, derive an average opinion and compare results to the review scores gathered by GameRankings.com

Games Total games examined: 10 All released in were top-selling games of the year, according to the NPD group (market research group that studies the video game industry, among other things) 5 are among the highest reviewed games according to GameRankings.com Ensures a mix of critically and commercially successful Note: Duplicate Games are removed

Games High selling – Halo 3 (360, Microsoft) million – Wii Play with Wii Remote (Wii, Nintendo) million – Call of Duty 4: Modern Warfare (360, Activision) million – Guitar Hero III: Legends of Rock (PS2, Activision) million – Super Mario Galaxy (Wii, Nintendo) million Highly Reviewed – The Orange Box (PC, Xbox 360) – 96% – BioShock (PC, Xbox 360) – 94% – Elder Scrolls IV: Oblivion (PS3) - 92% – God of War II (PS2) – 92% – Team Fortress 2 (PC) – 92%

Game Issues Alternate Names: Games are often referenced by shorthand or abbreviation Solution: include an array of possible alternate names in defining the entity object

Message Boards Principally from video game websites, or websites with large portions devoted to video games Looking at comments in relation to articles about top selling games or reviews to ensure that the posts are relevant to the games  Lots of comparative statements as well

Message Board Posts 1UP.com: 26 posts Gamespot.com: 14 posts IGN.com: 20 posts Total: 60 posts

Post Issues Unusual ways of expressing opinions: message board posters may not express their opinions in the same way as someone writing a review would. For instance:  “Call of Duty 4 was a very good game” <- this sentence would make for very easy opinion mining  “COD4 IS TEH WIN, OMG!!!!111” <- more likely on a message board, and much harder to mine Solution: The opinion mining algorithm allows for “opinion grammar”. More later…

The Process Implements Entity Discovery and assignment algorithm, with a couple modifications:  Entity discovery section reduced to better fit purposes of the project  Ordinarily would use pattern mining in order to find entities, not an issue in this case since there are a predetermined set of games examined

Data preprocessing Each word in every post is given a part-of-speech tag  Designates the grammatical role of each word  Done with Stanford’s POS tagger, developed by the Stanford Natural Language Processing group  A list of the entities used are created, and their alternate names are define

Entity Discovery and Assignment Each post is parsed to separate sentences and find each entity If entity is found, and matches the game title, the entity is assigned to that sentence If there is no entity, the entity of the previous sentence is assigned  Works on the assumption that when someone starts talking about an entity, subsequent sentences deal with the same entity without explicitly stating it If an alternate name for the entity is found, it is replaced with the original title to reduce future processing time

Opinion Grammar The original authors suggest that hard-coding every possible opinion words is not recommended Instead, they suggest using a system to define grammar that will pick out opinion words and statements A combination of hard-coded word list and grammar rules were used for this project  Hard coded words for regular English grammar, defined rules for more unexpected words and phrases

Indicator Word Symbols Po: Positive Ne: Negative Neu: Neutral Ng: Negation But: But-like

Opinion Mining Step 1: Apply indicator word symbols Step 2: Apply phrase rules Step 3: Search for negations, and change the opinion of the subsequent word (if it was positive, it would be negative and vice versa) Step 4: Aggregate opinions  Search for indicators, Po = 1, Ne = -1, Neu = 0

Comparative Sentences If a sentence has more than one entity, it is a comparative sentence  This sentence compares one entity to another, i.e. “Game-A is better than Game-B” In order to find the superior and inferior entities, look for comparative or superlative words (according to POS tags) and whether it’s a positive or negative word  If negative, the entity after the comparative word is superior. If positive the entity before the comparative word is superior

Up next: demo and results…