The Sellout: Readers Sentiment Analysis of 2016 Man Booker Prize Winner Paper ID : 748.

Slides:



Advertisements
Similar presentations
Usage statistics in context - panel discussion on understanding usage, measuring success Peter Shepherd Project Director COUNTER AAP/PSP 9 February 2005.
Advertisements

1 A Systematic Review of Cross- vs. Within-Company Cost Estimation Studies Barbara Kitchenham Emilia Mendes Guilherme Travassos.
Chapter 5: Introduction to Information Retrieval
2015 SLA IT Webinar Using Analytics to Understand Social Media Activity Michelle Chen School of Information San José State University February 4 th, 2015.
Mapping Studies – Why and How Andy Burn. Resources The idea of employing evidence-based practices in software engineering was proposed in (Kitchenham.
A Brief Overview. Contents Introduction to NLP Sentiment Analysis Subjectivity versus Objectivity Determining Polarity Statistical & Linguistic Approaches.
A Framework for Automated Corpus Generation for Semantic Sentiment Analysis Amna Asmi and Tanko Ishaya, Member, IAENG Proceedings of the World Congress.
Social network analysis of the West African Ebola Outbreak INTRODUCTION The Ebola outbreak in West Africa has drawn the World’s attention given the spread.
Sentiment Lexicon Creation from Lexical Resources BIS 2011 Bas Heerschop Erasmus School of Economics Erasmus University Rotterdam
Application of Apriori Algorithm to Derive Association Rules Over Finance Data Set Presented By Kallepalli Vijay Instructor: Dr. Ruppa Thulasiram.
Aparna Kulkarni Nachal Ramasamy Rashmi Havaldar N-grams to Process Hindi Queries.
Extracting Opinions, Opinion Holders, and Topics Expressed in Online News Media Text Soo-Min Kim and Eduard Hovy USC Information Sciences Institute 4676.
More than words: Social networks’ text mining for consumer brand sentiments A Case on Text Mining Key words: Sentiment analysis, SNS Mining Opinion Mining,
Opinion mining in social networks Student: Aleksandar Ponjavić 3244/2014 Mentor: Profesor dr Veljko Milutinović.
Evidence Based Medicine Meta-analysis and systematic reviews Ross Lawrenson.
Thanks to Bill Arms, Marti Hearst Documents. Last time Size of information –Continues to grow IR an old field, goes back to the ‘40s IR iterative process.
What Do We Know about Defect Detection Methods P. Runeson et al.; "What Do We Know about Defect Detection Methods?", IEEE Software, May/June Page(s):
A Bootstrapping Method for Building Subjectivity Lexicons for Languages with Scarce Resources Author: Carmen Banea, Rada Mihalcea, Janyce Wiebe Source:
Descriptive Statistics Prepared by: Asma Qassim Al-jawarneh Ati Sardarinejad Reem Suliman Dr. Dr. Balakrishnan Muniandy PTPM-USM.
Text Mining & NLP based Algorithm to populate ontology with A-Box individuals and object properties Alexandre Kouznetsov and Christopher J. O. Baker, University.
Software Quality in Use Characteristic Mining from Customer Reviews Warit Leopairote, Athasit Surarerks, Nakornthip Prompoon Department of Computer Engineering,
Creating Subjective and Objective Sentence Classifier from Unannotated Texts Janyce Wiebe and Ellen Riloff Department of Computer Science University of.
Information Transfer through Online Summarizing and Translation Technology Sanja Seljan*, Ksenija Klasnić**, Mara Stojanac*, Barbara Pešorda*, Nives Mikelić.
Recognizing Stances in Online Debates Unsupervised opinion analysis method for debate-side classification. Mine the web to learn associations that are.
AN EFFECTIVE STATISTICAL APPROACH TO BLOG POST OPINION RETRIEVAL Ben He Craig Macdonald Iadh Ounis University of Glasgow Jiyin He University of Amsterdam.
A Classification-based Approach to Question Answering in Discussion Boards Liangjie Hong, Brian D. Davison Lehigh University (SIGIR ’ 09) Speaker: Cho,
UWMS Data Mining Workshop Content Analysis: Automated Summarizing Prof. Marti Hearst SIMS 202, Lecture 16.
1 CS 430: Information Discovery Lecture 8 Automatic Term Extraction and Weighting.
Spam Detection Kingsley Okeke Nimrat Virk. Everyone hates spams!! Spam s, also known as junk s, are unwanted s sent to numerous recipients.
Key Stage 1 and 2 Tests 2016 Presentation to Parents and Carers Otterbourne Primary School April 2016.
Opinion spam and Analysis 소프트웨어공학 연구실 G 최효린 1 / 35.
More than words: Social network’s text mining for consumer brand sentiments Expert Systems with Applications 40 (2013) 4241–4251 Mohamed M. Mostafa Reporter.
Vince Graziano English Librarian
An Effective Statistical Approach to Blog Post Opinion Retrieval Ben He, Craig Macdonald, Jiyin He, Iadh Ounis (CIKM 2008)
Holy Trinity CE Junior School KS2 SAT Meeting Welcome & Thank you for coming.
Key Stage 1 National Curriculum Assessments. In 2014/15 a new national curriculum framework was introduced by the Government for Years 1, 3, 4 and 5.
A systematic literature review of empirical evidence on computer games and serious games Wakana Ishimaru Leo Liang.
Queensland University of Technology
Sentiment Analysis of Twitter Data
Name: Sushmita Laila Khan Affiliation: Georgia Southern University
Cheryl Ng Ling Hui Hee Jee Mei, Ph.D Universiti Teknologi Malaysia
Advanced Higher Modern Languages
Introduction to Corpus Linguistics
Sentiment analysis algorithms and applications: A survey
An assessment framework for Intrusion Prevention System (IPS)
Text Mining – Part Two “The word cloud technique first originated online in the 1990s as tag clouds (famously described as “the mullets of the Internet”),
Lecturer: Dalia Mirghani
Erasmus University Rotterdam
E-Commerce Theories & Practices
CS 430: Information Discovery
Social Knowledge Mining
A STUDY ON THE USAGE OF SOCIAL MEDIA AMONG MIZORAM UNIVERSITY STUDENTS WHO VISIT THE CENTRAL LIBRARY B. Lalhlimpuii Assistant Librarian Mizoram University.
6th grade English Language Arts Curriculum Night
Critical Analysis CHAPTER 7.
Key Stage 1 SATs Meeting.
GCSE ENGLISH LITERATURE J360
Best practices for the self-assessment
Text Analysis and Search Analytics
Text Mining & Natural Language Processing
Introduction to Text Analysis
Text Analysis and Search Analytics
Sentiment Analysis In Student Learning Experience By Obinna Obeleagu
Sentiment Analysis In Student Learning Experience By Obinna Obeleagu
Questioning and evaluating information
Section 28.2 Types, Trends, and Limitations of Marketing Research
Parents Information Meeting
Information Retrieval and Web Design
Key Stage 1 SATs Meeting.
Forecasting methods Presented by: 29 January 2014.
Reliability Vs. Validity by
Presentation transcript:

The Sellout: Readers Sentiment Analysis of 2016 Man Booker Prize Winner Paper ID : 748

INTRODUCTION Sentiment analysis also sometimes term as opinion mining refers to the use of natural language processing, text analysis, computational linguistics, and biometrics to systematically identify, extract, quantify, and study affective states and subjective information. Sentiment analysis is widely applied to understand customer using reviews and survey responses, online and social media, and healthcare materials for applications that range from marketing to customer service to clinical medicine. This paper mainly focuses on the sentiment analysis of the 2016 Man Booker prize winner book “The Sellout” in Goodreads website reviews.

Synonymous & Interchangeable Names Definition The term “sentiment” is used in reference to the automatic analysis of evaluative text and tracking of the predictive judgments. - Das and Chen Synonymous & Interchangeable Names Subjective Analysis Review Mining Opinion Mining Appraisal Extraction

Methodology The data on book reviews was collected manually for the study from Goodreads website (www.goodreads.com). Booker prize award 2016, The Sellout by Paul Beatty was selected for the study which had a total of 3,150 reviews. The first 10 pages of the reviews had been considered for the study. Each page consists of thirty reviews. Data was collected for the period of January 09, 2015 to April 25, 2017. Total of 292 reviews was collected from the Goodreads website. Reviews in English language was only selected for analysis, hence 8 reviews has been excluded from the study as it was not in English language. Out of 292 reviews, 189 reviews occurred before the announcement of the result, and 103 reviews occurred after the announcement of the result.

Reviews were analysed in R Reviews were analysed in R. Text analysis and sentiment polarity were developed in R using various functions. The extracted text preparation is clean by converting plural to singular words, removing punctuations and numbers, URL, symbols, spaces etc. before any analysis. Sentiment scores has been developed for each review using sentiment function. Positive, Neutral and Negative sentiment were calculated separately.

Total Sentiment Polarities Results Total Sentiment Polarities

Book review on “The Sellout” has more negative reviews Book review on “The Sellout” has more negative reviews. Maximum (47%) of the users have critically reviewed the novel, while 42.1% of the users have reviewed it with positive reviews. However, 10.9% of the users have reported it with neutral reviews.

Sentiment Analysis Before and After the Result

Sentiment Analysis before and after the result

Sentiment Analysis before the result Sentiment polarity of the reviews are analysed to understand whether the reviews are of positive or negative sentiment. Out of 292 reviews, 189 reviews occurred before the announcement of the result.

Sentiment Analysis after the result It was found that there was no significant change in the reviewer’s criticism, a large number of reviewers continue to review the novel critically with evidences of negative sentiment being more than the positive sentiment. Before the result of the award, there were 83 positive sentiments, 18 neutral and 88 negative. Subsequently, the number reduces to 40 positive sentiments, 14 neutral and 49 negative.

Most frequent word in the reviews The Most frequent words used in book reviews have been shown in the Table below. The first 20 words with maximum frequency have been listed below Sl. No. Word Frequency 1. book 531 2. read 279 3. black 251 4. beatty 214 5. satire 176 6. novel 164 7. race 147 8. sellout 139 9. funny 135 10. people 126

WordCloud WordCloud of frequently used words in the reviews is considered for visual understanding/analysing the data. The most occurred word in the reviews has been made as wordcloud for the easy visualization.

WordCloud

Conclusion The number of reviews for different book varies, but the reviews provide accessible and plentiful data for relatively easy analysis for a range of applications. This paper seeks to apply and extend the current work in the field of natural language processing and sentiment analysis to data retrieved from Goodreads. Positive, Neutral and Negative sentiment were calculated separately. On the whole, negative sentiment is more than the positive sentiment. Word frequencies were also formed and the most occurred word in the reviews has been tabulated.

Conclusion Public sentiments which is difficult to comprehend differs with the expert, at least in this case of Goodreads users. Popular reading differs from acclaimed or accomplish novel. Unlike Twitter predictive data, Goodreads reviews did not show any indication that the book reviews would predict award winning novel by analysing the sentiments of the readers. Hence, users driven acquisition are a popular choice among Librarian and it is believed that sentiment analysis has an important role to play.

Thank you